Roberto  D.

Roberto D.

Young professional working towards a diploma in Data Science with experience in Data Engineering.

, Brazil

Experience: 1 Year


Young professional working towards a diploma in Data Science with experience in Data Engineering.

6005.52 USD / Year

  • Immediate: Available

1 Year

Now you can Instantly Chat with Roberto !

About Me

  • Young professional working towards a diploma in Data Science with experience in Python. Focused in Pyspark, Database Administration, SQL, No-SQL, Airflow, OOP, AWS Cloud, Docker, ETL, Django, Data Modeling with Dash Plotly and Command Line<...

Show More

Portfolio Projects





In this project,

  1. I used speedtest-CLI (Linux software) to collect data about the internet velocity of my residence and used a scheduler with the Cron(Linux Software) to repeat the task periodically.

  2. Then I organized the output data to be ingested into a dataset in a CSV format using the Pandas (Python module).

  3. From the dataset, I produce an interactive graph about my upload and download rate using the Plotly (Python module).

  4. And finally, I share the graph on the internet through the S3(AWS Simple Cloud Storage ) using the Boto3 (Python module).

link to see the output on AWS SERVER: GRAPH

To use the code for the first time:

  1. First install the requirements, with "" using bash.

  2. Secondoly, create the dataset with "" using python.

Use the "" to run:


To collect data about your internet speed.


To populate the database with the new data.


To vizualize the models in the dashboard.

To see the dashboard access this address "" in your web-browser.

You also can vizualize the database in terminal with the "".

Show More Show Less





General description

  • Project made by me in my internship on Minstry of Communications of Brazil, to integrate spreadsheets from different sectors linked to GESAC Financial Control (WIFI-BRAZIL that aims to bring the connection to public schools).

Final gol:

  • Pass the data to the PowerBI. To create a model that can integrate different sectors linked to GESAC Financial Control.



  • Control of Parliamentary Amendments


  • Control of Credit Note and Commitment

Controle de Empenhos e NC 2022.xlsm


  • Business Understanding: preparation of formal documents for understanding the data based on the initial stages of CRISP-DM management. Data dictionary creation.

  • Make recommendations, so that future data could be inserted atomically following database normalizations.

  • Advanced data ingestion for reading excel files with different formats.

  • Create a primary key for the worksheets with the names of Deputies, and Senators to circumvent the inconsistency of the data entered by different professionals from the Ministry of Communications. To then do the modeling of Financial Control within PowerBI.

  • Create an algorithm that would make a comparison of entities to fill in the civil name of all Parliamentarians. This filling was done based on a control of all the different parliamentary names given in a dataset with the civil name of each one (Spreadsheet: Proponentes.xlsx), a bank created by me from an extraction of the Open Government Data.

Public Data


Show More Show Less

System And MyScripts


System And MyScripts



Here I put together my main scripts made by me. DESCRIPTION OF THE FILES and FOLDERS:


calcalus_2: this project that I created to help solve problems on Calculus_2 using "Sympy", a Python module for advanced math operations such as integrals and deliveries.


In this project, I tested an open-source app for running, and use its data in the gpx to explore information characteristics and produce graphs using "mplleaflet" and "folium", Python modules.


This project has its separate repository outside of here. You can find it in the link below:


In this project I made a program to help my girlfriend learn English, listening, conjugating verbs, and spelling. While practicing typing on the keyboard. For this I used:

-gtts and Playsound (Python Modules) - to reproduce the sound of the words.

-os (Python Modules) - to control the file system.

-pynput (Python Modules) - to control the input on the keyboard

-coloram (Python Modules) -to produce colors on the terminal, indicating in which part of the text the person is while it typing its spelling.

-random (Python Modules) - to pick a random word in the data frame for the code execute.

Show More Show Less