Diogo Toledo
@diogotoledo
Data Engineer building scalable PySpark and AWS pipelines for business-ready analytics.
What I'm looking for
I'm a passionate Data Engineer with a strong track record designing and operating end-to-end data pipelines using PySpark and AWS to transform raw data into business-ready insights.
At ALELO I build EMR-based data pipelines across Bronze, Silver and Gold layers, orchestrate workflows with Apache Airflow, and use S3 and Athena to deliver reliable analytics. I monitor and track tasks through Azure DevOps and participate actively in agile ceremonies.
Previously I developed ETL pipelines with Informatica PowerCenter, optimized advanced SQL across Oracle and SQL Server, and created Power BI dashboards to support public health decision-making. I also collaborated on machine learning pipelines using Python libraries such as Pandas and Scikit-learn.
I hold postgraduate training in Data Engineering and Architecture and bring cross-cultural experience from living in the U.S.; I'm open to remote international roles where I can contribute to cloud migrations, modern data stack architecture, and high-impact data products.
Experience
Work history, roles, and key accomplishments
Data Engineer
Extractta
Jun 2024 - Present (1 year 4 months)
Develop and manage scalable PySpark data pipelines across Bronze/Silver/Gold layers, orchestrate Airflow workflows on AWS (S3, EMR, Athena), and maintain pipeline reliability to support ALELO's analytics needs.
Data Engineer/Analyst
Prefeitura de Belo Horizonte
Aug 2022 - May 2024 (1 year 9 months)
Developed and optimized advanced SQL ETL processes (Oracle, SQL Server) and Informatica PowerCenter pipelines, and created Power BI dashboards to improve public health data accessibility and reporting.
Data Scientist Jr.
Boreal Fintech
Oct 2021 - Aug 2022 (10 months)
Collaborated on analytics and ML pipelines, performed EDA and modeling using Python, and supported deployment and monitoring of end-to-end machine learning solutions to inform business decisions.
Education
Degrees, certifications, and relevant coursework
IGTI
Postgraduate Degree, Data Processing
2024 - 2025
Postgraduate degree in Data Processing and Data Processing Technology/Technician completed between April 2024 and February 2025.
Faculdade Pitágoras
Technologist, Data Science
2021 - 2023
Technologist program in Data Science within Information Technology completed from July 2021 to December 2023.
Centro Universitário UNA
Bachelor of Business Administration, Business Administration
2010 - 2016
Bachelor in Business Administration with emphasis in International Trade completed from 2010 to 2016.
Colégio COTEMIG
Technical Course, Management Informatics
2001 - 2003
Technical integrated course in Management Informatics completed from 2001 to 2003.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Diogo?
You can contact Diogo and 90k+ other talented remote workers on Himalayas.
Message DiogoFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
