Guilherme Santos
@guilhermesantos3
Senior Data Engineer building governed Azure/Databricks lakehouses, automated pipelines, and REST data services.
What I'm looking for
I’m a Senior Data Engineer with 4+ years designing and operating enterprise-scale Lakehouse platforms on Azure and Databricks, delivering end-to-end automation in regulated financial environments. I combine strong ETL/pipeline engineering with REST API development and full CI/CD deployment to make data reliable, governed, and usable. I’ve also published applied deep learning research (2024) and I’m currently pursuing an M.Eng. focused on signal processing and computer vision.
At BRQ Digital Solutions, I led the greenfield build of a centralized cloud Lakehouse for non-financial KPIs at one of Latin America’s largest banks. I designed and operated batch pipelines with Databricks (PySpark), Python, and Azure Data Factory, reducing manual reporting effort by ~60% through pipeline automation and data consolidation across 8+ business domains. I implemented Medallion Architecture with data quality validation, governance standards, and CI/CD workflows using Git and GitHub Actions, and I built REST APIs and optimized Delta Lake data models for analytical consumption.
Previously at LexisNexis Risk Solutions, I progressed from Data Engineer I to III (ETL & Pipeline Engineering), taking on pipeline architecture ownership, cross-team technical support, and junior mentoring. I engineered production ingestion pipelines processing ~3–4TB across 8 sources, led storage/performance optimization for a strategic client in the Brazilian credit bureau sector (tables up to 12TB; ~80TB data lake), and reclaimed ~15% of disk capacity via deduplication—addressing recurring 100% disk saturation incidents. I also delivered LGPD-mandated data removal and workflow adaptations for alphanumeric CNPJ identifiers, and served as a cross-team resource for ETL diagnosis and root-cause analysis.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
BRQ Digital Solutions
Oct 2025 - Present (6 months)
Leading the greenfield build of a centralized cloud lakehouse for non-financial KPIs at a major Latin American bank. Designing Databricks (PySpark) batch pipelines and Delta Lake models, automating ~60% of manual reporting across 8+ business domains, and building REST APIs and data access layers.
Data Engineer I–III
LexisNexis Risk Solutions
Jun 2021 - Oct 2025 (4 years 4 months)
Designed, maintained, and evolved production data ingestion pipelines processing ~3–4TB across 8 structured source systems, progressing from Engineer I to III over 4 years. Improved reliability for a major client by reclaiming ~15% of disk capacity via deduplication (~100% disk saturation incidents resolved) and implemented LGPD-mandated ingestion changes for alphanumeric CNPJ identifiers.
Full Stack Developer
Radocc Softwares
Nov 2019 - May 2021 (1 year 6 months)
Developed R&D for agricultural technology products using ReactJS/TypeScript and Java, and built mobile features with Flutter. Designed embedded systems and assembled PCB circuits for IoT-based agricultural monitoring applications.
Education
Degrees, certifications, and relevant coursework
Universidade Tecnológica Federal do Paraná
Master of Engineering, Production Systems Management
2024 -
Pursuing an M.Eng. in Production Systems Management with research in video-based rPPG estimation, signal processing, and computer vision for physiological monitoring.
Universidade Tecnológica Federal do Paraná
Bachelor of Engineering, Computer Engineering
2016 - 2021
Earned a B.Eng. in Computer Engineering from UTFPR.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
app.sel7.com.brJob categories
Skills
Interested in hiring Guilherme?
You can contact Guilherme and 90k+ other talented remote workers on Himalayas.
Message GuilhermeFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
