HimalayasHimalayas logo
Guilherme SantosGS
Open to opportunities

Guilherme Santos

@guilhermesantos3

Senior Data Engineer building governed Azure/Databricks lakehouses, automated pipelines, and REST data services.

Brazil
Message

What I'm looking for

I’m looking for 100% remote opportunities worldwide where I can build governed Azure/Databricks lakehouses, automate reliable pipelines, deliver REST data services, and grow my work in signal processing and computer vision.

I’m a Senior Data Engineer with 4+ years designing and operating enterprise-scale Lakehouse platforms on Azure and Databricks, delivering end-to-end automation in regulated financial environments. I combine strong ETL/pipeline engineering with REST API development and full CI/CD deployment to make data reliable, governed, and usable. I’ve also published applied deep learning research (2024) and I’m currently pursuing an M.Eng. focused on signal processing and computer vision.

At BRQ Digital Solutions, I led the greenfield build of a centralized cloud Lakehouse for non-financial KPIs at one of Latin America’s largest banks. I designed and operated batch pipelines with Databricks (PySpark), Python, and Azure Data Factory, reducing manual reporting effort by ~60% through pipeline automation and data consolidation across 8+ business domains. I implemented Medallion Architecture with data quality validation, governance standards, and CI/CD workflows using Git and GitHub Actions, and I built REST APIs and optimized Delta Lake data models for analytical consumption.

Previously at LexisNexis Risk Solutions, I progressed from Data Engineer I to III (ETL & Pipeline Engineering), taking on pipeline architecture ownership, cross-team technical support, and junior mentoring. I engineered production ingestion pipelines processing ~3–4TB across 8 sources, led storage/performance optimization for a strategic client in the Brazilian credit bureau sector (tables up to 12TB; ~80TB data lake), and reclaimed ~15% of disk capacity via deduplication—addressing recurring 100% disk saturation incidents. I also delivered LGPD-mandated data removal and workflow adaptations for alphanumeric CNPJ identifiers, and served as a cross-team resource for ETL diagnosis and root-cause analysis.

Experience

Work history, roles, and key accomplishments

LS

Data Engineer I–III

LexisNexis Risk Solutions

Jun 2021 - Oct 2025 (4 years 4 months)

Designed, maintained, and evolved production data ingestion pipelines processing ~3–4TB across 8 structured source systems, progressing from Engineer I to III over 4 years. Improved reliability for a major client by reclaiming ~15% of disk capacity via deduplication (~100% disk saturation incidents resolved) and implemented LGPD-mandated ingestion changes for alphanumeric CNPJ identifiers.

RS

Full Stack Developer

Radocc Softwares

Nov 2019 - May 2021 (1 year 6 months)

Developed R&D for agricultural technology products using ReactJS/TypeScript and Java, and built mobile features with Flutter. Designed embedded systems and assembled PCB circuits for IoT-based agricultural monitoring applications.

Education

Degrees, certifications, and relevant coursework

Universidade Tecnológica Federal do Paraná logoUP

Universidade Tecnológica Federal do Paraná

Master of Engineering, Production Systems Management

2024 -

Pursuing an M.Eng. in Production Systems Management with research in video-based rPPG estimation, signal processing, and computer vision for physiological monitoring.

Universidade Tecnológica Federal do Paraná logoUP

Universidade Tecnológica Federal do Paraná

Bachelor of Engineering, Computer Engineering

2016 - 2021

Earned a B.Eng. in Computer Engineering from UTFPR.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan