Ricardo Roa
@ricardoroa
Senior Data Engineer specializing in cloud-native ETL, Snowflake, and AWS automation.
What I'm looking for
I am a Senior Data Engineer with extensive experience designing, developing, and optimizing cloud-based data solutions, primarily on AWS and Snowflake. I specialize in building end-to-end ETL/ELT pipelines, data models, and scalable infrastructure using Terraform, Docker, and CI/CD.
Across roles at Hakkoda, Nagarro, Globant, and others, I optimized ETL pipelines, automated Snowflake deployments with Schemachange and CI/CD, and developed containerized Lambda deployments. I integrated monitoring workflows (Slack–AWS–Snowflake) and implemented version-controlled infrastructure to ensure high availability and data reliability.
I have built ML-enabled fuzzy-matching and entity-resolution pipelines, designed analytical models and Power BI dashboards, and led cloud migrations to BigQuery and GCP storage. I focus on observability and cost-aware design, implementing Datadog metrics, Athena-based data quality checks, and right-sized compute for production workloads.
I hold certifications in Snowflake Data Engineering, dbt Fundamentals, and AWS Data Engineering. I bring a strong analytical mindset, a passion for reliable automation, and a focus on delivering measurable business impact through clean data engineering practices.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Hakkoda
Jan 2025 - Oct 2025 (9 months)
Modified and optimized AWS Lambda and Glue ETL pipelines to ingest multiple data sources into Snowflake, implemented CI/CD with Terraform and GitHub to deploy infrastructure, and built Athena-based data quality checks to ensure dataset integrity.
Senior Data Engineer
Bluecloud
Jan 2025 - May 2025 (4 months)
Advised on Snowflake architecture and best practices, designed standardized data loading patterns and CI/CD deployments using GitHub Actions and Schemachange, and promoted Snowpark and right-sized warehouses to improve pipeline maintainability and performance.
Senior Data Engineer
Kubikware
Apr 2024 - Apr 2025 (1 year)
Refactored AWS Lambda codebase, implemented Pytest coverage and GitLab CI/CD, containerized Lambdas with Docker, and built AWS Glue jobs and Datadog dashboards to improve observability and reduce storage costs via S3 profiling.
Senior Data Engineer
Nagarro
May 2024 - Jan 2025 (8 months)
Deployed Lambda-based services with Terraform and Docker, designed Snowflake data models and dbt transformations, automated Snowflake object deployments with Schemachange, and integrated Slack–Snowflake–AWS ingestion workflows with CI/CD automation.
Built ML-enabled fuzzy-matching and ETL pipelines using RecordLinkage, RapidFuzz, and AWS Glue to consolidate entity data into Redshift, deployed parallel Lambdas for summarization, and materialized Glue/Athena tables for downstream analytics.
Senior Data Engineer
Chubut IT
Sep 2022 - Nov 2022 (2 months)
Built marketing ETL pipelines using Airflow and Python, migrated data into BigQuery via Cloud Functions, and automated S3→GCS transfers to enable scalable reporting and unified campaign analytics.
Data Engineer
Tech Mahindra
Mar 2022 - Nov 2022 (8 months)
Led S3→GCS and BigQuery migration workflows with Airflow, implemented incremental loading, materialized views and partitioning to improve query performance, and developed Glue jobs to unify historical Parquet datasets into Athena tables.
Data Engineer
Heinsohn
Sep 2021 - Mar 2022 (6 months)
Designed analytical models and Power BI dashboards for energy consumption and subsidy analysis, implemented DAX and SSAS tabular models, and applied Python analytics for outlier detection in kWh usage.
IT Lead / SQL Developer
Coodescor
Sep 2020 - Mar 2021 (6 months)
Developed internal Java applications and T-SQL stored procedures for operational modules, implemented SLA policies and backup strategies, and improved network coverage with Ubiquiti deployments.
Data Engineer / IT Lead
Clinica Valle del Sinú
Dec 2019 - Jun 2020 (6 months)
Built the institutional website, extended wireless LAN coverage, implemented SQL automation for admission validation, and managed backups to ensure data integrity and continuity.
Support Engineer
Clinica Casa del Niño
Sep 2017 - Dec 2018 (1 year 3 months)
Developed SQL reporting scripts for regulatory compliance and quality indicators, and created a Java claims application to improve tracking and resolution processes.
IT Instructor
Servicio Nacional de Aprendizaje (SENA)
Nov 2013 - Dec 2016 (3 years 1 month)
Designed and implemented IT maintenance plans and LAN connectivity projects for educational centers, and delivered practical SQL and Microsoft Office training to improve students' technical proficiency.
Support Engineer
Baysis
May 2012 - May 2013 (1 year)
Improved hospital LAN topology and developed T-SQL queries for ERP financial reporting, improving system reliability and reporting accuracy.
Education
Degrees, certifications, and relevant coursework
Fundación Universitaria San Martín
System Engineer, Systems Engineering
2010 -
Completed studies in Systems Engineering at Fundación Universitaria San Martín, focusing on software development, databases, and IT systems.
Tech stack
Software and tools used professionally
Amazon Redshift
Apache Spark
AWS Glue
Amazon S3
Google Cloud Storage
GitHub
GitLab
GitHub Actions
GitLab CI
Pandas
PySpark
dbt
Liquibase
MySQL
Microsoft SQL Server
Terraform
Java
JSON
TensorFlow
scikit-learn
Windows
Datadog
Quora
WordPress
AWS Lambda
Google Cloud Functions
Azure SQL Database
pytest
Airflow
Azure Analysis Services
Amazon Athena
SQL
Google Ads
LangChain
Soda
Cursor
Instructor
Enhance
