Arturo Aragón Landa
@arturoaragnlanda
I’m a Data Engineer specializing in Microsoft Fabric, Databricks, and Azure lakehouse architectures to deliver reliable analytics.
What I'm looking for
I’m a Data Engineer specializing in Microsoft Fabric and Databricks on Azure, with deep experience in Lakehouse and Medallion architectures. At EY, I’m currently designing fact and dimension tables, ingestion pipelines, and business metrics for a Finance engagement in the Gaming/Entertainment industry.
I build end-to-end lakehouse solutions using Medallion architecture (Bronze/Silver/Gold) within a Data Mesh of domain-aligned workspaces. I focus on metric engineering and dimensional modeling (star schema) so Power BI reports and KPI dashboards have trustworthy, production-ready data.
Across engagements in automotive, banking, telecom, and logistics, I’ve delivered reliable ETL/ELT pipelines and used exploratory analysis to check data quality before production. I’ve also built ETL workflows with Pentaho and supported a GenAI/RAG prototype integrated into existing pipelines, while working within client constraints around compliance and source complexity.
My background also includes large-scale Databricks processing with Apache Spark, Azure Data Lake Gen2 storage, and helping migrate workloads from Synapse to Databricks for better performance. I bring strong engineering fundamentals, from SQL optimization and Power BI (DAX/Power Query) to backend work with Django and DevOps support, and I’m Microsoft Certified: Fabric Data Engineer Associate (DP-700).
Experience
Work history, roles, and key accomplishments
Designed Lakehouse and Data Mesh solutions in Microsoft Fabric using Medallion architecture, including star-schema fact and dimension tables for Finance KPIs. Built ingestion pipelines with Fabric Data Factory and PySpark and supported DevOps promotion across Dev/UAT/Prod workspaces with Azure DevOps.
Data Engineer (Contract)
VinkOS
Oct 2025 - Jan 2026 (3 months)
Built Pentaho ETL workflows and deployed/troubleshot Pentaho on Linux servers using shell scripts. Contributed to a Retrieval-Augmented Generation (RAG) prototype integrated with Pentaho pipelines for contextual answers from internal data.
Data Engineer
X-Data
Feb 2025 - Sep 2025 (7 months)
Developed and managed Databricks tables and large-scale Apache Spark processing jobs. Ingested data from external sources with Azure Data Factory and supported the migration of workloads from Azure Synapse Analytics to Databricks.
Wrote and optimized SQL queries and built PySpark ETL workflows for reliable daily processing. Delivered Power BI dashboards using DAX/Power Query and supported data management initiatives aligned with internal IT security standards.
Supported ETL development in PySpark for the data integration team and integrated external APIs into internal workflows. Wrote SQL and managed Snowflake pipelines with an emphasis on storage/query performance, and deployed web applications on Microsoft Azure.
DevOps Intern
WebForce
May 2022 - Jul 2022 (2 months)
Implemented web application features using Laravel (PHP), Node.js, and Express with Bootstrap-based UIs and SQL backends. Wrote Jest unit tests and assisted with deployment workflows and development environment setup.
Education
Degrees, certifications, and relevant coursework
Universidad del Caribe
Bachelor of Data Engineering and Organizational Intelligence
2020 - 2024
Earned a Bachelor’s degree in Data Engineering and Organizational Intelligence at Universidad del Caribe from 2020 to 2024.
