Looking for a job

Mark Modesto

@markmodesto

Senior data engineer specializing in Azure/AWS ETL/ELT pipelines, data lake/lakehouse, and AI-driven validation at enterprise scale.

United States

What I'm looking for

I’m looking to build enterprise data lakes/lakehouses on Azure or AWS—owning ETL/ELT, governance, and CI/CD, with room to apply LLM-based anomaly detection while partnering with Data Science and Product in Agile, mission-critical environments.

I’m a Senior Data Engineer with 10+ years building enterprise-scale data platforms, ETL/ELT pipelines, and cloud-based analytics systems across healthcare, manufacturing, and operations. I focus on scalable lake and lakehouse architectures that support BI dashboards, AI/ML-driven insights, and automated data validation.

At MSI (Jul 2024–Present), I designed and deployed Azure-based ETL/ELT pipelines using Azure Databricks, Azure Data Factory, SQL Server, Snowflake, Python, and PySpark, enforcing enterprise data governance, security, and compliance. I implemented AI-assisted anomaly detection and LLM-driven validation workflows that reduced manual validation effort by 65%, built reusable PySpark + dbt frameworks for 50+ pipelines (cutting development time by 35%), and led migrations to a modern Azure data lakehouse architecture for near real-time reporting.

Previously at Health Catalyst (Jan 2019–May 2024), I delivered AWS ETL/ELT pipelines (S3, Glue, Lambda, Redshift, Snowflake, dbt, PySpark) with HIPAA-aligned security, and created AI/LLM-driven NLP pipelines that improved downstream ML model accuracy by 30%. Earlier at Motorola (Jan 2015–Nov 2018), I built manufacturing ETL pipelines and optimized SQL and PL/SQL queries (up to 3x faster reporting), while mentoring teams and leading Agile delivery to keep pipelines reliable, observable, and analytics-ready.

Experience

Work history, roles, and key accomplishments

Current

Senior Data Engineer

MSI

Jul 2024 - Present (1 year 11 months)

Designed and deployed enterprise-scale Azure ETL/ELT pipelines across multi-terabyte ERP and operational datasets, enforcing governance, security, and compliance. Implemented LLM-driven anomaly detection and dbt/PySpark standardization, cutting manual validation effort by 65% and reducing development time by 35%.


Senior Data Engineer

Health Catalyst

Jan 2019 - May 2024 (5 years 4 months)

Built AWS ETL/ELT pipelines for clinical, claims, and operational healthcare datasets with HIPAA compliance, improving automated monitoring and trusted delivery of analytics datasets. Developed AI/LLM-driven NLP workflows and automated data quality validation, reducing manual intervention by 60% and improving downstream ML model accuracy by 30%.


Data Engineer

Motorola

Jan 2015 - Nov 2018 (3 years 10 months)

Developed and maintained large-scale SQL Server/Oracle and PySpark ETL pipelines for manufacturing and operational telemetry, enabling dashboards for predictive maintenance and fault detection. Automated transformation and validation workflows, reducing manual effort by 70%, and optimized SQL and PL/SQL queries for up to 3x faster reporting performance.

Education

Degrees, certifications, and relevant coursework

Lewis University

Bachelor's Degree, Computer Science

2010 - 2014

Earned a bachelor's degree in Computer Science from Lewis University (2010–2014).
