Open to opportunities

Alina Paudel

@alinapaudel

Message

I am a results-driven Data Engineer specializing in cloud-scale, HIPAA-compliant data platforms.

United States

Message

What I'm looking for

I seek a senior data engineering role building secure, HIPAA-compliant, cost-optimized cloud data platforms with ML-enabled pipelines, strong data quality, and cross-functional impact.

I am a results-driven Data Engineer with 6+ years building secure, scalable data platforms across healthcare, insurance, and behavioral analytics. I specialize in real-time and batch pipelines using PySpark, Kafka, and dbt across AWS, Azure, and GCP.

At Diverge Health and Spring Health I delivered production-grade ELT and streaming systems, enabling reverse ETL with Hightouch that improved patient outreach targeting by 30% and reduced issue detection time by 50%. I drove data quality and governance—implementing Great Expectations, dbt tests, metadata lineage, and HIPAA-compliant controls—to sustain 99%+ accuracy and improve audit readiness. I also optimized cloud costs and performance, reducing processing times and cloud expenses through query tuning, auto-scaling, and architecture changes.

I enjoy collaborating with product, clinical, and data science teams to translate business requirements into analytics, dashboards, and ML-ready features. I’m seeking roles where I can lead engineering of resilient, cost-effective data platforms and mentor teams while delivering measurable business outcomes.

Experience

Work history, roles, and key accomplishments

Current

Business Analyst / Data Engineer

Current

Diverge Health

Sep 2023 - Present (2 years 10 months)

Built scalable ELT pipelines with Python, dbt, and Snowflake to power patient segmentation and readmission prediction, improving patient outreach targeting by 30%. Implemented reverse ETL with Hightouch and event-driven pipelines using Dagster and Kafka, achieving 99%+ data accuracy and reducing issue detection time by 50%.

Python DBT Snowflake HighTouch Dagster Kafka Great Expectations Power BI

Data Engineer

Spring Health

May 2021 - Aug 2023 (2 years 3 months)

Developed and optimized ETL pipelines using AWS Glue, Lambda, Step Functions and Snowflake, reducing query complexity by 50% and improving Redshift query performance by 35%. Automated data quality with Great Expectations and CI/CD using Terraform and CodePipeline, cutting deployment time from 3 days to 3 hours and reducing processing time by 60%.

Python AWS Glue Snowflake Kafka DBT Redshift Terraform Great Expectations Airflow

Data Engineer

MetLife

Jul 2019 - Apr 2021 (1 year 9 months)

Assisted in building PySpark and AWS Glue ETL pipelines to ingest and transform EHR and claims data, contributing to improved real-time claims ingestion. Contributed dbt transformations for Snowflake/Redshift analytics and implemented data quality checks to support KPI dashboards for operations and actuarial teams.