Alina Paudel
@alinapaudel
I am a results-driven Data Engineer specializing in cloud-scale, HIPAA-compliant data platforms.
What I'm looking for
I am a results-driven Data Engineer with 6+ years building secure, scalable data platforms across healthcare, insurance, and behavioral analytics. I specialize in real-time and batch pipelines using PySpark, Kafka, and dbt across AWS, Azure, and GCP.
At Diverge Health and Spring Health I delivered production-grade ELT and streaming systems, enabling reverse ETL with Hightouch that improved patient outreach targeting by 30% and reduced issue detection time by 50%. I drove data quality and governance—implementing Great Expectations, dbt tests, metadata lineage, and HIPAA-compliant controls—to sustain 99%+ accuracy and improve audit readiness. I also optimized cloud costs and performance, reducing processing times and cloud expenses through query tuning, auto-scaling, and architecture changes.
I enjoy collaborating with product, clinical, and data science teams to translate business requirements into analytics, dashboards, and ML-ready features. I’m seeking roles where I can lead engineering of resilient, cost-effective data platforms and mentor teams while delivering measurable business outcomes.
Experience
Work history, roles, and key accomplishments
Business Analyst / Data Engineer
Diverge Health
Sep 2023 - Present (1 year 11 months)
Built scalable ELT pipelines with Python, dbt, and Snowflake to power patient segmentation and readmission prediction, improving patient outreach targeting by 30%. Implemented reverse ETL with Hightouch and event-driven pipelines using Dagster and Kafka, achieving 99%+ data accuracy and reducing issue detection time by 50%.
Data Engineer
Spring Health
May 2021 - Aug 2023 (2 years 3 months)
Developed and optimized ETL pipelines using AWS Glue, Lambda, Step Functions and Snowflake, reducing query complexity by 50% and improving Redshift query performance by 35%. Automated data quality with Great Expectations and CI/CD using Terraform and CodePipeline, cutting deployment time from 3 days to 3 hours and reducing processing time by 60%.
Data Engineer
MetLife
Jul 2019 - Apr 2021 (1 year 9 months)
Assisted in building PySpark and AWS Glue ETL pipelines to ingest and transform EHR and claims data, contributing to improved real-time claims ingestion. Contributed dbt transformations for Snowflake/Redshift analytics and implemented data quality checks to support KPI dashboards for operations and actuarial teams.
Education
Degrees, certifications, and relevant coursework
University of the Cumberlands
Master of Business Administration, Business Administration
Master of Business Administration degree from the University of the Cumberlands.
Tech stack
Software and tools used professionally
OpenAPI
Airbyte
Fivetran
Apache Spark
AWS Glue
Apache Flink
Talend
Amazon Quicksight
AWS IAM
Amazon S3
Google Cloud Storage
AWS Step Functions
GitHub
GitLab
Kubernetes
AWS CodePipeline
Jenkins
GitHub Actions
GitLab CI
Salesforce
NumPy
Pandas
PySpark
AWS Glue DataBrew
dbt
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
Vertica
Gmail
Databricks
Redis
Terraform
Azure DevOps
Jira
JavaScript
JSON
TensorFlow
PyTorch
MLflow
scikit-learn
Streamlit
Kafka
Apache NiFi
Apache Pulsar
FastAPI
PagerDuty
Grafana
Prometheus
Datadog
GraphQL
Elasticsearch
Avro
AWS Lambda
Azure Functions
Azure SQL Database
Kafka Streams
pytest
Airflow
Apache Beam
Root Cause
Amazon EMR
Amazon Athena
SQL
Amazon SageMaker
Azure Cosmos DB
XGBoost
AWS KMS
Mode Analytics
Dagster
Hightouch
Monte Carlo
Delta Lake
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Alina?
You can contact Alina and 90k+ other talented remote workers on Himalayas.
Message AlinaFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
