Shubham Shah
@shubhamshah1
Senior Data Engineer specializing in cloud-native, scalable ETL/ELT and large-scale healthcare data platforms.
What I'm looking for
I am a Senior Data Engineer with 7+ years designing and deploying cloud-native data solutions across AWS, Azure, and GCP for healthcare, financial services, and retail. I build scalable ETL/ELT pipelines, enterprise data warehouses, and streaming architectures that support mission-critical analytics and regulatory compliance.
My work includes engineering 100+ ETL pipelines processing 10TB+ daily, implementing Delta Lake and Medallion architectures, and orchestrating reliable 24/7 workflows with strong observability and security controls (HIPAA, SOC2, PCI-DSS). I have improved data quality, reduced latency from hours to seconds, and scaled processing while cutting infrastructure costs.
I bring hands-on expertise with Databricks, PySpark, Spark Streaming, Airflow, Kafka, Snowflake, Redshift, Azure Synapse, Terraform, and containerized deployments, and I focus on enabling self-service analytics, robust data governance, and production-grade ML/AI integrations.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Change Healthcare
Jun 2023 - Present (2 years 7 months)
Architected and optimized 100+ ETL pipelines processing 10TB+ healthcare data daily, improving data quality 40% and reducing analytics latency from hours to seconds via hybrid batch/real-time ingestion and streaming technologies.
Data Engineer
Capital One
Aug 2021 - May 2023 (1 year 9 months)
Designed Azure-based ETL workflows and Medallion architecture on ADLS Gen2, improving data reuse 30% and delivering 99.95% pipeline reliability with sub-second streaming ingestion for live risk dashboards.
ETL Developer
Signify Health
Mar 2020 - Jul 2021 (1 year 4 months)
Built Spark and Hadoop ETL workflows ingesting 50+ healthcare feeds, accelerating daily refreshes from 4 hours to 45 minutes and achieving 98%+ processing reliability through incremental loads and data contracts.
Python Developer
Best Buy
Jul 2018 - Dec 2019 (1 year 5 months)
Developed Python automation and ETL scripts for inventory and POS data, achieving 100% accuracy in quarterly audits and reducing manual reporting effort by 70% across 500+ locations.
Education
Degrees, certifications, and relevant coursework
Clark University
Master of Science, Computer Science
Master of Science in Computer Science from Clark University.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Talend
Amazon Quicksight
AWS Step Functions
Kubernetes
Jenkins
Pandas
PySpark
dbt
DB
Sqoop
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Databricks
Redis
Terraform
Azure DevOps
Jira
Java
JSON
XML
Kafka
Apache NiFi
Azure Monitor
Linux
Airflow
SQL
Azure Cosmos DB
Hugging Face
Apache Iceberg
LangChain
Ray
Delta Lake
Great Expectations
Cosmos
Bash
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Shubham?
You can contact Shubham and 90k+ other talented remote workers on Himalayas.
Message ShubhamFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
