Robin Sah
@robinsah
Senior Data Engineer delivering scalable, compliant cloud data platforms and cost-saving pipeline optimizations.
What I'm looking for
I am a Senior Data Engineer with 7+ years building scalable, compliant data platforms for healthcare and financial firms. I design cloud-native architectures that process terabytes of data while reducing infrastructure costs and improving pipeline reliability.
I have deep expertise across Azure, AWS, and GCP, with hands-on experience in Databricks, Snowflake, Airflow, dbt, Terraform, and CI/CD automation. My work has produced measurable outcomes, including multi-hundred-thousand-dollar annual savings and major latency and runtime improvements.
I lead data governance and quality efforts—implementing HIPAA/SOX-compliant frameworks, data lineage, and automated validations—that eliminate compliance violations and reduce downstream anomalies. I also build RAG/LLM-powered solutions to accelerate document search and decision-making.
I mentor junior engineers and collaborate closely with analytics, risk, and business stakeholders to deliver trusted, analytics-ready datasets that enable faster, data-driven decisions and operational resilience.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Cencora
Apr 2024 - Present (1 year 9 months)
Designed a cloud data lake processing 5TB+ of pharmaceutical data weekly, reducing storage costs 22% and enabling real-time inventory decisions. Implemented a medallion architecture and data quality gates, reducing downstream anomalies 40% and cutting Databricks costs 35% ($150K in annual savings).
Built streaming pipelines handling 500K events/sec at <100ms latency to enable real-time recommendations (22% higher conversion). Built automated feature pipelines, halving ML development cycle time while reducing ETL runtime 55% and cutting monthly cloud costs by $40K.
ETL Developer
Northwestern Mutual
Feb 2018 - Oct 2022 (4 years 8 months)
Designed Kimball-based dimensional warehouse and maintained 200+ SSIS ETL workflows processing 20M+ records daily, improving report generation time 65% and reducing post-load anomalies 92% through automated validation and reconciliation controls.
Education
Degrees, certifications, and relevant coursework
The George Washington University
Master of Science, Data Analytics
