Himalayas logo
SS
Open to opportunities

Shiva Sah

@shivasah

Experienced Data Engineer with a passion for scalable data solutions.

United States
Message

What I'm looking for

I seek a role that fosters innovation and collaboration, focusing on data-driven solutions.

I am an experienced Data Engineer with about 7 years of expertise in building scalable data platforms and ETL pipelines across AWS, Azure, and GCP. My proficiency in technologies such as Spark, Snowflake, and Databricks allows me to deliver high-performance data solutions that drive analytics and machine learning readiness. I thrive in Agile environments and excel at collaborating cross-functionally to meet business objectives.

At Pfizer, I led a global initiative to modernize clinical and manufacturing analytics by implementing a production-grade lakehouse platform. This project not only streamlined data access across teams but also supported regulatory submissions and real-time analytics. My commitment to data quality and governance has resulted in significant improvements in operational efficiency and compliance across various organizations, including Wells Fargo and Johnson & Johnson.

Experience

Work history, roles, and key accomplishments

PF
Current

Lead Data Engineer

Pfizer

Sep 2023 - Present (2 years)

Led Pfizer's global initiative to modernize clinical and manufacturing analytics by delivering a production-grade lakehouse platform integrating genomics, assay, batch, and safety data. This unified foundation supports ML workloads, regulatory submissions, and GxP-compliant real-time analytics across vaccine and oncology programs. Spearheaded the design and implementation of a Medallion-layer lake

WF

Senior Data Engineer

Wells Fargo

Jan 2021 - Present (4 years 8 months)

Led cloud transformation initiatives across Wells Fargo's retail and commercial banking lines by designing and scaling a unified financial data platform for real-time fraud detection, regulatory compliance (SOX, CCAR), customer insights, and credit risk analytics across Azure and GCP. Architected scalable ETL pipelines using Azure Data Factory and Databricks (PySpark) to process high-volume credit

JJ

Data Engineer

Johnson & Johnson

Sep 2018 - Present (7 years)

Built a cloud-native data platform to integrate Medicare, Medicaid, and clinical datasets, enabling real-time analytics, ML workflows, and regulatory compliance across J&J's pharma and med-tech units using AWS, Azure, Spark, Kafka, and Snowflake. Designed and optimized ETL pipelines using PySpark, processing large-scale healthcare datasets from Medicare, Medicaid, and commercial lines to support a

Education

Degrees, certifications, and relevant coursework

KC

Kalamazoo College

Bachelor's, Computer Science & Business and Economics

Studied Computer Science and Business and Economics, gaining foundational knowledge in both technical and economic principles. Developed skills in problem-solving, data analysis, and business strategy.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Shiva Sah - Lead Data Engineer - Pfizer | Himalayas