Shruti Shah
@shrutishah3
Senior Data Engineer specializing in cloud-native data platforms, Lakehouse architectures, and ETL/ELT pipelines.
What I'm looking for
I am a Senior Data Engineer with 7+ years designing, building, and supporting enterprise-scale data platforms across retail, healthcare, and insurance domains. I specialize in cloud-native architectures (primarily Azure, with hands-on AWS and GCP experience) and deliver scalable ETL/ELT pipelines and Lakehouse solutions.
I've led migrations from legacy systems to cloud platforms, implemented Databricks and Snowflake-based solutions, and optimized distributed processing for high-volume workloads to reduce costs and improve performance. I combine strong technical expertise—PySpark, Apache Spark, BigQuery, Azure Data Factory—with a focus on governance, security, and CI/CD automation.
I collaborate closely with data scientists, analysts, and business stakeholders to produce analytics-ready datasets, ensure compliance with regulatory standards, and mentor junior engineers while documenting architecture and operational best practices to support reliable production systems.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
The Hartford
Apr 2024 - Present (1 year 10 months)
Architected a GCP-native enterprise data platform and led migration from AWS, reducing cloud infrastructure costs by 25% while enabling batch and near-real-time analytics for underwriting, claims, and risk domains.
Data Engineer
Johnson & Johnson
Dec 2022 - Apr 2024 (1 year 4 months)
Designed and delivered Azure-native data pipelines and a Medallion Azure Data Lake, improving ETL performance and ensuring FDA/HIPAA-compliant data delivery for clinical and R&D analytics.
Developed and maintained enterprise ETL pipelines with Informatica PowerCenter and Oracle PL/SQL, optimizing batch workloads and dimensional models to support retail analytics and inventory reporting.
Backend Developer
Pfizer
Mar 2019 - Oct 2021 (2 years 7 months)
Supported backend services and RESTful APIs for clinical and operational data platforms, improving data validation and reducing data quality issues in daily ingestion feeds.
Education
Degrees, certifications, and relevant coursework
Montclair State University
Master of Science, Computer Science
Master of Science in Computer Science focusing on advanced computing concepts and practical applications relevant to data engineering.
Tech stack
Software and tools used professionally
Azure Synapse
Apache Spark
AWS Glue
AWS IAM
Microsoft Azure
Google Cloud Platform
Amazon S3
Google Cloud Storage
GitHub
Jenkins
NumPy
Pandas
PySpark
dbt
MySQL
PostgreSQL
MongoDB
IBM DB2
Gmail
Django
Databricks
Azure DevOps
Jira
Java
JSON
Azure Monitor
Linux
Windows
AWS Lambda
Azure SQL Database
OAuth2
Airflow
Apache Beam
Time Analytics
Root Cause
Amazon EMR
SQL
Delta Lake
Bash
Enhance
Unity Catalog
Task
Factory
Beam
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Shruti?
You can contact Shruti and 90k+ other talented remote workers on Himalayas.
Message ShrutiFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
