Sean Patel
@seanpatel
Senior Data Engineer with expertise in real-time data pipelines.
What I'm looking for
I am a Senior Data Engineer with extensive experience in architecting and maintaining real-time data pipelines. At Amazon Web Services (AWS), I automated data ingestion from more than 15 sources and implemented data quality checks using Great Expectations and Pytest. Migrating from a batch to a streaming architecture reduced end-to-end pipeline latency by 75%, improving operational analytics and product telemetry.
Previously, at Stripe, I designed and scaled the internal financial reporting pipeline, processing billions of transactions daily. My collaboration with the machine learning team resulted in a model-serving feature store that accelerated the deployment of fraud and underwriting models. I also established data governance practices that improved data integrity and reduced storage costs by 40% through effective partitioning and archiving strategies.
My journey began at CVS Health, where I developed ETL pipelines to unify patient records and built KPI dashboards that enabled clinical teams to track medication adherence. I am passionate about mentoring junior engineers and continuously improving data processes to drive business value.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Amazon Web Services (AWS)
Jan 2021 - Present (4 years 6 months)
Architected and maintained real-time data pipelines using Kinesis, Glue, and Redshift to support product telemetry and operational analytics for AWS services. Automated data ingestion from 15+ sources and implemented robust data quality checks using Great Expectations and Pytest.
Senior Data Engineer
Stripe
Sep 2017 - Dec 2020 (3 years 3 months)
Designed and scaled Stripe's internal financial reporting pipeline using Snowflake, Airflow, and S3, processing billions of transactions daily. Developed core Python data services to calculate key revenue metrics (MRR, ARR, churn), used by FP&A and exec teams.
Data Engineer
CVS Health
Feb 2014 - Aug 2017 (3 years 6 months)
Developed ETL pipelines to unify patient records across retail and pharmacy systems using SSIS, PostgreSQL, and Python. Enabled clinical teams to track medication adherence by building KPI dashboards in PowerBI with automated refreshes.
Education
Degrees, certifications, and relevant coursework
University of Illinois Urbana-Champaign
Master's Degree, Computer Science
Completed a Master's degree in Computer Science. Focused on advanced topics in the field.
