Neel Shah
@neelshah
Senior Data Engineer with expertise in scalable data solutions.
What I'm looking for
I am a Senior Data Engineer with over 10 years of hands-on experience building and managing scalable data solutions across Azure and AWS. My expertise lies in developing robust ETL pipelines, optimizing data warehouses, and delivering advanced analytics. I have used tools such as Microsoft Fabric, Azure Synapse, Databricks, Snowflake, Spark, and Python to drive insights and improve operations in sectors such as fintech, e-commerce, and social media.
I thrive in agile, cross-functional teams, aligning data strategies with business goals to support smarter decision-making and deliver measurable impact. At Stripe, I designed and implemented a scalable cloud-native data architecture that streamlined data flow for real-time financial analytics. My role involved mentoring junior engineers and collaborating with product teams to optimize data access for downstream analytics.
Previously at Reddit, I led the modernization of data platforms and developed event-driven data models that supported personalized recommendations and trust & safety dashboards. My commitment to data governance and best practices has consistently improved team productivity and pipeline reliability.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Stripe, Inc.
Jan 2023 - Present (2 years 5 months)
Designed and implemented a scalable, cloud-native data architecture on AWS and Snowflake using Medallion Architecture, streamlining raw-to-insight data flow to support real-time financial analytics and risk modeling. Built end-to-end ELT pipelines using Python, SQL, and dbt to power Stripe’s internal data products, ensuring accurate reconciliation of transaction and payment data across services.
Lead Data Engineer
Reddit, Inc.
Dec 2021 - Present (3 years 6 months)
Led Reddit’s data platform modernization by rebuilding legacy pipelines into scalable, modular ELT frameworks using dbt, Airflow, and Snowflake. Designed and managed real-time streaming systems with Kafka and Spark Structured Streaming to support feed ranking, content tagging, and user activity tracking.
Senior Data Engineer
Reddit, Inc.
Jun 2017 - Present (8 years)
Built and maintained distributed data pipelines on AWS using Spark, EMR, Redshift, and S3, processing billions of user interactions daily. Created and maintained star schema–based models in Redshift and Snowflake to support product, ads, and community teams with performant, scalable data access.
Data Engineer
Reddit, Inc.
Oct 2015 - Present (9 years 8 months)
Built Reddit’s foundational data infrastructure on AWS, configuring secure VPCs, EC2, RDS, and S3 for scalable batch data processing. Created ingestion pipelines to centralize Reddit post, comment, vote, and session data into the company’s first data lake.
Education
Degrees, certifications, and relevant coursework
University of Texas at Austin
Bachelor of Science, Computer Science
Studied Computer Science at the University of Texas at Austin. The curriculum covered both fundamental concepts and advanced topics in the field, preparing me for a career in data engineering.
