Prerit Mahajan
@preritmahajan1
Senior Data Engineer building scalable ETL/ELT and lakehouse architectures that power analytics and AI/ML pipelines.
What I'm looking for
I’m a Senior Data Engineer with 10+ years of experience designing and maintaining scalable ETL/ELT pipelines, data models, and lakehouse/data warehouse architectures across AWS, Azure, and GCP. I turn raw, semi-structured data into reliable, structured formats that directly support analytics and AI/ML workflows.
At Oasys International, I architected enterprise pipelines using Python, PySpark, NumPy, and SQL, reducing analysis turnaround by 30% and cutting 8-hour processing runs to under 6 hours. I also built Medallion Architecture models for 500,000+ record datasets and deployed data quality monitoring aligned with Great Expectations across 5+ concurrent workstreams.
I’ve driven production impact through REST API integrations, CI/CD-driven automation (reducing manual operations by 70%), and cross-team collaboration, improving congestion prediction model accuracy by 25%. In earlier roles, I built ELT/ETL systems processing 5M+ records/month with 99.9% uptime and delivered high-performance ML and data services at Nike, supporting 50M+ users during global launches.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Oasys International, LLC
Apr 2021 - Jan 2026 (4 years 9 months)
Architected and maintained enterprise ETL/ELT pipelines that reduced analysis turnaround by 30% and cut 8-hour runs to under 6 hours. Built Medallion Architecture data models and Great Expectations-aligned data quality monitoring, improving data integrity across 5+ concurrent research workstreams.
Data Engineer
The Judge Group
May 2017 - Mar 2021 (3 years 10 months)
Designed and maintained ELT/ETL pipelines across 6 integrations for a top-10 U.S. health payor, processing 5M+ member/claims records per month while reducing processing delays by 50% and increasing throughput by 40%. Implemented data models, validation rules, and batch workflows with Airflow to reduce data quality incidents by 60% and cut manual operations by 80%, saving 20+ hours/week.
Data Scientist
Nike
Sep 2013 - Apr 2017 (3 years 7 months)
Built high-performance Python and Java data services on the SNKRS platform, sustaining 2M+ simultaneous requests with sub-100ms latency during global launch events. Developed an ML-based anti-bot detection pipeline that reduced automated abuse by 65% and protected $10M+ in annual GMV.
SQL Developer
Jan 2012 - Aug 2013 (1 year 7 months)
Developed SQL and C++ components for Mesa to support analytics pipelines processing 10B+ rows/day across petabyte-scale, multi-datacenter environments. Contributed to MapReduce ETL optimizations and improved cross-datacenter reliability from 97.2% to 99.4% through consistency and replication mechanisms.
Education
Degrees, certifications, and relevant coursework
Harvard University
Master's degree, Computer Science
2008 - 2012
Earned a Master's degree in Computer Science at Harvard University from 2008 to 2012.
University of Illinois Urbana-Champaign
Bachelor's degree, Computer Science
2004 - 2008
Earned a Bachelor's degree in Computer Science at the University of Illinois Urbana-Champaign from 2004 to 2008.
