Ashwani kumar
@ashwanikumar8
Senior Data Engineer designing scalable data pipelines with big data and ML workflows.
What I'm looking for
I’m a Senior Data Engineer with strong expertise in designing and implementing scalable data architectures. I focus on leveraging big data technologies to optimize data workflows and improve data accessibility, so teams can make better data-driven decisions.
At GEP, I created the SDM(v2) data pipeline in PySpark and Scala-Spark to handle end-to-end ingestion, cleansing, consolidation, profile reporting, transliteration (via ChatGPT), and quality checks. The Spark code processed over 300 million transactions (~500 GB), and I built an ADB automation test pipeline for regression testing. I also owned the release process through a CI/CD pipeline, optimized performance using persist, broadcast joins, accumulators, and partitioning, and implemented medallion architecture and timely maintenance (optimize, vacuum) to reduce infra cost.
Previously at OPTUM - UHG, I worked as an Analyst on healthcare cost prediction using machine learning and statistics—covering models like regression, gradient boost, LightGBM, and random forests. I automated pipelines and delivered outputs to US stakeholders, generating revenue worth $500K, and I led a $300K “Group Cost Predictors” project from India while driving analytical feature engineering and insights.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
GEP
Jul 2022 - Present (3 years 11 months)
Built and owned SDM(v2) Spark pipelines in PySpark/Scala-Spark for ingestion, cleansing, consolidation, profiling, transliteration, and quality checks across ~300M transactions (~500 GB). Implemented medallion architecture and Unity Catalog access controls, optimized Spark performance (persist/broadcast/partitioning), and delivered CI/CD releases with ADB regression test automation.
Analyst
Optum UHG
Jun 2019 - Jul 2022 (3 years 1 month)
Developed healthcare cost predictor models using ML/statistical methods (regression, gradient boosting, LightGBM, random forests) based on claims and demographic factors, producing insights for neonates, maternity costs, and outcomes. Automated data pipelines and delivered client-ready results, generating $500K in revenue; also led a $300K Group Cost Predictors project for a US payer client.
Education
Degrees, certifications, and relevant coursework
Bangalore Institute of Technology
Bachelor of Engineering, Computer Science and Engineering
Grade: 7.29 GPA
Completed a BE in Computer Science and Engineering at Bangalore Institute of Technology with a 7.29 GPA.
Gulmohur High School
Indian School Certificate (ISC), ISC
Grade: 88.20%
Completed ISC (XII) at Gulmohur High School, Jamshedpur, scoring 88.20%.
Gulmohur High School
Indian Certificate of Secondary Education (ICSE), ICSE
Grade: 92.50%
Completed ICSE (X) at Gulmohur High School, Jamshedpur, scoring 92.50%.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Salary expectations
Social media
Interested in hiring Ashwani?
You can contact Ashwani and 90k+ other talented remote workers on Himalayas.
Message AshwaniFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
