Ashwani kumar
@ashwanikumar8
Senior Data Engineer designing scalable data pipelines with big data and ML workflows.
What I'm looking for
I’m a Senior Data Engineer with strong expertise in designing and implementing scalable data architectures. I focus on leveraging big data technologies to optimize data workflows and improve data accessibility, so teams can make better data-driven decisions.
At GEP, I created the SDM(v2) data pipeline in PySpark and Scala-Spark to handle end-to-end ingestion, cleansing, consolidation, profile reporting, transliteration (via ChatGPT), and quality checks. The Spark code processed over 300 million transactions (~500 GB), and I built an ADB automation test pipeline for regression testing. I also owned the release process through a CI/CD pipeline, optimized performance using persist, broadcast joins, accumulators, and partitioning, and implemented medallion architecture and timely maintenance (optimize, vacuum) to reduce infra cost.
Previously at OPTUM - UHG, I worked as an Analyst on healthcare cost prediction using machine learning and statistics—covering models like regression, gradient boost, LightGBM, and random forests. I automated pipelines and delivered outputs to US stakeholders, generating revenue worth $500K, and I led a $300K “Group Cost Predictors” project from India while driving analytical feature engineering and insights.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
GEP
Jul 2022 - Present (3 years 9 months)
Built and owned SDM(v2) Spark pipelines in PySpark/Scala-Spark for ingestion, cleansing, consolidation, profiling, transliteration, and quality checks across ~300M transactions (~500 GB). Implemented medallion architecture and Unity Catalog access controls, optimized Spark performance (persist/broadcast/partitioning), and delivered CI/CD releases with ADB regression test automation.
Analyst
Optum UHG
Jun 2019 - Jul 2022 (3 years 1 month)
Developed healthcare cost predictor models using ML/statistical methods (regression, gradient boosting, LightGBM, random forests) based on claims and demographic factors, producing insights for neonates, maternity costs, and outcomes. Automated data pipelines and delivered client-ready results, generating $500K in revenue; also led a $300K Group Cost Predictors project for a US payer client.
Education
Degrees, certifications, and relevant coursework
Bangalore Institute of Technology
Bachelor of Engineering, Computer Science and Engineering
Grade: 7.29 GPA
Completed a BE in Computer Science and Engineering at Bangalore Institute of Technology with a 7.29 GPA.
Gulmohur High School
Indian School Certificate (ISC), ISC
Grade: 88.20%
Completed ISC (XII) at Gulmohur High School, Jamshedpur, scoring 88.20%.
Gulmohur High School
Indian Certificate of Secondary Education (ICSE), ICSE
Grade: 92.50%
Completed ICSE (X) at Gulmohur High School, Jamshedpur, scoring 92.50%.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Salary expectations
Social media
Interested in hiring Ashwani?
You can contact Ashwani and 90k+ other talented remote workers on Himalayas.
Message AshwaniFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
