Skip to main content
Aarsh UserAU
Looking for a job

Aarsh User

@aarshuser

Data Engineer focused on scalable ETL/ELT pipelines, real-time streaming, and optimized data warehouses.

India
Message

What I'm looking for

I’m looking for a role where I can own end-to-end ETL/ELT and real-time data streaming, optimize pipelines for low-latency analytics, and build reliable warehouse models using Python, SQL, Spark, and cloud platforms—collaborating closely with stakeholders.

I’m a Data Engineer specializing in end-to-end ETL/ELT pipeline development and large-scale pipeline optimization across AWS and Azure. I translate business requirements into robust data solutions that keep downstream analytics fast and reliable.

At Innovaccer, I designed and optimized Python and PySpark ETL/ELT pipelines that load client EHR and claims data into a Databricks-backed Snowflake warehouse, processing 50M+ records weekly at multi-terabyte scale. I engineered SQL-based transformation layers in PostgreSQL and Snowflake—using indexing, partitioning, materialized views, and stored procedures—to reduce query latency for BI consumers.

I also built Change Data Capture (CDC) pipelines using AWS DMS for real-time incremental ingestion, eliminating full-load dependencies to ensure low-latency data availability. Performance improvements have been measurable, including reducing execution time from 9 hours to 2 hours for 30M records and cutting overall processing time by 30% through multithreading, parallel processing, and Spark partition tuning.

To improve data trust and operational efficiency, I authored Python-based validation and alerting frameworks that reduced manual QA effort by 40%, and I built AI-assisted automation tools and agentic workflows that converted multi-step validation into one-click executions—improving efficiency by 70%. My projects reinforce this focus on streaming reliability, fault-tolerant storage design, and analytics-ready modeling.

Experience

Work history, roles, and key accomplishments

Innovaccer Analytics Pvt. Ltd. logoIL

Data Analyst - Data Engineering

Jan 2025 - May 2026 (1 year 4 months)

Designed and optimized scalable ETL/ELT pipelines using Python and PySpark to load EHR/claims and other sources into a Databricks-backed Snowflake warehouse, processing 50M+ records weekly at multi-terabyte scale. Reduced pipeline execution time from 9 hours to 2 hours for 30M records and cut overall processing time by 30%, while lowering manual QA effort by 40% and improving operational efficienc

Education

Degrees, certifications, and relevant coursework

LNM Institute of Information Technology logoLT

LNM Institute of Information Technology

Bachelor of Technology, Computer Science and Engineering

2021 - 2025

Earned a B.Tech in Computer Science and Engineering at LNM Institute of Information Technology in Jaipur (2021–2025).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan