Skip to main content
Harshit AgrawalHA
Open to opportunities

Harshit Agrawal

@harshitagrawal

Big Data Engineer scaling high-performance data platforms on Azure and Databricks for measurable efficiency gains.

India
Message

What I'm looking for

I’m looking for a role where I can build governed, high-performance data platforms on Azure/Databricks—optimizing cost and performance, improving reliability with CI/CD, and partnering with teams to deliver measurable efficiency gains.

I’m a Big Data Engineer with 12 years of experience scaling high-performance data platforms in the Azure and Databricks ecosystems. I focus on turning complex pipelines into governed, reliable systems—collaborating with cross-functional teams to deliver outcomes like a 20% efficiency boost and faster, cleaner decision-making.

Recently, I architected a Platinum KPI Layer that became a governed single source of truth, reducing metric reporting discrepancies by 100% across business units. I’ve also driven major performance and cost improvements (30% compute cost reduction and 8x query performance) through Liquid Clustering, automated operational efficiencies with Predictive Optimization, and standardized deployments with Databricks Asset Bundles for smoother CI/CD. I enjoy mentoring engineers on hard SQL and PySpark logic, and I bring that same execution mindset from prior work like real-time streaming with Kafka and PySpark Structured Streaming, automated monitoring with ADF, and experimentation with retrieval-augmented generation.

Experience

Work history, roles, and key accomplishments

EN
Current

Senior Big Data Engineer

EnableData

Apr 2025 - Present (1 year 2 months)

Architected a Platinum KPI layer as a governed single source of truth, eliminating 100% of metric reporting discrepancies across business units. Optimized cloud compute costs by 30% and improved query performance by 8x using Liquid Clustering, while standardizing deployments with Databricks Asset Bundles to cut environment-related errors by 40%.

CS

Big Data Engineer

Concentrix Software Solutions

Dec 2021 - Aug 2024 (2 years 8 months)

Automated financial data processing on Azure Databricks using PySpark (P&L, sales, purchases, reconciliation) while guaranteeing 100% accuracy. Reduced execution time up to 35% through Spark optimization (partitioning, Z-order, join/aggregation, caching) and implemented real-time cleansing via PySpark Structured Streaming consuming Kafka with data mismatch monitoring via ADF (15% team efficiency l

MA

Senior Data Analyst

Magicpin.in

May 2016 - Jul 2018 (2 years 2 months)

Revamped the reporting framework using MySQL, Python, and BI tools, improving analyst efficiency by 25%. Extracted data from OLTP/CRM/payables sources to build Python cashback optimization models that reduced spillage by 80% and implemented fraud detection, while customizing SuiteCRM to improve sales manager funnel efficiency by 40%.

WI

Data Analyst

Wizikey

May 2013 - Nov 2015 (2 years 6 months)

Led a Python and SQL-driven data cleaning initiative across multiple sources (news websites, Twitter, Google Forms), improving data quality by 40%. Analyzed weekly/monthly/quarterly performance metrics to support targets and enabled marketing optimization using Facebook/Google Ads analytics to save 15% in marketing spend via CPC/CPA calibration.

Education

Degrees, certifications, and relevant coursework

IE

IIMT College of Engineering

Bachelor of Technology (B.Tech.), Electronics and Communication Engineering (ECE)

Earned a B.Tech. in Electronics and Communication Engineering (ECE) from IIMT College of Engineering, affiliated with UPTU.

JV

Jawahar Navodaya Vidyalaya

Higher Secondary (12th) - CBSE

Completed 12th grade under CBSE board at Jawahar Navodaya Vidyalaya.

JV

Jawahar Navodaya Vidyalaya

Secondary (10th) - CBSE

Completed 10th grade under CBSE board at Jawahar Navodaya Vidyalaya.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan