Open to opportunities

Harsh Kadam

@harshkadam1

I’m a junior data engineer building scalable ETL pipelines on Azure and Databricks.

India

What I'm looking for

I’m looking to build and optimize scalable ETL pipelines and data platforms in the cloud (Azure/GCP) using Spark/Databricks and Delta Lake. I want ownership of data quality, governance (Unity Catalog), and performance tuning to deliver reliable, near real-time insights.

I’m a Data Engineer with more than 3 years of experience designing, developing, and maintaining robust ETL pipelines. I focus on optimizing data workflows for seamless data integration across platforms and on delivering high-performance solutions.

At LTIMindtree, I designed end-to-end ETL pipelines in Azure Data Factory that ingest data from CSV files, SQL Server, and REST APIs into ADLS Gen2 and Azure Databricks. I built dynamic, reusable ADF pipelines, parameterized for multi-environment support, and developed scalable PySpark transformations following the Medallion Architecture (Bronze, Silver, Gold).

I also created and managed Delta Lake tables with ACID transactions, schema enforcement and evolution, and time travel. To track both historical and current state, I implemented SCD Type 1 and Type 2 logic using Delta Lake MERGE operations, and I used Unity Catalog for centralized governance, access control, and lineage tracking.

I tune and automate workloads for performance and reliability: optimizing Spark jobs (partition pruning, broadcast joins, Adaptive Query Execution, caching), applying Delta Lake optimizations (OPTIMIZE with ZORDER, VACUUM), and scheduling notebook and JAR-based jobs with retry policies. I’m also committed to data security and compliance through RBAC, encryption, and Unity Catalog policies.

Experience

Work history, roles, and key accomplishments

Current

Junior Data Engineer

LTIMindtree

Nov 2022 - Present (3 years 7 months)

Designed and implemented end-to-end ETL pipelines in Azure Data Factory to ingest data from CSV files, SQL Server, and REST APIs into ADLS Gen2 and Azure Databricks. Built reusable parameterized workflows, PySpark transformations following the Medallion Architecture (Bronze/Silver/Gold), and Delta Lake tables governed through Unity Catalog, and tuned performance for efficient batch and streaming processing.

Education

Degrees, certifications, and relevant coursework

Shri Vithal Education and Research Institute

Bachelor of Technology

2019 - 2022

Grade: CGPA 8.04

Earned a Bachelor of Technology from Shri Vithal Education and Research Institute (2019–2022) with CGPA 8.04.
