Apurv Golatgaonkar
@apurvgolatgaonkar
Data Engineer with 4+ years building scalable Azure/GCP ETL pipelines for reliable analytics.
What I'm looking for
I’m a Data Engineer with 4+ years of experience building scalable ETL pipelines and data workflows across Azure and GCP. I focus on reliable ingestion, transformation, validation, migration, and automation, with experience supporting Finance and E-Commerce data needs.
At Equifax, I transformed legacy credit bureau data into Azure cloud using Azure Data Factory and Databricks. I designed end-to-end ETL pipelines to ingest and transform source data into ADLS Gen2 in Delta and Parquet formats for analytics, and I optimized SQL and PySpark logic to improve performance and reduce execution time by 50%.
I also drive data quality by running EDA and validation checks to identify inconsistencies and gaps. From there, I monitor, troubleshoot, and debug ETL pipelines and batch jobs—minimizing downtime and reducing job failure rates by 30%—while collaborating with cross-functional teams to gather requirements and improve delivery efficiency by 20%.
Earlier at Growsoft, I designed and maintained ETL pipelines for high-volume e-commerce data into a centralized analytics warehouse. I built SSIS packages for extraction and migration (improving ETL efficiency by 25%), automated cleaning/validation/transformation with Python and PySpark (reducing manual effort and processing time by 40%), and integrated multiple sources into Google BigQuery.
Experience
Work history, roles, and key accomplishments
Associate Data Engineer
Equifax Credit Information Services Pvt. Ltd.
Jan 2024 - Present (2 years 5 months)
Designed and built end-to-end ETL pipelines in Azure Data Factory integrated with Databricks to ingest and transform credit bureau data into ADLS Gen2 (Delta/Parquet). Optimized SQL and PySpark processing to reduce execution time by 50% and improved data reliability while lowering ETL downtime and job failures by 30%.
Junior Data Engineer
Growsoft India Pvt. Ltd.
Jan 2022 - Jan 2024 (2 years)
Built and maintained scalable ETL pipelines to process high-volume e-commerce data (orders, customers, products, transactions) into a centralized analytics warehouse in BigQuery. Developed SSIS packages and Python/PySpark automation to boost ETL efficiency by 25% and reduce manual effort and processing time by 40%.
Education
Degrees, certifications, and relevant coursework
Dr. Babasaheb Ambedkar Marathwada University
Bachelor of Science, Computer Science
Earned a B.Sc. in Computer Science from Dr. Babasaheb Ambedkar Marathwada University in 2021.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Apurv?
You can contact Apurv and 90k+ other talented remote workers on Himalayas.
Message ApurvFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
