Shyam Patidar
@shyampatidar
Module Lead Data Engineer scaling Azure lakehouse and GenAI with measurable impact.
What I'm looking for
I’m a Module Lead Data Engineer with 6.5+ years of end-to-end experience architecting and scaling enterprise Lakehouse platforms on Azure Databricks. I progressed from Data Engineer to Lead within 2.5 years, leading 8+ member cross-functional teams across architecture, pipeline delivery, governance, and production ML/GenAI integration.
In my current role (GFIP), I built an enterprise Lakehouse using Medallion Architecture (Bronze/Silver/Gold), unifying IoT telemetry and logistics feeds to cut reporting latency from 24 hours to 1 hour. I delivered a 40% query performance gain through Spark tuning (Z-Ordering, broadcast joins, Adaptive Query Execution), built Delta Live Tables (DLT) ELT pipelines with automated quality checks and SCD Type 2, and reduced pipeline maintenance by 30%.
I’m deeply hands-on in streaming and governance: event-driven ingestion with Azure Event Grid and Databricks Auto Loader, plus enterprise governance via Unity Catalog (RBAC, PII masking, row-level security, audit logging). I also deploy production ML/GenAI solutions, including XGBoost anomaly detection (85% accuracy) and RAG using Azure OpenAI GPT-4 on governed data with compliance controls, achieving 3–4% operational savings and reducing analyst investigation time by 30%.
Experience
Work history, roles, and key accomplishments
Module Lead Data Engineer
Impetus Technologies
Jul 2022 - Present (4 years)
Architected and scaled an enterprise Azure Databricks lakehouse using Medallion Architecture, enforcing governance with Unity Catalog and improving reporting latency from 24h to 1h. Led an 8-member team delivering Spark performance gains, compute cost reduction, production anomaly detection, and a governed RAG GenAI assistant using Azure OpenAI.
Data Engineer
Impetus Technologies
Jan 2020 - Jun 2022 (2 years 5 months)
Built real-time ingestion pipelines using Apache Kafka and Spark Structured Streaming for large-scale user activity analytics. Developed metadata-driven ELT workflows with Azure Data Factory and Kafka Connect, optimized streaming ingestion with Databricks Auto Loader and Spark tuning, and implemented monitoring and alerting to maintain 99.9% pipeline uptime.
Education
Degrees, certifications, and relevant coursework
Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV)
Bachelor of Engineering, Computer Science & Engineering
2015 - 2019
