Skip to main content
Shyam PatidarSP
Open to opportunities

Shyam Patidar

@shyampatidar

Module Lead Data Engineer scaling Azure lakehouse and GenAI with measurable impact.

India
Message

What I'm looking for

I’m looking to lead end-to-end Azure lakehouse engineering—strong governance, streaming reliability, and production ML/GenAI. I want a role where I can design scalable architectures, deliver measurable performance gains, and mentor teams.

I’m a Module Lead Data Engineer with 6.5+ years of end-to-end experience architecting and scaling enterprise Lakehouse platforms on Azure Databricks. I progressed from Data Engineer to Lead within 2.5 years, leading 8+ member cross-functional teams across architecture, pipeline delivery, governance, and production ML/GenAI integration.

In my current role (GFIP), I built an enterprise Lakehouse using Medallion Architecture (Bronze/Silver/Gold), unifying IoT telemetry and logistics feeds to cut reporting latency from 24 hours to 1 hour. I delivered a 40% query performance gain through Spark tuning (Z-Ordering, broadcast joins, AQE), built DLT ELT pipelines with automated quality checks and SCD Type 2, and reduced pipeline maintenance by 30%.

I’m deeply hands-on in streaming and governance—event-driven ingestion with Azure Event Grid and Databricks Autoloader, plus enterprise governance via Unity Catalog (RBAC, PII masking, row-level security, audit logging). I also deploy production ML/GenAI solutions: XGBoost anomaly detection (85% accuracy) and RAG using Azure OpenAI GPT-4 on governed data with compliance controls, achieving 3–4% operational savings and reducing analyst investigation time by 30%.

Experience

Work history, roles, and key accomplishments

IT
Current

Module Lead Data Engineer

Impetus Technologies

Jul 2022 - Present (4 years)

Architected and scaled an enterprise Azure Databricks lakehouse using Medallion Architecture, enforcing governance with Unity Catalog and improving reporting latency from 24h to 1h. Led an 8-member team delivering Spark performance gains, compute cost reduction, production anomaly detection, and a governed RAG GenAI assistant using Azure OpenAI.

IT

Data Engineer

Impetus Technologies

Jan 2020 - Jun 2022 (2 years 5 months)

Built real-time ingestion pipelines using Apache Kafka and Spark Structured Streaming for large-scale user activity analytics. Developed metadata-driven ELT workflows with Azure Data Factory and Kafka Connect, optimized streaming ingestion with Databricks Autoloader and Spark tuning, and implemented monitoring/alerting to maintain 99.9% pipeline uptime.

Education

Degrees, certifications, and relevant coursework

RR

Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV)

Bachelor of Engineering, Computer Science & Engineering

2015 - 2019

Bachelor of Engineering in Computer Science & Engineering at Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV) from 2015 to 2019.

Get matched with your dream remote job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan