HimalayasHimalayas logo
Shubham SinghSS
Looking for a job

Shubham Singh

@shubhamsingh77

Data Engineer with 3+ years building scalable GCP pipelines and AI-powered analytics platforms.

India
Message

What I'm looking for

I’m looking for a role where I can build scalable GCP data pipelines and warehouse solutions, ship AI-powered analytics with LLM/Vertex AI, and strengthen data governance—while improving reliability, discoverability, and cost efficiency with modern tools like dbt and Airflow.

I’m a Data Engineer with 3+ years of experience designing and optimizing scalable data pipelines and cloud-based data platforms on Google Cloud Platform (GCP). I focus on ETL/ELT development, data modeling, and BigQuery-based data warehousing to deliver reliable, high-performance analytics.

At General Mills, I built an AI-powered metadata intelligence platform that integrates dbt lineage extraction, LLM-generated metadata, and BigQuery—enabling searchable analytics documentation and conversational dataset discovery with a 40% improvement in discoverability. I also created a Vertex AI and Glean-powered conversational analytics agent that reduced query time by 30% and enabled 50+ analysts to self-serve analytics.

I modernized legacy data workflows by re-architecting 50+ ETL pipelines into modular Airflow–BigQuery workflows, optimizing data throughput by 40% and reducing pipeline failures by 25%. I led dbt implementation across 160+ data models, automated data quality testing with dbt-expectations, and increased pipeline reliability by 35%.

I’m equally driven by governance and cost efficiency: I’ve strengthened dataset documentation and compliance through data governance work, improved data quality via lineage and metadata management, and reduced GCP costs by 15%. I also migrated SAP S/4HANA datasets, improving reconciliation accuracy by 20% and reducing migration time by 35%.

Experience

Work history, roles, and key accomplishments

General Mills logoGM
Current

Associate Data Engineer

Jan 2023 - Present (3 years 4 months)

Built an AI-powered metadata intelligence platform integrating dbt lineage extraction, Claude-generated metadata, and BigQuery, improving dataset discoverability by 40%. Re-architected 50+ legacy ETL pipelines into modular Airflow–BigQuery workflows, increasing data throughput by 40%, reducing pipeline failures by 25%, and enabling 50+ analysts to self-serve analytics with a Vertex AI conversation

Education

Degrees, certifications, and relevant coursework

Vivekanand Education Society's Institute of Technology (VESIT) logoVV

Vivekanand Education Society's Institute of Technology (VESIT)

Master of Computer Applications (MCA), Computer Applications

Grade: CGPI: 9.07

Completed MCA at Vivekanand Education Society's Institute of Technology (University of Mumbai) in 2023 with a CGPI of 9.07.

TC

Thakur Ramnarayan College of Arts & Commerce

Bachelor of Science (B.Sc. Computer Science), Computer Science

Grade: CGPI: 9.65

Completed a B.Sc. in Computer Science at Thakur Ramnarayan College of Arts & Commerce (University of Mumbai) in 2021 with a CGPI of 9.65.

Tech stack

Software and tools used professionally

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan