Open to opportunities

Surabhi Pandey

@surabhipandey

Data Engineer with 10+ years of experience in data platforms.

Malaysia

What I'm looking for

I am looking for a role that fosters innovation and collaboration, where I can leverage my data engineering expertise to drive impactful solutions and contribute to a data-driven culture.

I am a Data Engineering Lead with over 10 years of experience in building real-time and batch data platforms. My expertise lies in modernizing data pipelines using AWS and GCP, and I have a strong foundation in data modeling and big data analytics. Currently, I lead a team of three data engineers at Mindvalley, where we deliver cost-efficient and reliable data solutions that enhance data accessibility and platform scalability.

Throughout my career, I have spearheaded initiatives that significantly improved data quality and accessibility. For instance, I architected a master data management layer on GCP that created a unified source of truth, reducing data retrieval times by 50%. I have also migrated legacy SQL pipelines to modern workflows, establishing robust data quality checks that resulted in a 25% improvement in overall data quality. My passion for data governance and compliance drives me to ensure that our data solutions adhere to regional standards while providing actionable insights for stakeholders.

Experience

Work history, roles, and key accomplishments

Current

Data Engineering Lead

Mindvalley

Jan 2022 - Present (3 years 7 months)

Spearheaded a cross-functional team to architect and deploy a master data management layer on GCP using BigQuery, creating a unified single source of truth in the data warehouse. Migrated legacy SQL pipelines to modern dbt and Airflow workflows, establishing robust data quality checks, data lineage tracking, and comprehensive data cataloging.

Senior Data Engineer

Kinesso

Jan 2021 - Present (4 years 7 months)

Collaborated with the advertising team to identify and address data quality issues, implementing the Great Expectations suite for data quality checks and reporting. Built a UI-based snowflake schema generator tool using Streamlit and Snowflake to serve ad hoc analysis requests from analysts and data scientists.

Data Migration Developer

CRM Solutions

Jan 2018 - Present (7 years 7 months)

Developed the data migration and deployment strategy for a major transformation project covering a customer base of 12 million, assessing the impact on multiple third-party systems. Led legacy data cleansing, running integrity checks on legacy data and working with stakeholders to mitigate risks and drive data cleanup decisions.

Senior Migration Analyst

Amdocs

Jan 2014 - Present (11 years 7 months)

Contributed to multiple BSS and D2H data migration projects around the globe, building deployment strategies and assessing their impact. Developed and automated integrity checks, data discovery, reconciliation checks, and reporting during and after migration.

Senior System Analyst

Infosys

Jan 2011 - Present (14 years 7 months)

Led and supported training operations and managed batches of 350 interns as part of the Education and Research team. Developed and automated a data synchronization pipeline using GoldenGate, enabling real-time synchronization between multiple address servers.

Education

Degrees, certifications, and relevant coursework

Amrita University

Bachelor of Computer Science, Computer Science

2007 - 2011

Studied Computer Science at Amrita University in Kerala, India. Completed the program in 2011, gaining foundational knowledge in the field.
