Skip to main content
Rahul SahuRS
Open to opportunities

Rahul Sahu

@rahulsahu2

Senior Data Engineer focused on cloud data pipelines, Snowflake, and real-time analytics optimization.

India
Message

What I'm looking for

I’m looking for a growth-oriented team where I can architect scalable cloud data pipelines, optimize Snowflake/data warehouse performance, and deliver reliable real-time analytics with strong data governance—turning data into decisions with measurable impact.

I’m a Data Engineer with 6+ years of experience building and optimizing large-scale data pipelines across retail, finance, and education. I bring deep expertise in cloud data engineering and data warehousing to help teams turn complex datasets into reliable business insights.

At TripAdvisor, I architected and optimized Snowflake pipelines that ingest and transform diverse travel datasets, processing over 500GB of daily incremental updates. I engineered a unified 360-degree traveler view by merging on-platform interactions and off-platform bookings, improving personalized travel recommendations by 15%.

I also focused on real-time impact: I leveraged Snowflake Streams and Task for CDC and used Snowpark for complex Python-based transformations, reducing end-to-end data latency from 4 hours to under 30 minutes. To keep analytics trustworthy, I implemented data governance and quality frameworks (99.9% data accuracy) and optimized warehouse performance and cost, cutting monthly Snowflake credits by 20%.

Earlier, I delivered measurable outcomes at Credit Saison, reducing Athena query scan costs by 97% through S3 partitioning and Glue metadata management, and building transformation jobs with PySpark for financial datasets. At Embibe, I developed batch and streaming pipelines using Spark and Kafka and built real-time ranking with Kafka, Spark, and Redis—grounding my engineering style in performance, correctness, and practical delivery.

Experience

Work history, roles, and key accomplishments

Tripadvisor logoTR

Data Engineer 2

Feb 2025 - Dec 2025 (10 months)

Architected and optimized Snowflake pipelines ingesting and transforming diverse travel datasets with 500GB/day incremental updates, enabling a 360-degree traveler view that improved personalized travel recommendations by 15%. Reduced real-time CDC latency from 4 hours to under 30 minutes and improved warehouse cost/performance with 20% lower monthly Snowflake credits.

Tripadvisor logoTR

Data Engineer 2

May 2024 - Jan 2025 (8 months)

Built and managed Whampipe-orchestrated ETL/ELT workflows using advanced SQL to automate travel data movement across multi-cloud environments. Tuned query execution and partitioning to cut peak-season ETL processing time by 25% and added automated SQL validations to detect schema drift and data anomalies.

BI

Data Engineer 2

Blackbuck Insights

Feb 2022 - May 2024 (2 years 3 months)

Developed ingestion pipelines to load SFTP and GCS files into BigQuery for centralized processing, supporting batch analytics and CDP data structures. Built Airflow (Composer) batch ETL for CSV/JSON/Parquet into BigQuery and used SparkSQL/PySpark with Dataproc to explore datasets for ad performance and segmentation.

Education

Degrees, certifications, and relevant coursework

AC

ABES Engineering College

Bachelor of Technology, Computer Science

2019 -

Completed a B.Tech in Computer Science at ABES Engineering College in 2019.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan