Skip to main content
HimalayasHimalayas logo
RG
Looking for a job

Raj Gandhi

@rajgandhi

Data Engineer focused on scalable batch and low-latency real-time pipelines.

India
Message

What I'm looking for

I’m looking to build and operate reliable, scalable data platforms—especially low-latency real-time and large-scale ETL—where I can own end-to-end pipelines, improve performance, and ensure compliance with strong monitoring and data validation.

I’m a Data Engineer with 4 years of experience designing and operating scalable batch and real-time data pipelines. I build solutions end-to-end—modeling, enrichment, and reliable data delivery—so analytics teams can trust their numbers.

In my current role, I designed and implemented a low-latency real-time sessionization system using Redis, Lua scripts, and Java, achieving ~300 ms P99 latency. I also provisioned and operated infrastructure with CloudFormation, ensured consistency through atomic Redis updates, and validated correctness using data parity checks and monitoring/alarms.

I’ve led data privacy and compliance work by removing sensitive data from datasets owned across multiple teams. I collaborated with privacy partners to analyze sources and deliver a centralized sanitized dataset for downstream consumption, ensuring policy compliance at scale.

Earlier, I built large-scale Spark ETL pipelines and optimized terabyte-scale processing by tuning joins and shuffle/memory behavior to improve performance and reliability. I also migrated data from GCP to AWS using PySpark (200–300 GB/day) and automated workflows with Python, AWS Lambda, and AWS Glue—plus delivered stakeholder-facing QuickSight KPI dashboards and a Streamlit reporting web app.

Experience

Work history, roles, and key accomplishments

BookMyShow logoBO

Data Engineer I

BookMyShow

Jun 2022 - Dec 2024 (2 years 6 months)

Built a PySpark pipeline to migrate 200–300 GB/day from GCP to AWS, replacing the existing workflow and saving ~$7,300/year while improving analytics availability. Migrated SQL/NoSQL to Amazon Redshift with a deduplication hash ensuring 100% data integrity, and created QuickSight dashboards plus automation using Python, AWS Lambda, AWS Glue, and a Streamlit reporting app.

Education

Degrees, certifications, and relevant coursework

University of Mumbai logoUM

University of Mumbai

Bachelor of Engineering, Computer Science

2018 - 2022

Activities and societies: Award: Spot Award for the OND 2023 Quarter. Certificates: dbt Fundamentals, Databricks Lakehouse Fundamentals, Snowflake Data Engineering, and Data Warehousing workshops.

Completed a Bachelor of Engineering in Computer Science at the University of Mumbai from 2018 to 2022.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan