I’m looking for a senior data engineering role where I can build reliable data platforms and drive operational excellence, cost and performance wins, and mentor teams—partnering closely with analytics stakeholders to turn data into faster decisions.
Purbarag Pathak Choudhury
@purbaragchoudhury
Senior Data Engineer (AdTech) @ HelloFresh | PySpark • Snowflake • Databricks • dbt• Airflow • IaC
What I'm looking for
With over seven years of experience in data engineering, I specialize in building and optimizing cloud-based data pipelines. As a Senior Data Engineer at HelloFresh, I focus on leveraging tools like PySpark, dbt, Airflow, and Snowflake to enable data-driven decision-making while supporting teams in Marketing, Reporting, and Data Science. My approach emphasizes collaboration, quality assurance, and aligning technical solutions with business goals.
By driving the creation of robust data workflows and optimizations through technologies like DeltaLake, our team has streamlined data operations. I am committed to delivering efficient, reliable, and scalable data solutions while fostering clear communication with stakeholders to ensure timely project delivery.
Currently based in the Berlin, Germany and actively open to fully remote opportunities.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
HelloFresh
Jul 2025 - Present (11 months)
Led centralization of programmatic ad data across 6 platforms into a single source of truth, reducing cross-market reporting discrepancies by 80%+.
Built an event-driven marketing conversion pipeline that cut time-to-insight from 1 day to 2 hours and optimized the Snowflake serving layer to reduce load time by 72%.
Data Engineer
HelloFresh
Oct 2022 - Jun 2025 (2 years 8 months)
Built Snowflake + dbt conversion cohort pipelines orchestrated in Airflow with 150 Soda checks to reach 90% SLA compliance, reducing data issues by 30% and Snowflake storage costs by 75%.
Migrated legacy Salesforce CRM CDP pipelines to Databricks DeltaLake, reducing execution time by 80% and delivering €172.6K annual compute cost savings.
Led a 4-person team to deliver on-time PySpark batch pipelines on AWS EMR processing 600 GB/day of clinical behavioral data for risk mitigation models.
Implemented SCD Type 2 change-tracking across 20+ tables to provide full historical audit trails for compliance reporting.
Solutions Engineer
Zaloni Inc.
Aug 2018 - Aug 2021 (3 years)
Executed migration of 12TB SQL Server data to MongoDB using PySpark, Sqoop, and EMR, completing the transition to a NoSQL architecture in under 90 days.
Designed cross-region replication (RDS + Lambda + Kafka + Debezium) achieving <10 seconds average replication lag and 99.9% uptime for critical workloads.
Education
Degrees, certifications, and relevant coursework
Tezpur University
Bachelor of Technology (B.Tech), Computer Science
2009 - 2013
Earned a B.Tech in Computer Science from Tezpur University from 2009 to 2013.
Tech stack
Software and tools used professionally
Availability
Location
Social media
Job categories
Skills
Interested in hiring Purbarag Pathak?
You can contact Purbarag Pathak and 90k+ other talented remote workers on Himalayas.
Message Purbarag PathakFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
