Rio Deb
@riodeb
Data Engineer specializing in scalable Databricks analytics pipelines and performance optimization.
What I'm looking for
I’m a Data Engineer focused on designing, building, and optimizing large-scale analytics platforms using Databricks, Apache Spark, and Delta Lake. I deliver high-performance, reliable, and scalable data solutions, especially in enterprise financial analytics environments.
In my current role, I manage enterprise analytics pipelines spanning ~1,500 tables and 60+ TB of structured data, including datasets of 500+ GB with 300B+ rows. I build standardized Bronze-to-Silver ETL frameworks with incremental ingestion, overwrite/merge workflows, and SCD Type 1/2 implementations, and I use Delta Live Tables for the Silver and Gold layers.
I also prioritize automation and maintainability: I created a reusable metadata-driven transformation framework that dynamically generates Databricks SQL and PySpark logic, reducing manual query writing and accelerating onboarding. I enabled automated schema evolution across pipelines and standardized modular job templates so large teams can build consistently.
Performance and data quality are core to how I work. I optimize Spark workloads through shuffle reduction, join/aggregation re-architecture, partition pruning, predicate pushdown, and deep analysis with the Spark UI and execution plans, which improves stability, runtime, and cost efficiency. I back this with automated validation frameworks across the Bronze, Silver, and Gold layers, plus Terraform-based Databricks job automation and parameterized, reusable workflows.
Experience
Work history, roles, and key accomplishments
Data Engineer
Eucloid Data Solutions
Jun 2025 - Present (1 year)
Designed and operated enterprise Databricks analytics pipelines at massive scale, managing ~1,500 tables and processing 60+ TB of structured data, including 500+ GB tables with 300B+ rows. Built metadata-driven Bronze-to-Silver ETL frameworks with SCD Type 1/2 and production Delta Live Tables (DLT) pipelines, enabling automated orchestration, schema evolution, data validation, and Spark performanc
Education
Degrees, certifications, and relevant coursework
G.B. Pant DSEU
Bachelor of Technology, Computer Science and Engineering
Grade: CGPA 8.3/10
Pursuing a Bachelor of Technology in Computer Science and Engineering at G.B. Pant DSEU, expected to graduate June 2026.
IIT Madras
Bachelor of Science, Programming and Data Science
Pursuing a Bachelor of Science in Programming and Data Science at IIT Madras, expected to graduate June 2026.
