Skip to main content
Roro ZoroRZ
Open to opportunities

Roro Zoro

@rorozoro

I’m a Data Engineer optimizing PySpark/Databricks pipelines for reliable, scalable banking data.

India
Message

What I'm looking for

I’m looking to build and modernize end-to-end data pipelines with PySpark/Databricks and AWS. I want to own Spark performance optimization, reliable orchestration, and strong monitoring so data platforms stay dependable at scale.

I’m a Data Engineer with 4 years of experience designing and optimizing scalable data pipelines using PySpark, Databricks, AWS, and the Hadoop ecosystem. I focus on ETL modernization, distributed data processing, and building cloud-native data engineering solutions that are dependable in production.

I’ve automated migration of legacy SQL workloads into reusable PySpark frameworks and built Delta Lake-based architectures. At Barclays, I developed and optimized PySpark pipelines for enterprise banking control frameworks, implementing validation controls, reusable ETL components, and efficient Spark partition strategies.

I also prioritize operational excellence: I’ve implemented centralized logging and monitoring with AWS CloudWatch, orchestrated end-to-end ETL with AWS Lambda/Glue/Step Functions, and strengthened reliability through resilience testing (AWS Fault Injection Simulator). From Capgemini to Barclays, I’ve supported production deployment and cutover activities with zero critical production incidents during release cycles.

Experience

Work history, roles, and key accomplishments

Barclays logoBA
Current

Data Engineer

Barclays

Sep 2025 - Present (10 months)

Developed scalable PySpark pipelines for enterprise banking control frameworks, including automated migration of legacy SQL workflows into standardized PySpark frameworks. Built reusable ETL components and AWS-native orchestration with Lambda/Glue/Step Functions, and improved monitoring and reliability using CloudWatch and AWS Fault Injection Simulator.

CA

Data Engineer

Dec 2022 - Sep 2025 (2 years 9 months)

Developed and optimized PySpark-based ETL pipelines on AWS EMR and the Hadoop ecosystem for large-scale distributed data processing. Implemented Airflow scheduling and AWS Glue/S3 data lake architecture, improved performance with Spark and Hive tuning, and automated Databricks deployments with CI/CD using GitHub Actions and Jenkins.

Education

Degrees, certifications, and relevant coursework

Savitribai Phule Pune University logoSU

Savitribai Phule Pune University

Bachelor of Engineering, Electronics and Telecommunication

2019 - 2022

Bachelor of Engineering in Electronics and Telecommunication from Savitribai Phule Pune University (2019–2022).

Get matched with your dream remote job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan