Yash Rupani
@yashrupani
Data engineer focused on scalable ETL and real-time architectures, building AI-driven RAG systems and cutting latency costs.
What I'm looking for
I’m a Master’s graduate with 2+ years of enterprise experience building scalable ETL pipelines and real-time data architectures across AWS, GCP, and Microsoft Fabric. I focus on transforming complex data into reliable, low-latency systems—especially with Apache Spark, Kafka, and AI-driven data workflows like RAG and vector search.
In my recent roles, I engineered end-to-end autonomous ETL for unstructured multimedia data, eliminating 40% of manual prep and improving retrieval latency by 25%. At Infosys, I modernized enterprise data quality pipelines, automating 10+ hours/week of manual effort and cutting Spark ETL runtime by 40%, while building dashboards from semi-structured logs to enable automated monitoring.
Experience
Work history, roles, and key accomplishments
Research Assistant (Data)
Oregon State University
Oct 2025 - Dec 2025 (2 months)
Engineered an end-to-end autonomous ETL pipeline for unstructured multimedia data, reducing manual data preparation by 40% for agentic AI workflows. Improved AI agent retrieval by reducing retrieval latency 25% and ensured 100% data integrity with validation frameworks for terabyte-scale ingestion.
AI Agent RAG Intern
GrantAide
Jul 2025 - Sep 2025 (2 months)
Architected scalable backend systems with Flask and Google Firestore, restructuring data access to cut API latency by 30% during peak usage. Built FAISS-based vector search pipelines to improve semantic retrieval relevance by 40% versus keyword matching and managed multi-cloud deployments achieving 99.9% uptime.
Modernized enterprise data quality pipelines by automating workflows and eliminating 10+ hours per week of manual intervention. Designed high-performance Apache Spark ETL processing large-scale monitoring data, reducing runtime by 40% per cycle, and built automated dashboards from semi-structured logs.
Education
Degrees, certifications, and relevant coursework
Oregon State University
Master of Engineering, Computer Science
2023 - 2025
Grade: 3.87
Activities and societies:
Master of Engineering in Computer Science at Oregon State University from Sep 2023 to Dec 2025.
Pandit Deendayal Energy University
Bachelor of Technology, Electrical Engineering
2017 - 2021
Grade: 3.05
Activities and societies:
Bachelor of Technology in Electrical Engineering at PDEU from Aug 2017 to Jun 2021.
Availability
Location
Authorized to work in
Portfolio
rupaniyash.github.io/portfolioSalary expectations
Social media
Job categories
Skills
Interested in hiring Yash?
You can contact Yash and 90k+ other talented remote workers on Himalayas.
Message YashFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
