Yash Rupani
@yashrupani
Data engineer focused on scalable ETL and real-time architectures, building AI-driven RAG systems and cutting latency costs.
What I'm looking for
I’m a Master’s graduate with 2+ years of enterprise experience building scalable ETL pipelines and real-time data architectures across AWS, GCP, and Microsoft Fabric. I focus on transforming complex data into reliable, low-latency systems, especially with Apache Spark, Kafka, and AI-driven data workflows such as RAG and vector search.
In my recent roles, I engineered an end-to-end autonomous ETL pipeline for unstructured multimedia data, eliminating 40% of manual prep and reducing retrieval latency by 25%. At Infosys, I modernized enterprise data quality pipelines, automating 10+ hours/week of manual effort and cutting Spark ETL runtime by 40%, while building dashboards from semi-structured logs to enable automated monitoring.
Experience
Work history, roles, and key accomplishments
Research Assistant (Data)
Oregon State University
Oct 2025 - Dec 2025 (2 months)
Engineered an end-to-end autonomous ETL pipeline for unstructured multimedia data, reducing manual data preparation by 40% for agentic AI workflows. Reduced AI agent retrieval latency by 25% and ensured 100% data integrity with validation frameworks for terabyte-scale ingestion.
AI Agent RAG Intern
GrantAide
Jul 2025 - Sep 2025 (2 months)
Architected scalable backend systems with Flask and Google Firestore, restructuring data access to cut API latency by 30% during peak usage. Built FAISS-based vector search pipelines to improve semantic retrieval relevance by 40% versus keyword matching and managed multi-cloud deployments achieving 99.9% uptime.
At Infosys, modernized enterprise data quality pipelines by automating workflows and eliminating 10+ hours per week of manual intervention. Designed a high-performance Apache Spark ETL job for large-scale monitoring data, reducing runtime by 40% per cycle, and built automated dashboards from semi-structured logs.
Education
Degrees, certifications, and relevant coursework
Oregon State University
Master of Engineering, Computer Science
2023 - 2025
Grade: 3.87
Activities and societies:
Master of Engineering in Computer Science at Oregon State University from Sep 2023 to Dec 2025.
Pandit Deendayal Energy University
Bachelor of Technology, Electrical Engineering
2017 - 2021
Grade: 3.05
Activities and societies:
Bachelor of Technology in Electrical Engineering at PDEU from Aug 2017 to Jun 2021.
Portfolio
rupaniyash.github.io/portfolio
