Sai Ruthvik
@sairuthvik
Production ML engineer building end-to-end RAG and edge AI systems with measurable latency and faithfulness gains.
What I'm looking for
I’m a production ML engineer with 2+ years of experience owning end-to-end AI systems, from RAG inference at around 800ms p95 to TensorRT-optimized edge deployment with a 60% latency reduction. I’m driven by measurable outcomes: raising LLM answer faithfulness to 91%, pushing CV accuracy to 95%, and fixing production issues end-to-end.
Currently, I’m a Founding Data Scientist at Visionary (10,000+ students), where I designed and owned an end-to-end RAG platform for K-12, JEE, NEET, and Olympiad prep. I built a curriculum knowledge graph enabling concept-level retrieval, then stabilized p95 latency by addressing cross-encoder reranking bottlenecks with top-k pruning, async retrieval, and query caching.
When newly ingested curriculum PDFs introduced partially ungrounded answers, I traced the regression to noisy OCR chunks and implemented chunk validation, metadata filters, and retrieval regression tests. I also raised answer faithfulness from 75% → 91% using grounding checks, citation validation, and LangChain/LlamaIndex context-window controls, while running nightly RAG evaluation to catch silent regressions before user impact.
Previously, at Cyepro Solutions, I owned an AI lead acquisition pipeline integrating Meta Ads API and CRM, improved sales conversion by 22% with XGBoost lead scoring, and reduced incident impact using a Neo4j-based blast radius estimator. Before that, I shipped edge-first computer vision and monitoring systems at Livestockify: I boosted YOLOv11 accuracy from 82% → 95%, cut inference latency by 60% with TensorRT optimization, and used multi-signal confirmation to reduce false alerts.
Experience
Work history, roles, and key accomplishments
Founding Data Scientist
Visionary
Mar 2026 - Present (3 months)
Designed and owned an end-to-end RAG platform for K-12 and test prep, serving 10,000+ active student sessions. Improved faithfulness to 91% and stabilized p95 RAG latency around 800ms by adding grounding/citation validation, retrieval fixes, and optimized inference deployment on GCP.
Technical Lead (AI/ML)
Infin AI Club, IIT Madras
Jan 2024 - May 2026 (2 years 4 months)
Led an 8-person AI/ML team delivering 5+ production projects, establishing engineering standards and sprint-based delivery roadmaps. Mentored 15+ members on ML fundamentals and RAG production practices, driving 40% membership growth and improving project completion by 25%.
AI Engineer
Cyepro Solutions
Dec 2025 - Mar 2026 (3 months)
Owned an end-to-end AI lead acquisition pipeline integrating Meta Ads API and CRM, automating processing for 1,000+ leads/day. Improved conversion rate by 22% using XGBoost lead scoring and reduced model/debugging time by 50% with MLflow.
Machine Learning Engineer
Livestockify
Aug 2024 - Nov 2025 (1 year 3 months)
Debugged and improved YOLOv11 performance in real farm conditions by expanding augmentation and retraining on 8,000+ images, raising accuracy from 82% to 95%. Reduced edge inference latency by 60% using TensorRT on Raspberry Pi and improved alerting/monitoring reliability (manual inspection time down 40%).
Education
Degrees, certifications, and relevant coursework
Indian Institute of Technology Madras
Bachelor of Science, Data Science and Applications
2023 - 2026
Grade: CGPA: 7.00/10
B.Sc. in Data Science and Applications at IIT Madras from 2023 to 2026. CGPA: 7.00/10.
Marri Laxman Reddy Institute of Technology
Bachelor of Technology, Computer Science and Engineering
2022 - 2026
Grade: CGPA: 8.49/10
B.Tech in Computer Science and Engineering at Marri Laxman Reddy Institute of Technology from 2022 to 2026. CGPA: 8.49/10.
Tech stack
Software and tools used professionally
Amazon EC2
Google Cloud Platform
Amazon S3
GitHub
Kubernetes
Cloudflare
GitHub Actions
Jupyter
MySQL
PostgreSQL
MongoDB
Gmail
Node.js
Neo4j
Slack
OpenCV
Redis
Jira
React
JavaScript
Python
Go
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
Kafka
RabbitMQ
FastAPI
Linux
Confluence
Elasticsearch
TypeScript
Docker
NGINX
Root Cause
Amazon Web Services (AWS)
SQL
XGBoost
Hugging Face
Temporal
Qdrant
LangChain
LlamaIndex
ChromaDB
Ragas
pgvector
Radius
LangFlow
Faiss
LangGraph
LangSmith
Increase
Farm
Remote
Jan
