Open to opportunities

Sai Ruthvik

@sairuthvik

Production ML engineer building end-to-end RAG and edge AI systems with measurable latency and faithfulness gains.

India

What I'm looking for

I’m looking for a team where I can own production ML end-to-end—RAG and edge deployment—with clear success metrics. I want to improve reliability (latency, grounding, monitoring), ship fast, and iterate with rigorous evaluation and regression testing.

I’m a Production ML Engineer with 2+ years of experience owning end-to-end AI systems, from RAG inference at around 800ms p95 to TensorRT-optimized edge deployment with a 60% latency reduction. I’m driven by measurable outcomes: I raised LLM faithfulness to 91%, stabilized RAG latency around 800ms p95, pushed CV accuracy to 95%, and fixed production issues end-to-end.

Currently, I’m a Founding Data Scientist at Visionary (10,000+ students), where I designed and owned an end-to-end RAG platform for K-12, JEE, NEET, and Olympiad prep. I built a curriculum knowledge graph enabling concept-level retrieval, then stabilized p95 latency by addressing cross-encoder reranking bottlenecks with top-k pruning, async retrieval, and query caching.

When newly ingested curriculum PDFs introduced partially ungrounded answers, I traced the regression to noisy OCR chunks and implemented chunk validation, metadata filters, and retrieval regression tests. I also raised answer faithfulness from 75% → 91% using grounding checks, citation validation, and LangChain/LlamaIndex context-window controls, while running nightly RAG evaluation to catch silent regressions before user impact.

Previously, at Cyepro Solutions I owned an AI lead acquisition pipeline integrating Meta Ads API and CRM, improved sales conversion by 22% with XGBoost lead scoring, and reduced incident impact using a Neo4j-based blast radius estimator. Before that, I shipped edge-first computer vision and monitoring systems at Livestockify—boosting YOLOv11 accuracy from 82% → 95%, cutting inference latency by 60% with TensorRT optimization, and using multi-signal confirmation to reduce false alerts.

Experience

Work history, roles, and key accomplishments

Current

Founding Data Scientist

Visionary

Mar 2026 - Present (3 months)

Designed and owned an end-to-end RAG platform for K-12 and test prep, serving 10,000+ active student sessions. Improved faithfulness to 91% and stabilized p95 RAG latency around 800ms by adding grounding/citation validation, retrieval fixes, and optimized inference deployment on GCP.

Machine Learning Engineer

Livestockify

Aug 2024 - Nov 2025 (1 year 3 months)

Debugged and improved YOLOv11 performance in real farm conditions by expanding augmentation and retraining on 8,000+ images, raising accuracy from 82% to 95%. Reduced edge inference latency by 60% using TensorRT on Raspberry Pi and improved alerting/monitoring reliability (manual inspection time down 40%).

Education

Degrees, certifications, and relevant coursework

Indian Institute of Technology Madras

Bachelor of Science, Data Science and Applications

2023 - 2026

Grade: CGPA: 7.00/10

B.Sc. in Data Science and Applications at IIT Madras from 2023 to 2026. CGPA: 7.00/10.

Marri Laxman Reddy Institute of Technology

Bachelor of Technology, Computer Science and Engineering

2022 - 2026

Grade: CGPA: 8.49/10

B.Tech in Computer Science and Engineering at Marri Laxman Reddy Institute of Technology, 2022 to 2026. CGPA: 8.49/10.
