Skip to main content
Sai KiranSK
Looking for a job

Sai Kiran

@saikiran14

AI/ML Engineer building LLM systems and RAG architectures for reliable, low-latency production AI.

India
Message

What I'm looking for

I’m looking for a role where I can build and deploy production LLM/RAG systems end-to-end—owning MLOps, latency optimization, evaluation, and monitoring—so quality and reliability translate into real user impact.

I’m an AI/ML Engineer specializing in LLM systems, RAG architectures, NLP, and computer vision. I focus on building production-ready ML pipelines that deliver reliability, measurable quality, and real-world usability.

In a Google Partner Project internship, I designed and deployed an automated ML pipeline integrating REST APIs (Meta/Facebook Ads → CRM) with real-time ingestion, deduplication, and lead scoring. I reduced manual processing overhead by 45% through preprocessing and feature engineering workflows, and improved throughput by optimizing SQL queries and backend indexing.

Across my RAG work, I’ve led prompt engineering and embedding optimization to drive performance—maintaining 99% uptime for a production RAG system handling 500+ daily concurrent queries. On a legal-domain RAG project, I achieved strict grounding with <5% hallucination rate, improved clause-level relevance by 28%, and built citation generation that increased lawyer trust and adoption by 3x.

I also build supporting systems for evaluation and deployment, including MLflow-based monitoring and FastAPI inference with auto-scaling (sustained <1.2s response time under 50 concurrent users). I’m motivated by end-to-end ownership—turning model capabilities into dependable products.

Experience

Work history, roles, and key accomplishments

GP

ML Engineering Intern

Google Partner Project

Jan 2026 - Apr 2026 (3 months)

Designed and deployed an automated ML pipeline integrating REST APIs from Meta/Facebook Ads into a CRM with real-time ingestion, deduplication, and lead scoring. Reduced manual processing overhead by 45% by building preprocessing and feature engineering for anomaly flagging, and improved data retrieval latency through SQL query optimization and backend indexing.

Education

Degrees, certifications, and relevant coursework

Marri Laxman Reddy Institute of Technology logoMT

Marri Laxman Reddy Institute of Technology

B.Tech, Computer Science & IT

2022 -

Grade: CGPA: 9.0/10.0

Pursuing a B.Tech in Computer Science & IT with a CGPA of 9.0/10.0.

MC

Model Jr College

Intermediate, Intermediate

2020 - 2022

Grade: CGPA: 9.46/10.0

Completed Intermediate with a CGPA of 9.46/10.0.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan