Skip to main content
HimalayasHimalayas logo
K SriniKS
Open to opportunities

K Srini

@ksrini

AI/ML Engineer building production GenAI RAG and multi-agent systems—end to end from prototype to optimized inference.

India
Message

What I'm looking for

I’m looking to build reliable GenAI/RAG products in a platform-minded team—owning evaluation, deployment, and performance (latency/cost), with room to grow into senior ML systems leadership.

I’m an AI/ML Engineer with 3+ years at TCS building production GenAI systems, RAG pipelines, and multi-agent workflows. I work across the full LLM stack—from data ingestion and embedding pipelines to vector search, fine-tuning (LoRA/QLoRA), model serving (vLLM, AWS Bedrock), inference optimization, and evaluation frameworks.

What sets me apart is that I don’t stop at prototypes: I deliver production-grade systems with platform engineering discipline. I’ve designed FastAPI microservices, deployed on AWS EKS, and used Terraform IaC, CI/CD automation, and observability to own the full path from model development to reliable runtime behavior.

In my role, I’ve improved retrieval accuracy ~25% with hybrid search (BM25 + dense vector) and cross-encoder reranking, reduced RAG response latency 30–40% using Redis semantic caching and async execution, and set up rigorous evaluation pipelines (RAGAS and custom LLM-as-judge) with regression tests and MLflow-based experiment lifecycle management. I’m especially energized by building agentic workflows with LangGraph/CrewAI and making them measurable, testable, and cost-aware in production.

Experience

Work history, roles, and key accomplishments

TT
Current

AI/ML & Platform Engineer

May 2022 - Present (4 years 1 month)

Designed and deployed production RAG pipelines for enterprise document intelligence, improving retrieval accuracy ~25% using hybrid search (BM25 + dense vector) with cross-encoder reranking. Built multi-agent LangGraph/CrewAI workflows and integrated vLLM/Bedrock, reducing end-to-end RAG latency 30–40% and implementing MLflow-based experiment tracking, evaluation, and gated model promotions.

Education

Degrees, certifications, and relevant coursework

BC

BVC Engineering College

Bachelor of Technology (B.Tech), Engineering

B.Tech degree from BVC Engineering College, affiliated with JNTU Kakinada.

AA

Amazon Web Services (AWS)

AWS Certified Developer – Associate, Cloud Computing

AWS Certified Developer – Associate certification.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan