K Srini
@ksrini
AI/ML Engineer building production GenAI RAG and multi-agent systems—end to end from prototype to optimized inference.
What I'm looking for
I’m an AI/ML Engineer with 3+ years at TCS building production GenAI systems, RAG pipelines, and multi-agent workflows. I work across the full LLM stack—from data ingestion and embedding pipelines to vector search, fine-tuning (LoRA/QLoRA), model serving (vLLM, AWS Bedrock), inference optimization, and evaluation frameworks.
What sets me apart is that I don’t stop at prototypes: I deliver production-grade systems with platform engineering discipline. I’ve designed FastAPI microservices, deployed on AWS EKS, and used Terraform IaC, CI/CD automation, and observability to own the full path from model development to reliable runtime behavior.
In my role, I’ve improved retrieval accuracy ~25% with hybrid search (BM25 + dense vector) and cross-encoder reranking, reduced RAG response latency 30–40% using Redis semantic caching and async execution, and set up rigorous evaluation pipelines (RAGAS and custom LLM-as-judge) with regression tests and MLflow-based experiment lifecycle management. I’m especially energized by building agentic workflows with LangGraph/CrewAI and making them measurable, testable, and cost-aware in production.
Experience
Work history, roles, and key accomplishments
Designed and deployed production RAG pipelines for enterprise document intelligence, improving retrieval accuracy ~25% using hybrid search (BM25 + dense vector) with cross-encoder reranking. Built multi-agent LangGraph/CrewAI workflows and integrated vLLM/Bedrock, reducing end-to-end RAG latency 30–40% and implementing MLflow-based experiment tracking, evaluation, and gated model promotions.
Education
Degrees, certifications, and relevant coursework
BVC Engineering College
Bachelor of Technology (B.Tech), Engineering
B.Tech degree from BVC Engineering College, affiliated with JNTU Kakinada.
Amazon Web Services (AWS)
AWS Certified Developer – Associate, Cloud Computing
AWS Certified Developer – Associate certification.
Tech stack
Software and tools used professionally
AWS Glue
GitHub
Kubernetes
AWS CodePipeline
GitHub Actions
PySpark
PostgreSQL
Gmail
Django
pre-commit
Redis
Terraform
JSON
PyTorch
MLflow
RabbitMQ
Django REST framework
FastAPI
Grafana
Prometheus
Datadog
OpenSearch
Serverless
s3-lambda
Hugging Face
LangChain
LlamaIndex
Pydantic
Pinecone
CrewAI
vLLM
ArgoCD
Terragrunt
Ragas
Karpenter
Faiss
LangGraph
PEFT
Middleware
Task
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring K?
You can contact K and 90k+ other talented remote workers on Himalayas.
Message KFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
