Skip to main content
Nikhil MouryaNM
Open to opportunities

Nikhil Mourya

@nikhilmourya

ML Engineer focused on production RAG systems, LLM fine-tuning, and MLOps on AWS.

India
Message

What I'm looking for

I’m looking to build and deploy reliable LLM-powered products—RAG, fine-tuning, and MLOps—with strong evaluation and observability, shipping quickly while keeping systems grounded, safe, and maintainable.

I’m an ML engineer currently building production-grade LLM applications—especially multi-agent orchestration and RAG pipelines that are measurable, testable, and fast at scale. I shipped systems that process 10K+ requests/day, combining strong retrieval with grounded generation and practical deployment patterns.

Most recently, I built an India-specific resume scoring system using transformer embeddings and FAISS vector search, then fine-tuned on a 75K-resume dataset. My RAG retrieval pipeline (LangChain/LangGraph + OpenAI API) improved shortlist precision by 35% versus a keyword-matching baseline at 10K+ resumes/day, and my Docker + AWS MLOps deployment served traffic through a REST API, increasing platform engagement by 40% by integrating into recruiter workflows.

I also build tools and research prototypes that feel production-ready. I created HiveMind, a 5-agent autonomous deep research system that loops until a deterministic confidence threshold is met to prevent hallucinated certainty, and I shipped CodeLens, an offline VS Code extension for semantic code search with AST-based chunking and sub-10ms ANN latency. Alongside applied engineering, I’ve validated parameter-efficient fine-tuning with PEGASUS + LoRA (dramatically reducing trainable parameters) and contributed to IIT Kharagpur research on structured pruning for biomedical segmentation (IoU > 0.95).

Experience

Work history, roles, and key accomplishments

HI

Software Engineer (ML)

HireBuddy

May 2025 - Nov 2025 (6 months)

Built an India-specific resume scoring system using RAG pipelines, transformer embeddings, and FAISS vector search, fine-tuned on a 75K-resume dataset; achieved 35% higher shortlist precision than a keyword baseline while processing 10K+ resumes/day. Designed and evaluated an end-to-end RAG retrieval pipeline (LangChain/LangGraph, RAGAS/LangSmith) and deployed the MLOps stack to production via Doc

Education

Degrees, certifications, and relevant coursework

Birla Institute of Technology, Mesra logoBM

Birla Institute of Technology, Mesra

Bachelor of Technology (B.Tech), Physics and Computing

2023 - 2027

B.Tech program in Physics and Computing (Aug 2023 to May 2027) with focus on ML, deep learning, and optimization.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan