Skip to main content
Anay ChauhanAC
Looking for a job

Anay Chauhan

@anaychauhan

Applied ML researcher and R&D engineer focused on efficient LLM inference, RAG systems, and production-ready model evaluation.

India
Message

What I'm looking for

I’m looking for a team where I can build and ship efficient LLM inference and agentic RAG systems—blending research rigor with production evaluation, correctness checks, and deployment-focused engineering on real-world models.

I’m an Applied Machine Learning Researcher and R&D Engineer building efficient, memory-aware capabilities for transformer-based LLM inference. I designed a kernel-native KV cache compression method that couples Angle Domain Attention with a rate-distortion objective, achieving 1.55–1.72× higher decode throughput at matched perplexity and reducing HBM bytes/token by 24% through a compressed-domain GPU kernel without dense-key reconstruction.

At Synopsys, I shipped an agentic spec-to-code Retrieval-Augmented Generation (RAG) pipeline using a DSL object model, and I fine-tuned Qwen2.5 with LoRA adapters to improve DSL schema compliance by 35% over prompt-only baselines. I also architected AST-level correctness checks, schema validation, and regression-test infrastructure to reduce silent failures, and I delivered a FastMCP server integrating Coverity static analysis for automated code-review filing adopted across 100+ engineers.

Experience

Work history, roles, and key accomplishments

PL
Current

Applied Machine Learning Researcher

Pragya Labs

Jan 2026 - Present (6 months)

Designed a kernel-native KV cache compression method using Angle Domain Attention with a rate-distortion retention objective, achieving lossless-equivalent quality at reduced cache sizes for LLM inference. Improved decode throughput by 1.55–1.72× at matched perplexity on 8K/32K/128K windows and reduced HBM bytes/token by 24%.

Synopsys logoSY
Current

R&D Engineer

Jun 2025 - Present (1 year 1 month)

Designed and shipped an agentic retrieval-augmented generation (RAG) pipeline for spec-to-code synthesis using a DSL object model, deployed in production tooling for Google, Qualcomm, and MediaTek. Fine-tuned Qwen2.5 with LoRA (SFT) to improve DSL schema compliance by 35% and shipped an MCP server integrating Coverity static analysis adopted by 100+ engineers.

TL

Undergraduate Researcher

Translational Biology Lab

Jan 2024 - May 2024 (4 months)

Built ML pipelines for protein-ligand binding (IC50) prediction on 11K ChEMBL samples for ALK and BRAF using Mordred/RDKit descriptors. Improved binding affinity prediction by 20% via cross-feature interaction terms and achieved k-fold cross-validated R=0.92 (Linear Regression) and R=0.89 (Random Forest).

Education

Degrees, certifications, and relevant coursework

Indraprastha Institute of Information Technology (IIIT), Delhi logoID

Indraprastha Institute of Information Technology (IIIT), Delhi

Bachelor of Technology, Computer Science

2021 - 2025

Bachelor of Technology in Computer Science at Indraprastha Institute of Information Technology (IIIT), Delhi from 2021 to 2025.

Get matched with your dream remote job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan