Anay Chauhan
@anaychauhan
Applied ML researcher and R&D engineer focused on efficient LLM inference, RAG systems, and production-ready model evaluation.
What I'm looking for
I’m an Applied Machine Learning Researcher and R&D Engineer building efficient, memory-aware capabilities for transformer-based LLM inference. I designed a kernel-native KV cache compression method that couples Angle Domain Attention with a rate-distortion objective, achieving 1.55–1.72× higher decode throughput at matched perplexity and reducing HBM bytes/token by 24% through a compressed-domain GPU kernel without dense-key reconstruction.
At Synopsys, I shipped an agentic spec-to-code Retrieval-Augmented Generation (RAG) pipeline using a DSL object model, and I fine-tuned Qwen2.5 with LoRA adapters to improve DSL schema compliance by 35% over prompt-only baselines. I also architected AST-level correctness checks, schema validation, and regression-test infrastructure to reduce silent failures, and I delivered a FastMCP server integrating Coverity static analysis for automated code-review filing adopted across 100+ engineers.
Experience
Work history, roles, and key accomplishments
Applied Machine Learning Researcher
Pragya Labs
Jan 2026 - Present (5 months)
Designed a kernel-native KV cache compression method using Angle Domain Attention with a rate-distortion retention objective, achieving lossless-equivalent quality at reduced cache sizes for LLM inference. Improved decode throughput by 1.55–1.72× at matched perplexity on 8K/32K/128K windows and reduced HBM bytes/token by 24%.
Designed and shipped an agentic retrieval-augmented generation (RAG) pipeline for spec-to-code synthesis using a DSL object model, deployed in production tooling for Google, Qualcomm, and MediaTek. Fine-tuned Qwen2.5 with LoRA (SFT) to improve DSL schema compliance by 35% and shipped an MCP server integrating Coverity static analysis adopted by 100+ engineers.
Undergraduate Researcher
Translational Biology Lab
Jan 2024 - May 2024 (4 months)
Built ML pipelines for protein-ligand binding (IC50) prediction on 11K ChEMBL samples for ALK and BRAF using Mordred/RDKit descriptors. Improved binding affinity prediction by 20% via cross-feature interaction terms and achieved k-fold cross-validated R=0.92 (Linear Regression) and R=0.89 (Random Forest).
Education
Degrees, certifications, and relevant coursework
Indraprastha Institute of Information Technology (IIIT), Delhi
Bachelor of Technology, Computer Science
2021 - 2025
Bachelor of Technology in Computer Science at Indraprastha Institute of Information Technology (IIIT), Delhi from 2021 to 2025.
Availability
Location
Authorized to work in
Salary expectations
Social media
Job categories
Skills
Interested in hiring Anay?
You can contact Anay and 90k+ other talented remote workers on Himalayas.
Message AnayFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
