Abhishek Gaur
@abhishekgaur2
Senior Machine Learning Engineer specializing in generative AI, MLOps, and production-ready deep learning systems.
What I'm looking for
I am a Senior Machine Learning Engineer with 12+ years of experience building production AI systems across generative AI, deep learning, computer vision, and edge inference. I specialize in end-to-end GenAI stacks (RAG, agentic orchestration, SLM fine-tuning, LLMOps) and have repeatedly reduced latency and cost while improving relevance and faithfulness in production deployments.
I have led architecture reviews, mentored engineers, and delivered measurable improvements, including sub-2.5s p95 latency for on-prem Llama-3.1-70B inference, a 60% inference cost reduction via routing agents, a 23% NDCG@5 gain through custom re-ranking, and major retrieval/recall gains. I seek a Lead Generative AI role to drive technical direction, mentor teams, and own the full AI product lifecycle.
Experience
Work history, roles, and key accomplishments
Architected an internal agentic AI platform consolidating 15+ tools into a conversational interface; deployed Llama-3.1-70B on-premises with sub-2.5s p95 latency and reduced inference cost by 60%; and established LLMOps observability that cut root-cause analysis time from hours to minutes.
Architected a behavioral analytics platform processing millions of events daily and built multi-model forecasting and anomaly detection pipelines; led development of a multi-turn LLM fitness coaching assistant using RAG and agent routing with sub-2s p95 latency.
Built real-time 3D body pose estimation and temporal pose models for the Peloton Guide, achieving a 4× model size reduction via INT8 quantization and TensorRT while maintaining >30 FPS and reducing error rate by 35%.
Computer Vision Scientist
Affectiva
Apr 2019 - Jun 2020 (1 year 2 months)
Ported face detection and landmark models to TensorFlow Lite with INT8 quantization, reducing model size by 75% and latency by 40%, and designed safety-critical in-cabin child presence detection targeting >99% recall for automotive deployments.
Deep Learning Engineer
Neurala
Feb 2017 - Apr 2019 (2 years 2 months)
Developed AI-assisted video annotation with model-in-the-loop active learning reducing annotation time ~50%, built 3D vision systems for industrial inspection, and implemented pruning/quantization pipelines reducing inference cost up to 3×.
Education
Degrees, certifications, and relevant coursework
Boston University
Master of Science, Computer Engineering
2015 - 2017
Completed a Master of Science in Computer Engineering with coursework and projects focused on machine learning, computer vision, and systems for deployment.
Guru Gobind Singh Indraprastha University
Bachelor of Science, Computer Science
2010 - 2014
Completed a Bachelor of Science in Computer Science with foundational studies in algorithms, programming, and software engineering.
Tech stack
Software and tools used professionally
GitHub
Kubernetes
GitHub Actions
MySQL
MongoDB
Gmail
Adobe Analytics
JSON
TensorFlow
PyTorch
MLflow
scikit-learn
Keras
TensorFlow Lite
NLTK
FastAPI
Grafana
Prometheus
Linux
GuardRails
RootCause
SQL
XGBoost
Hugging Face
Temporal
LangChain
Weaviate
Evidently AI
BentoML
Pinecone
WhyLabs
Feast
vLLM
Harness
Bash
Agentic
Faiss
LangGraph
LangSmith
PEFT
Agno
Movement
