HimalayasHimalayas logo
snehit vaddiSV
Open to opportunities

snehit vaddi

@snehitvaddi

AI Engineer building production GenAI systems—RAG, agents, and guardrails—served to thousands of real users.

United States
Message

What I'm looking for

I’m looking to build production GenAI products where RAG/agents, rigorous evaluation, and real-world guardrails matter. I want to ship fast, monitor quality in production, and work with teams that value measurable impact and reliability.

I’m an AI Engineer with 4+ years of experience in data engineering, ML systems, and NLP, with 2+ years focused on production GenAI. I’ve shipped GenAI applications serving 15,000+ real users daily, spanning multi-agent orchestration, RAG systems, evaluation pipelines, LLM fine-tuning, and enterprise guardrails.

I build AI SaaS products with 1,000+ active users and pair engineering with research—published work on LLM hallucination detection and small model reasoning. I’m hands-on end-to-end: from retrieval, parsing, and orchestration to monitoring, PHI-safe pipelines, and CI/CD deployments that keep systems reliable in production.

Experience

Work history, roles, and key accomplishments

MO
Current

AI Engineer - GenAI Systems

ModMed

Feb 2025 - Present (1 year 3 months)

Shipped a clinical ambient AI scribe serving 15,000+ providers across 11 specialties, automating 70% of documentation. Built agentic document processing with OpenAI Agents SDK and fine-tuned vision-language models, reducing costs from $400K/month to $20K/month (95%) while supporting PHI-safe pipelines and RAG quality guardrails.

UF

Graduate Researcher, AI/ML

University of Florida

Feb 2023 - Dec 2024 (1 year 10 months)

Developed a hybrid YOLOv8-ViT model improving small-object detection by 15%, with Grad-CAM/EigenCAM explainability and publications in SPIE (2025) and IEEE (2023). Built an interactive React analytics dashboard (adoption 15% → 85%) and automated retraining via MLflow and GitHub Actions, reducing deployment time from 4 hours to 15 minutes.

GU

AI Software Developer Intern

GeoSpider AI USA

May 2024 - Jul 2024 (2 months)

Built a LangGraph-based multi-agent RAG system that autonomously resolved 65% of customer tickets from a 50K-doc knowledge base. Improved helpfulness from 43% to 76%, relevance by 30%, and achieved 92% recall@10 with FAISS hybrid search and vLLM (40% p95 latency reduction).

AA

Software Data Engineer

AT&T (via Accenture)

Jun 2021 - Dec 2022 (1 year 6 months)

Engineered a BERT + XGBoost intent classifier (88% F1) that reduced unnecessary technician dispatches by 12K/year, saving $2M annually. Built Elasticsearch + Word2Vec anomaly detection to cut diagnosis time by 40% and Tier-2 escalations by 25%, and optimized PySpark/Delta Lake pipelines for 1M+ logs/day (30% lower latency).

Education

Degrees, certifications, and relevant coursework

University of Florida logoUF

University of Florida

Master of Science, Computer & Information Science

2023 - 2024

Master of Science in Computer & Information Science from the University of Florida. Completed graduate research and published work in AI/ML and computer vision.

GITAM University logoGU

GITAM University

Bachelor of Technology, Computer Science

2017 - 2021

Grade: 3.9/4.0

Bachelor of Technology in Computer Science at GITAM University, completed with a GPA of 3.9/4.0.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan