snehit vaddi
@snehitvaddi
AI Engineer building production GenAI systems—RAG, agents, and guardrails—served to thousands of real users.
What I'm looking for
I’m an AI Engineer with 4+ years of experience in data engineering, ML systems, and NLP, with 2+ years focused on production GenAI. I’ve shipped GenAI applications serving 15,000+ real users daily, spanning multi-agent orchestration, RAG systems, evaluation pipelines, LLM fine-tuning, and enterprise guardrails.
I build AI SaaS products with 1,000+ active users and pair engineering with research—published work on LLM hallucination detection and small model reasoning. I’m hands-on end-to-end: from retrieval, parsing, and orchestration to monitoring, PHI-safe pipelines, and CI/CD deployments that keep systems reliable in production.
Experience
Work history, roles, and key accomplishments
AI Engineer - GenAI Systems
ModMed
Feb 2025 - Present (1 year 3 months)
Shipped a clinical ambient AI scribe serving 15,000+ providers across 11 specialties, automating 70% of documentation. Built agentic document processing with OpenAI Agents SDK and fine-tuned vision-language models, reducing costs from $400K/month to $20K/month (95%) while supporting PHI-safe pipelines and RAG quality guardrails.
Graduate Researcher, AI/ML
University of Florida
Feb 2023 - Dec 2024 (1 year 10 months)
Developed a hybrid YOLOv8-ViT model improving small-object detection by 15%, with Grad-CAM/EigenCAM explainability and publications in SPIE (2025) and IEEE (2023). Built an interactive React analytics dashboard (adoption 15% → 85%) and automated retraining via MLflow and GitHub Actions, reducing deployment time from 4 hours to 15 minutes.
AI Software Developer Intern
GeoSpider AI USA
May 2024 - Jul 2024 (2 months)
Built a LangGraph-based multi-agent RAG system that autonomously resolved 65% of customer tickets from a 50K-doc knowledge base. Improved helpfulness from 43% to 76%, relevance by 30%, and achieved 92% recall@10 with FAISS hybrid search and vLLM (40% p95 latency reduction).
Software Data Engineer
AT&T (via Accenture)
Jun 2021 - Dec 2022 (1 year 6 months)
Engineered a BERT + XGBoost intent classifier (88% F1) that reduced unnecessary technician dispatches by 12K/year, saving $2M annually. Built Elasticsearch + Word2Vec anomaly detection to cut diagnosis time by 40% and Tier-2 escalations by 25%, and optimized PySpark/Delta Lake pipelines for 1M+ logs/day (30% lower latency).
Education
Degrees, certifications, and relevant coursework
University of Florida
Master of Science, Computer & Information Science
2023 - 2024
Master of Science in Computer & Information Science from the University of Florida. Completed graduate research and published work in AI/ML and computer vision.
GITAM University
Bachelor of Technology, Computer Science
2017 - 2021
Grade: 3.9/4.0
Bachelor of Technology in Computer Science at GITAM University, completed with a GPA of 3.9/4.0.
Tech stack
Software and tools used professionally
Azure Synapse
Apache Spark
GitHub
Kubernetes
GitHub Actions
Pandas
PySpark
dbt
Gmail
Databricks
Redis
Terraform
TensorFlow
PyTorch
MLflow
scikit-learn
Streamlit
FastAPI
Datadog
Elasticsearch
Airflow
GuardRails
SQL
XGBoost
Qdrant
LangChain
Pinecone
Delta Lake
vLLM
OpenAI API
Evidence
DSPy
pgvector
Agentic
Modal
Faiss
LangGraph
uv
Dynamic
Unity Catalog
Remote
Safe
Jan
Availability
Location
Authorized to work in
Portfolio
resume2portfolio.comJob categories
Skills
Interested in hiring snehit?
You can contact snehit and 90k+ other talented remote workers on Himalayas.
Message snehitFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
