Skip to main content
Umang SharmaUS
Open to opportunities

Umang Sharma

@umangsharma

AI/ML Data Analyst building production RAG and agentic NLP systems that improve accuracy and reliability.

India
Message

What I'm looking for

I want to build production-grade NLP/GenAI systems—RAG and agentic workflows—that prioritize grounding, measurable quality, and reliable deployment. I’m excited by healthcare/enterprise problems and teams that value clean engineering and evaluation.

I’m an AI/ML-focused Data Analyst with 1.6+ years of industry experience designing and deploying NLP, Generative AI, and deep learning solutions in production. I build systems that don’t just work—they measurably improve accuracy and reliability, especially in healthcare and enterprise settings.

At Elucidata, I built a multi-agent Graph Data Science framework on the Polly Knowledge Graph, routing natural-language questions through specialized ReAct agents and orchestrating them with LangGraph tool-calling across custom GDS algorithms. Using GPT-based planner/supervisor flows with CoT prompt chaining, I achieved >80% TPR.

I also engineered an agentic pipeline for OMOP CDM 5.4 schema mapping of clinical EHR data, reaching 90%+ mapping accuracy and 85% concordance with ground truth across Synthea and MIMIC-IV. To keep the workflow compliant and dependable, I added a weighted penalty model for iterative validation, automated PHI/PII detection for HIPAA-aligned processing, and deployed the solution end-to-end as a FastAPI microservice.

Before that, I delivered a Knowledge Graph MVP using LLMs and LangChain to extract biomedical entities and relationships from PubMed/PMC abstracts, integrated into the Polly platform via an R Shiny interface. I’m equally hands-on with production RAG—building hybrid retrieval (FAISS + BM25), corrective grounding checks, and async ingestion pipelines using Celery, Redis, and Docker—while staying grounded in performance evaluation.

Experience

Work history, roles, and key accomplishments

EL
Current

Data Analyst

Elucidata

Jul 2025 - Present (11 months)

Built a multi-agent Graph Data Science framework on the Polly Knowledge Graph, routing natural-language queries through 5 ReAct agents and 12 custom GDS algorithms to achieve >80% TPR. Engineered an agentic OMOP CDM 5.4 clinical EHR mapping pipeline with 90%+ mapping accuracy and 85% concordance, including HIPAA-compliant PHI/PII detection and deployment as a FastAPI microservice.

EL

Machine Learning Intern

Elucidata

Jan 2025 - Jul 2025 (6 months)

Delivered a Knowledge Graph MVP using LLMs and LangChain to extract biomedical entities and relationships from PubMed/PMC abstracts, integrated into the Polly platform via an R Shiny interface. Built reusable adapters to unify 7+ heterogeneous biomedical sources into standardized S3-compatible formats for ingestion into the Polly Knowledge Graph.

Education

Degrees, certifications, and relevant coursework

Indraprastha Institute of Information Technology Delhi logoID

Indraprastha Institute of Information Technology Delhi

Master of Technology, Computational Biology

2023 - 2025

Pursued an M.Tech in Computational Biology, studying protein thermal stability prediction using deep learning approaches such as structure- and sequence-based modeling.

MT

Meerut Institute of Engineering and Technology

Bachelor of Technology

2019 - 2023

Completed a B.Tech degree at Meerut Institute of Engineering and Technology (AKTU).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan