Umang Sharma
@umangsharma
AI/ML Data Analyst building production RAG and agentic NLP systems that improve accuracy and reliability.
What I'm looking for
I’m an AI/ML-focused Data Analyst with 1.6+ years of industry experience designing and deploying NLP, Generative AI, and deep learning solutions in production. I build systems that don’t just work—they measurably improve accuracy and reliability, especially in healthcare and enterprise settings.
At Elucidata, I built a multi-agent Graph Data Science framework on the Polly Knowledge Graph, routing natural-language questions through specialized ReAct agents and orchestrating them with LangGraph tool-calling across custom GDS algorithms. Using GPT-based planner/supervisor flows with CoT prompt chaining, I achieved >80% TPR.
I also engineered an agentic pipeline for OMOP CDM 5.4 schema mapping of clinical EHR data, reaching 90%+ mapping accuracy and 85% concordance with ground truth across Synthea and MIMIC-IV. To keep the workflow compliant and dependable, I added a weighted penalty model for iterative validation, automated PHI/PII detection for HIPAA-aligned processing, and deployed the solution end-to-end as a FastAPI microservice.
Before that, I delivered a Knowledge Graph MVP using LLMs and LangChain to extract biomedical entities and relationships from PubMed/PMC abstracts, integrated into the Polly platform via an R Shiny interface. I’m equally hands-on with production RAG—building hybrid retrieval (FAISS + BM25), corrective grounding checks, and async ingestion pipelines using Celery, Redis, and Docker—while staying grounded in performance evaluation.
Experience
Work history, roles, and key accomplishments
Data Analyst
Elucidata
Jul 2025 - Present (11 months)
Built a multi-agent Graph Data Science framework on the Polly Knowledge Graph, routing natural-language queries through 5 ReAct agents and 12 custom GDS algorithms to achieve >80% TPR. Engineered an agentic OMOP CDM 5.4 clinical EHR mapping pipeline with 90%+ mapping accuracy and 85% concordance, including HIPAA-compliant PHI/PII detection and deployment as a FastAPI microservice.
Machine Learning Intern
Elucidata
Jan 2025 - Jul 2025 (6 months)
Delivered a Knowledge Graph MVP using LLMs and LangChain to extract biomedical entities and relationships from PubMed/PMC abstracts, integrated into the Polly platform via an R Shiny interface. Built reusable adapters to unify 7+ heterogeneous biomedical sources into standardized S3-compatible formats for ingestion into the Polly Knowledge Graph.
Education
Degrees, certifications, and relevant coursework
Indraprastha Institute of Information Technology Delhi
Master of Technology, Computational Biology
2023 - 2025
Pursued an M.Tech in Computational Biology, studying protein thermal stability prediction using deep learning approaches such as structure- and sequence-based modeling.
Meerut Institute of Engineering and Technology
Bachelor of Technology
2019 - 2023
Completed a B.Tech degree at Meerut Institute of Engineering and Technology (AKTU).
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Umang?
You can contact Umang and 90k+ other talented remote workers on Himalayas.
Message UmangFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
