Skip to main content
HimalayasHimalayas logo
Sambodh GuptaSG
Open to opportunities

Sambodh Gupta

@sambodhgupta

Data Analyst and ML builder who turns messy data into explainable, automated outcomes.

India
Message

What I'm looking for

I’m looking for a role where I can build automation-first data pipelines and ML/NLP systems, focusing on data quality, explainability (SHAP/LIME), and production-ready apps with measurable performance improvements.

I’m a Computer Science and Engineering B.Tech student building real-world data products—especially pipelines that automate work, improve data quality, and make results understandable to people.

As a Data Analyst Intern, I automated an end-to-end social media posting workflow using Meta Graph API and Google Apps Script, removing ~8 hours/week of manual scheduling. I cleaned and standardized 20,873 records across a 409-field schema, fixing 12 categories of data quality issues and cutting null rate from 34% to <2%, enabling the team’s first automated reporting dashboard.

I’ve also worked on research-grade data engineering: I built an RDF compression pipeline (OpenRefine + custom OWL ontology) that reduced a 77 GB dataset to 9 GB (88%), cutting SPARQL query latency by ~60% for real-time analytics on 120 years of Olympic records. I contributed to an Olympics Semantic Web portal using SPARQL and designed ontology relationships in Protégé.

Across projects, I combine strong analytics with explainable AI and modern NLP/RAG. I built a Multi-Disease Prediction System with XAI (SHAP/LIME) and a full-stack healthcare portal, plus a RAG-based teaching assistant that uses Whisper transcription, timestamped chunking, embedding retrieval, and LLM inference with a Streamlit UI.

Experience

Work history, roles, and key accomplishments

RF

Data Analyst Intern

Red Dot Foundation

Dec 2025 - Mar 2026 (3 months)

Automated end-to-end social media posting using Meta Graph API and Google Apps Script, eliminating ~8 hrs/week of manual scheduling and ensuring consistent posting across 3 platforms. Cleaned and standardized 20,873 records across a 409-field schema, reducing null rate from 34% to <2% and enabling the team’s first automated reporting dashboard.

NIT Kurukshetra logoNK

Research Intern

NIT Kurukshetra

Jan 2026 - Mar 2026 (2 months)

Built an RDF compression pipeline using OpenRefine and a custom OWL ontology, reducing dataset size from 77 GB to 9 GB (88%) and cutting SPARQL query latency by ~60% for real-time analytics on 120 years of Olympic records. Developed OlyGraph components and architected a multi-graph RDF system with Apache Jena Fuseki, SPARQLWrapper, and Streamlit to resolve triple-count inconsistencies.

Education

Degrees, certifications, and relevant coursework

Indian Institute of Information Technology Manipur logoIM

Indian Institute of Information Technology Manipur

Bachelor of Technology, Computer Science and Engineering

2023 -

Pursuing a B.Tech in Computer Science and Engineering at IIIT Manipur (Aug 2023–present).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan