Sambodh Gupta
@sambodhgupta
Data Analyst and ML builder who turns messy data into explainable, automated outcomes.
What I'm looking for
I’m a Computer Science and Engineering B.Tech student building real-world data products—especially pipelines that automate work, improve data quality, and make results understandable to people.
As a Data Analyst Intern, I automated an end-to-end social media posting workflow using Meta Graph API and Google Apps Script, removing ~8 hours/week of manual scheduling. I cleaned and standardized 20,873 records across a 409-field schema, fixing 12 categories of data quality issues and cutting null rate from 34% to <2%, enabling the team’s first automated reporting dashboard.
I’ve also worked on research-grade data engineering: I built an RDF compression pipeline (OpenRefine + custom OWL ontology) that reduced a 77 GB dataset to 9 GB (88%), cutting SPARQL query latency by ~60% for real-time analytics on 120 years of Olympic records. I contributed to an Olympics Semantic Web portal using SPARQL and designed ontology relationships in Protégé.
Across projects, I combine strong analytics with explainable AI and modern NLP/RAG. I built a Multi-Disease Prediction System with XAI (SHAP/LIME) and a full-stack healthcare portal, plus a RAG-based teaching assistant that uses Whisper transcription, timestamped chunking, embedding retrieval, and LLM inference with a Streamlit UI.
Experience
Work history, roles, and key accomplishments
Data Analyst Intern
Red Dot Foundation
Dec 2025 - Mar 2026 (3 months)
Automated end-to-end social media posting using Meta Graph API and Google Apps Script, eliminating ~8 hrs/week of manual scheduling and ensuring consistent posting across 3 platforms. Cleaned and standardized 20,873 records across a 409-field schema, reducing null rate from 34% to <2% and enabling the team’s first automated reporting dashboard.
Research Intern
NIT Kurukshetra
Jan 2026 - Mar 2026 (2 months)
Built an RDF compression pipeline using OpenRefine and a custom OWL ontology, reducing dataset size from 77 GB to 9 GB (88%) and cutting SPARQL query latency by ~60% for real-time analytics on 120 years of Olympic records. Developed OlyGraph components and architected a multi-graph RDF system with Apache Jena Fuseki, SPARQLWrapper, and Streamlit to resolve triple-count inconsistencies.
Education
Degrees, certifications, and relevant coursework
Indian Institute of Information Technology Manipur
Bachelor of Technology, Computer Science and Engineering
2023 -
Pursuing a B.Tech in Computer Science and Engineering at IIIT Manipur (Aug 2023–present).
Availability
Location
Authorized to work in
Social media
Job categories
Interested in hiring Sambodh?
You can contact Sambodh and 90k+ other talented remote workers on Himalayas.
Message SambodhFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
