HimalayasHimalayas logo
sumit kumarSK
Looking for a job

sumit kumar

@sumitkumar28

Entry-level machine learning engineer building GenAI, RAG, and computer vision systems that ship fast and measure impact.

India
Message

What I'm looking for

I’m looking for a role where I can build and deploy GenAI (RAG, agentic systems) and computer vision models, focusing on measurable latency/accuracy gains and strong engineering practices.

I’m a B.Tech student in Computer Science & Engineering – Artificial Intelligence at IIITDM Chennai, focused on turning research-grade ideas into usable systems. My coursework spans Data Structures, Algorithm Analysis, Deep Learning, NLP, Large Language Models, Retrieval Systems, DBMS, Operating Systems, and MLOps.

I’ve led and delivered at competitive scale, including “Machine Learning Team Lead – Amazon ML Challenge 2024,” where I achieved Top 1100 rank (Top 4%) out of 25,000+ global participants. By building a multimodal AI pricing system, I reduced inference latency by 27% with an ensemble strategy (LightGBM/XGBoost) and decreased RMSE by 14.8%. I also compete as a “Machine Learning Engineer – Kaggle Competitions,” ranking Top 800 globally by using deep learning and Vision Transformer architectures.

In projects, I build production-style LLM and AI pipelines: my “RAG Document Intelligence System” supports 5+ document formats with ingestion (chunking → embedding → semantic indexing via Pinecone), Groq-powered Llama-3.3-70B for ~2s latency, and FastAPI endpoints for end-to-end query processing. I also created an “AI-Powered GitHub Repository Auditor” using a multi-agent LangGraph design to analyze code quality, security vulnerabilities, and dependency risks, generating structured health reports with severity-ranked findings.

I’m equally comfortable with computer vision and deployment. I developed a “Universal Deepfake Detection System” combining Swin Transformer visual semantics with forensic noise analysis (98% classification accuracy on challenging benchmarks) and shipped a live interactive demo on HuggingFace Spaces; I also built a real-time “AI-Powered Exercise Form Tracker” using MediaPipe at 30 FPS with Kalman filtering for smoothed movement vectors.

Experience

Work history, roles, and key accomplishments

GA
Current

AI-Powered GitHub Auditor

GitHub Repository Auditor

Mar 2026 - Present (2 months)

Architected a multi-agent LangGraph system that analyzes code quality, security vulnerabilities, dependency risks, and git activity in parallel. Implemented agent orchestration with LangGraph state machines and generated structured health reports with severity-ranked findings and recommendations.

US
Current

Universal Deepfake Detection

Universal Deepfake Detection System

Jan 2026 - Present (4 months)

Developed a hybrid deep learning model combining Swin Transformer visual semantics with forensic noise analysis to classify AI-generated vs. real images. Achieved 98% classification accuracy on benchmarks including Midjourney v6 and DALL-E 3, and deployed a live interactive demo on HuggingFace Spaces with a Streamlit UI.

RS
Current

RAG Document Intelligence

RAG Document Intelligence System

Sep 2025 - Present (8 months)

Built a production-style RAG pipeline supporting 5+ document formats (PDF, Word, email, OCR images) from ingestion to semantic indexing with Pinecone. Integrated Llama-3.3-70B via Groq API (~2s latency), exposed FastAPI endpoints for query processing, and generated clause-level JSON evidence with confidence scoring.

Kaggle logoKA
Current

Machine Learning Engineer

Kaggle

Jan 2024 - Present (2 years 4 months)

Achieved top 800 global rank across 4+ ML competitions using deep learning and Vision Transformer architectures. Built robust training pipelines with 5-fold cross-validation and Optuna hyperparameter tuning, improving accuracy by 12%.

AC

Machine Learning Team Lead

Amazon ML Challenge

Aug 2024 - Dec 2024 (4 months)

Led development of a multimodal AI pricing system, achieving top 1100 rank (Top 4%) out of 25,000+ global participants. Reduced inference latency by 27% using an ensemble strategy (LightGBM/XGBoost) and decreased RMSE by 14.8%.

AT

AI-Powered Exercise Tracker

AI-Powered Exercise Form Tracker

Jan 2025 - Present (1 year 4 months)

Built a real-time pose estimation system using MediaPipe to track 33 keypoints at 30 FPS, providing instant audio corrective feedback for exercise form analysis. Improved movement smoothing and geometric angle calculations across repetitions using Kalman filtering.

Education

Degrees, certifications, and relevant coursework

IIITDM Chennai logoIC

IIITDM Chennai

Bachelor of Technology, Computer Science & Engineering (Artificial Intelligence)

2023 - 2027

Activities and societies: Relevant coursework: Data Structures, Algorithm Analysis, Deep Learning, NLP, Large Language Models, Retrieval Systems, DBMS, Operating Systems, MLOps.

BTech in Computer Science & Engineering (Artificial Intelligence focus) at IIITDM Chennai (Aug 2023–May 2027), covering core CS and advanced AI/LLM systems.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan