Sumit Kumar
@sumitkumar28
Entry-level machine learning engineer building GenAI, RAG, and computer vision systems that ship fast and measure impact.
What I'm looking for
I’m a B.Tech student in Computer Science & Engineering – Artificial Intelligence at IIITDM Chennai, focused on turning research-grade ideas into usable systems. My coursework spans Data Structures, Algorithm Analysis, Deep Learning, NLP, Large Language Models, Retrieval Systems, DBMS, Operating Systems, and MLOps.
I’ve led and delivered in large competitions. As Machine Learning Team Lead in the Amazon ML Challenge 2024, I led development of a multimodal AI pricing system and achieved a top 1,100 rank (top 4%) out of 25,000+ global participants. I reduced inference latency by 27% using an ensemble strategy (LightGBM/XGBoost) and decreased RMSE by 14.8%. I also compete in Kaggle competitions as a machine learning engineer, ranking in the top 800 globally with deep learning and Vision Transformer architectures.
In projects, I build production-style LLM and AI pipelines. My RAG Document Intelligence System supports 5+ document formats, with an ingestion pipeline (chunking → embedding → semantic indexing in Pinecone), Llama-3.3-70B served through the Groq API at ~2s latency, and FastAPI endpoints for end-to-end query processing. I also built an AI-Powered GitHub Repository Auditor, a multi-agent LangGraph system that analyzes code quality, security vulnerabilities, and dependency risks and generates structured health reports with severity-ranked findings.
I’m equally comfortable with computer vision and deployment. I developed a Universal Deepfake Detection System that combines Swin Transformer visual semantics with forensic noise analysis (98% classification accuracy on benchmarks including Midjourney v6 and DALL-E 3) and shipped a live interactive demo on HuggingFace Spaces. I also built a real-time AI-Powered Exercise Form Tracker that uses MediaPipe at 30 FPS, with Kalman filtering to smooth movement vectors.
Experience
Work history, roles, and key accomplishments
AI-Powered GitHub Auditor
GitHub Repository Auditor
Mar 2026 - Present (2 months)
Architected a multi-agent LangGraph system that analyzes code quality, security vulnerabilities, dependency risks, and git activity in parallel. Implemented agent orchestration with LangGraph state machines and generated structured health reports with severity-ranked findings and recommendations.
Universal Deepfake Detection
Universal Deepfake Detection System
Jan 2026 - Present (4 months)
Developed a hybrid deep learning model combining Swin Transformer visual semantics with forensic noise analysis to classify AI-generated vs. real images. Achieved 98% classification accuracy on benchmarks including Midjourney v6 and DALL-E 3, and deployed a live interactive demo on HuggingFace Spaces with a Streamlit UI.
RAG Document Intelligence
RAG Document Intelligence System
Sep 2025 - Present (8 months)
Built a production-style RAG pipeline supporting 5+ document formats (PDF, Word, email, OCR images) from ingestion to semantic indexing with Pinecone. Integrated Llama-3.3-70B via Groq API (~2s latency), exposed FastAPI endpoints for query processing, and generated clause-level JSON evidence with confidence scoring.
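The chunking step at the start of that ingestion pipeline can be illustrated with a minimal sketch. It assumes fixed-size word windows with overlap, which may differ from how the actual system chunks documents; the function name `chunk_words` and the default sizes are hypothetical. Embedding and the Pinecone index writes are left out.

```python
def chunk_words(text, size=200, overlap=40):
    """Split text into overlapping word windows (hypothetical chunker).

    size and overlap are counted in words; both defaults are illustrative.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("require size > 0 and 0 <= overlap < size")
    words = text.split()
    step = size - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        # Stop once a window has reached the end of the text,
        # so no trailing chunk is fully contained in the previous one.
        if start + size >= len(words):
            break
    return chunks
```

The overlap means a clause that straddles a chunk boundary still appears whole in at least one chunk, which matters when answers are expected to cite clause-level evidence.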
Machine Learning Engineer
Kaggle
Jan 2024 - Present (2 years 4 months)
Achieved top 800 global rank across 4+ ML competitions using deep learning and Vision Transformer architectures. Built robust training pipelines with 5-fold cross-validation and Optuna hyperparameter tuning, improving accuracy by 12%.
Machine Learning Team Lead
Amazon ML Challenge
Aug 2024 - Dec 2024 (4 months)
Led development of a multimodal AI pricing system, achieving top 1100 rank (Top 4%) out of 25,000+ global participants. Reduced inference latency by 27% using an ensemble strategy (LightGBM/XGBoost) and decreased RMSE by 14.8%.
AI-Powered Exercise Tracker
AI-Powered Exercise Form Tracker
Jan 2025 - Present (1 year 4 months)
Built a real-time pose estimation system using MediaPipe to track 33 keypoints at 30 FPS, providing instant audio corrective feedback for exercise form analysis. Improved movement smoothing and geometric angle calculations across repetitions using Kalman filtering.
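The angle calculation and smoothing described above can be sketched as follows. This is a simplified illustration, not the project's code: it assumes 2D keypoints as (x, y) pairs, a scalar Kalman filter with a constant-value model, and made-up noise variances. The names `joint_angle` and `ScalarKalman` are hypothetical, and the MediaPipe landmark extraction is omitted.

```python
import math

def joint_angle(a, b, c):
    """Angle in degrees at point b, formed by the segments b-a and b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    norm = math.hypot(*v1) * math.hypot(*v2)
    if norm == 0:
        return 0.0  # degenerate case: two keypoints coincide
    cos = (v1[0] * v2[0] + v1[1] * v2[1]) / norm
    # Clamp to guard against floating-point drift outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, cos))))

class ScalarKalman:
    """1D Kalman filter assuming the angle is roughly constant between frames."""

    def __init__(self, process_var=1e-2, meas_var=4.0):
        self.q = process_var  # assumed process noise variance
        self.r = meas_var     # assumed measurement noise variance
        self.x = None         # current estimate
        self.p = 1.0          # estimate variance

    def update(self, z):
        if self.x is None:
            self.x = z  # initialize from the first measurement
            return self.x
        self.p += self.q                 # predict: uncertainty grows
        k = self.p / (self.p + self.r)   # Kalman gain
        self.x += k * (z - self.x)       # correct toward the measurement
        self.p *= 1.0 - k
        return self.x
```

In use, one filter per joint would receive the per-frame angle, for example `kf.update(joint_angle(shoulder, elbow, wrist))`, so a single noisy frame shifts the estimate only partway toward the outlier.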
Education
Degrees, certifications, and relevant coursework
IIITDM Chennai
Bachelor of Technology, Computer Science & Engineering (Artificial Intelligence)
2023 - 2027
Relevant coursework: Data Structures, Algorithm Analysis, Deep Learning, NLP, Large Language Models, Retrieval Systems, DBMS, Operating Systems, MLOps.
B.Tech in Computer Science & Engineering (Artificial Intelligence focus) at IIITDM Chennai (Aug 2023–May 2027), covering core CS and advanced AI/LLM systems.
