Sourav Roy
@souravroy2
AI/ML Engineer specializing in production LLM systems, RAG pipelines, and low-latency GenAI APIs.
What I'm looking for
I’m an AI/ML Engineer with 3+ years of experience building production-grade LLM systems, including RAG pipelines and agentic AI workflows. I focus on the full LLM stack—from retrieval and ranking to generation and deployment—so models perform reliably in real products.
I’ve shipped low-latency GenAI APIs using FastAPI, serving 10K+ daily requests and delivering 200–300ms end-to-end responses with major latency reductions. I also fine-tune large models with QLoRA, optimize vector retrieval and re-ranking with FAISS, and improve production quality through prompt engineering and context management to reduce hallucination rates. Beyond LLM platforms, I’ve built NLP automation pipelines processing 20,000+ records/day and developed backend APIs handling 15,000+ daily requests with meaningful inference-time improvements.
Experience
Work history, roles, and key accomplishments
Founding AI/ML Engineer
YUGA AI
Dec 2025 - Present (6 months)
Built and scaled the AI backend for an adaptive learning platform using Python microservices, reducing latency by 30% and improving completion rates by 10% through personalized student modeling. Developed a production-grade RAG-based LLM tutor supporting 500+ concurrent lecture sessions and established reusable AI infrastructure standards that accelerated model integrations and onboarding.
Freelance AI Developer
New Gen Leads USA
Jan 2023 - Feb 2025 (2 years 1 month)
Built NLP automation pipelines processing 20,000+ records/day for lead enrichment, classification, and entity extraction. Developed and maintained backend APIs handling 15,000+ daily requests and reduced model inference time by 30% through preprocessing optimization and feature engineering.
Education
Degrees, certifications, and relevant coursework
Techno India University
Bachelor of Technology (B.Tech), Computer Science & Engineering (AI Specialisation)
2021 - 2025
Completed a B.Tech in Computer Science & Engineering with an AI specialization at Techno India University (Nov 2021–Aug 2025).
Availability
Location
Authorized to work in
Website
royxlead.netlify.appSocial media
Job categories
Interested in hiring Sourav?
You can contact Sourav and 90k+ other talented remote workers on Himalayas.
Message SouravFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
