Skip to main content
HimalayasHimalayas logo
Dhruvin MeshiyaDM
Open to opportunities

Dhruvin Meshiya

@dhruvinmeshiya

AI & Machine Learning Engineer focused on computer vision, LLM pipelines, and multimodal video analytics.

India
Message

What I'm looking for

I’m looking for an AI/ML role where I can build production-grade computer vision and multimodal LLM systems—end to end—using agentic RAG, real-time streaming, and OCR pipelines, with room to scale capabilities safely and fast.

I’m an AI & Machine Learning Engineer with 3 years of hands-on experience building end-to-end systems that go from data ingestion to deployment. I focus on computer vision, LLM/Vision-LLM pipelines, real-time video analytics, and multi-modal AI workflows that are reliable in production.

At work, I’ve developed real-time object detection & tracking pipelines using YOLOv8/YOLO-NAS with ByteTrack / ByteSORT, improving tracking stability for video analytics. I’ve also built 3D hand pose estimation workflows and explored 3D Gaussian Splatting to push advanced computer-vision capabilities.

I build agentic, retrieval-augmented solutions that support real business use cases—like an Agentic ERP Automation & RAG chatbot with CrewAI and Langchain, multi-format ingestion, vector DBs, and an interactive speaking avatar for enterprise decision support. I implement OCR/Document Intelligence pipelines using Vision-LLM + prompt tuning to output structured JSON (e.g., receipt extraction with fields like totals and payment details) that teams can directly integrate.

I’m equally comfortable designing streaming architectures and production-style services, including low-latency multimodal agents using WebRTC with a dual-server FastAPI design, and end-to-end pipelines connecting STT → Vision-Language models → TTS. My approach is always modular, observable, and built to deliver fast, accurate outputs under real-world constraints.

Experience

Work history, roles, and key accomplishments

AL

AI & Machine Learning Engineer

Annotatehub Solutions LLP

Sep 2023 - Feb 2026 (2 years 5 months)

Built real-time object detection and tracking pipelines using YOLOv8/YOLO-NAS with ByteTrack/ByteSORT for video analytics workflows. Developed agentic ERP automation and a multi-modal RAG chatbot with multi-format ingestion, plus OCR receipt extraction with Vision-LLM and prompt tuning to output structured JSON.

Education

Degrees, certifications, and relevant coursework

SS

Shree Swami Atmanand Saraswati Institute of Technology (SSASIT)

Bachelor of Engineering (BE), Computer Engineering

2020 - 2024

Pursued a BE in Computer Engineering at SSASIT from 2020 to 2024.

AS

Ashadeep IIT, Surat

HSC, Higher Secondary Education

2019 - 2020

Completed HSC at Ashadeep IIT in Surat from 2019 to 2020.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan