Ashish Solanki
@ashishsolanki1
AI Engineer building production Document AI, RAG, and LLM/VLM extraction systems that work reliably.
What I'm looking for
I’m an AI Engineer focused on production Document AI, RAG, and LLM/VLM-based extraction systems. I build pipelines that reliably handle digital PDFs, scanned PDFs, images, and multi-page files—then I validate and harden them against real edge cases.
At Capsitech, I improved a bank statement extraction system from ~60% to ~95% quality by replacing Azure Document Intelligence fallback with a fine-tuned Qwen-VL-based workflow. I also expanded bank coverage from 12 live/UAT banks to ~60 banks and 138 versions by designing bank/version-specific logic using coordinate parsing, OpenCV line detection, IoU mapping, NMS-style boundaries, and row-merging rules.
I built an HMRC-style RAG chatbot and ingestion pipeline using LangChain and MongoDB Atlas hybrid retrieval, including a recursive, resumable crawler that generated structured datasets for indexing. I improved internal RAG answer quality from ~50% to ~80% across simple, complex, follow-up, keyword-based, indirect, calculation-based, and year-specific queries.
I delivered invoice extraction workflows using YOLO-based field detection plus OCR-to-detection mapping, and then upgraded unknown-template extraction with Qwen-VL (from Qwen-VL-2.5-7B to Qwen-VL-3-4B), raising quality from ~80% to ~90%. In production FastAPI services, I reduced representative scanned/VLM extraction time from ~86 seconds to ~20 seconds with vLLM batching, and strengthened reliability with Pydantic validation, structured logging, error capture, and Docker-based deployment/debugging. I was also recognized with a Best Paper Award (TIACOMP 2024) for emerging technologies and smart systems.
Experience
Work history, roles, and key accomplishments
Assistant ML Engineer
Capsitech
Sep 2024 - Present (1 year 9 months)
Improved bank statement extraction quality from ~60% to 95% by replacing Azure Document Intelligence fallback with a fine-tuned Qwen-VL VLM extraction workflow, and expanded coverage from 12 to ~60 banks and 138 versions. Built HMRC-style RAG chatbot/data ingestion (improved answers ~50% to 80%) and accelerated production FastAPI document processing from ~86s to 20s using vLLM batching.
AI/ML Engineer Intern
Websoham
Mar 2024 - May 2024 (2 months)
Developed automated bank statement PDF extraction handling encrypted PDFs, table detection/parsing, and multi-page Excel output generation. Worked on video transcription and shorts generation (timestamp mapping, captions, hook selection) and explored OpenCV/MediaPipe-based lip movement analysis for speaker detection.
Education
Degrees, certifications, and relevant coursework
Chandigarh University
Master of Engineering in Artificial Intelligence, Artificial Intelligence
2022 - 2024
Activities and societies: Thesis/papers: High-Resolution Fashion Image Generation using Quantum-GAN; Review paper: Comprehensive Analysis on Image Generation using Quantum GAN.
Master of Engineering in Artificial Intelligence at Chandigarh University. Thesis and research focused on high-resolution fashion image generation using Quantum-GAN.
Indian Institute of Business Management
Post Graduate Program in Data Science, Data Science
2020 - 2021
Activities and societies: 6-month internship/training covering Machine Learning, AI, and Predictive Modeling.
Post Graduate Program in Data Science at the Indian Institute of Business Management. Completed a hands-on 6-month training in machine learning, AI, and predictive modeling.
U.V. Patel College of Engineering
Bachelor of Technology in Mechatronics Engineering, Mechatronics Engineering
2016 - 2020
Activities and societies: Major project: Automatic conveyor belt for loading/unloading with inspection using pick and place robot.
Bachelor of Technology in Mechatronics Engineering at U.V. Patel College of Engineering. Completed a major project related to an automatic conveyor belt with inspection using a pick-and-place robot.
Availability
Location
Authorized to work in
Salary expectations
Social media
Job categories
Skills
Interested in hiring Ashish?
You can contact Ashish and 90k+ other talented remote workers on Himalayas.
Message AshishFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
