Ashish Solanki
@ashishsolanki1
AI Engineer building production Document AI, RAG, and LLM/VLM extraction systems that work reliably.
What I'm looking for
I’m an AI Engineer focused on production Document AI, RAG, and LLM/VLM-based extraction systems. I build pipelines that reliably handle digital PDFs, scanned PDFs, images, and multi-page files—then I validate and harden them against real edge cases.
At Capsitech, I improved a bank statement extraction system from ~60% to ~95% quality by replacing Azure Document Intelligence fallback with a fine-tuned Qwen-VL-based workflow. I also expanded bank coverage from 12 live/UAT banks to ~60 banks and 138 versions by designing bank/version-specific logic using coordinate parsing, OpenCV line detection, IoU mapping, NMS-style boundaries, and row-merging rules.
I built an HMRC-style RAG chatbot and ingestion pipeline using LangChain and MongoDB Atlas hybrid retrieval, including a recursive, resumable crawler that generated structured datasets for indexing. I improved internal RAG answer quality from ~50% to ~80% across simple, complex, follow-up, keyword-based, indirect, calculation-based, and year-specific queries.
I delivered invoice extraction workflows using YOLO-based field detection plus OCR-to-detection mapping, and then upgraded unknown-template extraction with Qwen-VL (from Qwen-VL-2.5-7B to Qwen-VL-3-4B), raising quality from ~80% to ~90%. In production FastAPI services, I reduced representative scanned/VLM extraction time from ~86 seconds to ~20 seconds with vLLM batching, and strengthened reliability with Pydantic validation, structured logging, error capture, and Docker-based deployment/debugging. I was also recognized with a Best Paper Award (TIACOMP 2024) for emerging technologies and smart systems.
Experience
Work history, roles, and key accomplishments
Assistant ML Engineer
Capsitech
Sep 2024 - Present (1 year 8 months)
Improved bank statement extraction quality from ~60% to 95% by replacing Azure Document Intelligence fallback with a fine-tuned Qwen-VL VLM extraction workflow, and expanded coverage from 12 to ~60 banks and 138 versions. Built HMRC-style RAG chatbot/data ingestion (improved answers ~50% to 80%) and accelerated production FastAPI document processing from ~86s to 20s using vLLM batching.
AI/ML Engineer Intern
Websoham
Mar 2024 - May 2024 (2 months)
Developed automated bank statement PDF extraction handling encrypted PDFs, table detection/parsing, and multi-page Excel output generation. Worked on video transcription and shorts generation (timestamp mapping, captions, hook selection) and explored OpenCV/MediaPipe-based lip movement analysis for speaker detection.
Education
Degrees, certifications, and relevant coursework
Chandigarh University
Master of Engineering in Artificial Intelligence, Artificial Intelligence
2022 - 2024
Activities and societies: Thesis/papers: High-Resolution Fashion Image Generation using Quantum-GAN; Review paper: Comprehensive Analysis on Image Generation using Quantum GAN.
Master of Engineering in Artificial Intelligence at Chandigarh University. Thesis and research focused on high-resolution fashion image generation using Quantum-GAN.
Indian Institute of Business Management
Post Graduate Program in Data Science, Data Science
2020 - 2021
Activities and societies: 6-month internship/training covering Machine Learning, AI, and Predictive Modeling.
Post Graduate Program in Data Science at the Indian Institute of Business Management. Completed a hands-on 6-month training in machine learning, AI, and predictive modeling.
U.V. Patel College of Engineering
Bachelor of Technology in Mechatronics Engineering, Mechatronics Engineering
2016 - 2020
Activities and societies: Major project: Automatic conveyor belt for loading/unloading with inspection using pick and place robot.
Bachelor of Technology in Mechatronics Engineering at U.V. Patel College of Engineering. Completed a major project related to an automatic conveyor belt with inspection using a pick-and-place robot.
Availability
Location
Authorized to work in
Salary expectations
Social media
Job categories
Skills
Interested in hiring Ashish?
You can contact Ashish and 90k+ other talented remote workers on Himalayas.
Message AshishFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
