Atharva Ingle
@atharvaingle
Data Scientist II specializing in AI and machine learning solutions.
What I'm looking for
I am a Data Scientist II at Wolters Kluwer, where I lead AI capabilities for various projects, including the processing of 65 million Uniform Commercial Code filings. My expertise lies in fine-tuning and deploying advanced models, such as the Qwen 2 VL 7B Vision-Language Model, achieving remarkable extraction accuracy and inference speeds.
Throughout my career, I have developed innovative solutions like RAG-based chatbots and internal frameworks for model fine-tuning, significantly enhancing operational efficiency. My contributions have been recognized through awards, including the Global Innovation Award for my role in the Content Rocket project, which won the CEO's Choice Award among numerous teams.
Experience
Work history, roles, and key accomplishments
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Core contributor to Content Rocket, focusing on monetizing existing data and transforming user interactions with GenAI. Led multiple sub-projects and developed a standardized AI workflow for RAG, document classification, and chatbots, significantly reducing manual effort.
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Built a search engine to recommend required licenses for starting a business, developing a HyDE workflow to expand user queries. Used synthetically generated descriptions and a reranker, achieving 96% retrieval accuracy.
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Led AI capabilities for Borrower Analytics 2.0, processing 65 million Uniform Commercial Code (UCC) filings and fine-tuning a Qwen 2 VL 7B Vision-Language Model (VLM) for 95% extraction accuracy. Developed an LLM-as-a-judge framework to generate high-quality labeled data, significantly reducing human labeling effort for VLM fine-tuning.
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Developed a sophisticated RAG-based chatbot leveraging an XML-based IRA knowledge base, achieving a 97% customer satisfaction rate. Designed and implemented an advanced RAG pipeline incorporating query transformation, parent-child retrieval, hierarchical indexing, and multi-vector database retrieval using LanceDB.
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Built a RAG-based chatbot on 91k legal citations for efficient legal query resolution, enhancing retrieval with metadata filtering and query transformation. Used BERTTopic to generate hypothetical topics for improved search space reduction and query handling.
Data Scientist II
Wolters Kluwer
Aug 2023 - Present (1 year 11 months)
Developed an end-to-end internal framework for fine-tuning various production models like Donut, LiLT, and Qwen, making it accessible for non-technical users. Integrated MLflow for experiment tracking, model registry, and dataset/model versioning, supporting distributed multi-GPU training and advanced techniques.
Weights & Biases Ambassador
Weights and Biases
May 2022 - Present (3 years 2 months)
Optimized Kaggle notebooks with added support for W&B tracking and monitoring tools. Wrote technical reports showcasing W&B features in diverse areas including medical imaging, Visual-Language Models, few-shot learning, PyTorch 2.0, HuggingFace, RAG, and LLMs.
Data Science Intern
Wolters Kluwer
Jan 2023 - Present (2 years 6 months)
Automated key information extraction (KIE) from legal documents, creating an in-house solution using transformer models like Donut, LiLT, and pix2struct. This achieved a 98% cost reduction compared to Azure Form Recognizer and included building a document classification pipeline.
Education
Degrees, certifications, and relevant coursework
Vishwakarma Institute of Technology
Bachelor of Technology, Instrumentation and Control Engineering
Grade: 9.72/10
Completed a Bachelor of Technology in Instrumentation and Control Engineering. Studied core concepts and applied principles in various projects. Achieved a strong academic record with a CGPA of 9.72/10.
Availability
Location
Authorized to work in
Website
atharva.bearblog.devJob categories
Interested in hiring Atharva?
You can contact Atharva and 90k+ other talented remote workers on Himalayas.
Message AtharvaFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
