Atharva Ingle - Data Scientist II - Wolters Kluwer | Himalayas
Open to opportunities

Atharva Ingle

@atharvaingle

Data Scientist II specializing in AI and machine learning solutions.

India

What I'm looking for

I seek a collaborative environment that fosters innovation and growth in AI technologies.

I am a Data Scientist II at Wolters Kluwer, where I lead AI capabilities for several projects, including the processing of 65 million Uniform Commercial Code (UCC) filings. My expertise lies in fine-tuning and deploying advanced models, such as the Qwen 2 VL 7B Vision-Language Model, which reached 95% extraction accuracy with fast inference.

Throughout my career, I have built RAG-based chatbots and internal frameworks for model fine-tuning that significantly improved operational efficiency. My contributions have been recognized with awards, including the Global Innovation Award for my role in the Content Rocket project, which also won the CEO's Choice Award among numerous competing teams.

Experience

Work history, roles, and key accomplishments

Current

Data Scientist II

Wolters Kluwer

Aug 2023 - Present (1 year 11 months)

Led AI capabilities for Borrower Analytics 2.0, processing 65 million Uniform Commercial Code (UCC) filings and fine-tuning a Qwen 2 VL 7B Vision-Language Model (VLM) to reach 95% extraction accuracy. Developed an LLM-as-a-judge framework to generate high-quality labeled data, significantly reducing the human labeling effort needed for VLM fine-tuning.

Current

Data Scientist II

Wolters Kluwer

Aug 2023 - Present (1 year 11 months)

Developed a RAG-based chatbot built on an XML-based IRA knowledge base, achieving a 97% customer satisfaction rate. Designed and implemented an advanced RAG pipeline incorporating query transformation, parent-child retrieval, hierarchical indexing, and multi-vector database retrieval using LanceDB.

Current

Data Scientist II

Wolters Kluwer

Aug 2023 - Present (1 year 11 months)

Developed an end-to-end internal framework for fine-tuning production models such as Donut, LiLT, and Qwen, and made it accessible to non-technical users. Integrated MLflow for experiment tracking, model registry, and dataset/model versioning. The framework also supports distributed multi-GPU training and advanced techniques.

Current

Weights & Biases Ambassador

Weights & Biases

May 2022 - Present (3 years 2 months)

Optimized Kaggle notebooks and added support for W&B tracking and monitoring tools. Wrote technical reports showcasing W&B features in diverse areas, including medical imaging, Vision-Language Models, few-shot learning, PyTorch 2.0, Hugging Face, RAG, and LLMs.


Data Science Intern

Wolters Kluwer

Jan 2023 - Present (2 years 6 months)

Automated key information extraction (KIE) from legal documents by building an in-house solution on transformer models such as Donut, LiLT, and Pix2Struct. The solution cut costs by 98% compared to Azure Form Recognizer. The work also included building a document classification pipeline.

Education

Degrees, certifications, and relevant coursework


Vishwakarma Institute of Technology

Bachelor of Technology, Instrumentation and Control Engineering

Grade: 9.72/10

Completed a Bachelor of Technology in Instrumentation and Control Engineering. Studied core concepts and applied principles in various projects. Achieved a strong academic record with a CGPA of 9.72/10.
