Skip to main content
HimalayasHimalayas logo
Aashu UserAU
Open to opportunities

Aashu User

@aashuuser

AI Data Specialist focused on LLM/LAM/agent evaluation, benchmarking, and high-quality data.

India
Message

What I'm looking for

I’m looking for a role where I can drive LLM/LAM/agent evaluation—design benchmarks, run auditing against rubrics, and improve model retraining with high-quality datasets, prompt iteration, and clear root-cause reporting.

I’m an AI Data Specialist who builds dependable evaluation pipelines for LLM, LAM, and CUA systems—turning raw model behavior into actionable findings. I run benchmarking audits against acceptance criteria and evaluation rubrics to surface misalignments, regulatory risks, and clear error ratings that teams can use immediately.

In my recent work, I designed and refined prompt templates for agentic AI workflows, iterating until instruction following, model grounding, and autonomous execution were consistent across test cases. I also validate video trajectory datasets for CUA training by cross-checking spatial and temporal annotations against labeling guidelines to keep training inputs clean.

My projects reflect the same focus on realism and rigor. For CUA-Bench, I built a benchmark dataset of 100+ prompts with conditional branching and multi-constraint scenarios to evaluate decision-making, fallback handling, and instruction following under conditions real users actually run. For my LLM and LAM Failure Analysis and Audit Framework, I used failure taxonomy mapping and behavioral pattern classification to produce annotated datasets and error-rated audit reports.

I take quality seriously: I’ve maintained data accuracy above 99% through shared audit checklists, labeling standards, and quality reviews for teams of annotation specialists. I’m motivated by improving model retraining cycles with reproducible benchmarking frameworks, root cause analysis, and measurable robustness.

Experience

Work history, roles, and key accomplishments

ST
Current

AI Data Specialist

Softage Information Technology

Jan 2025 - Present (1 year 5 months)

Ran benchmarking audits on LLM, LAM, and CUA outputs against acceptance criteria and evaluation rubrics, producing root-cause analysis reports with error ratings that fed model retraining cycles. Designed prompt templates for agentic AI workflows and managed annotation quality so data accuracy stayed above 99%, including video trajectory dataset validation using labeling guidelines.

Education

Degrees, certifications, and relevant coursework

J.C. Bose University of Science & Technology logoJT

J.C. Bose University of Science & Technology

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Grade: CGPA: 8.24/10.0

Master of Computer Applications (MCA) at J.C. Bose University of Science & Technology (2023–2025), achieving CGPA 8.24/10.0.

J.C. Bose University of Science & Technology (YMCA) logoJY

J.C. Bose University of Science & Technology (YMCA)

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Grade: CGPA: 8.24/10.0

Pursued a Master of Computer Applications (MCA) degree at J.C. Bose University of Science & Technology (YMCA) in Faridabad, earning a CGPA of 8.24/10.0.

University Institute of Engineering & Technology (MDU Rohtak) logoUR

University Institute of Engineering & Technology (MDU Rohtak)

Bachelor of Computer Applications (BCA), Computer Applications

2020 - 2023

Grade: 72.60%

Completed a Bachelor of Computer Applications (BCA) at the University Institute of Engineering & Technology, MDU Rohtak, with 72.60%.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan