Open to opportunities

Aashu User

@aashuuser

Message

AI Data Specialist focused on LLM/LAM/agent evaluation, benchmarking, and high-quality data.

India

Message

What I'm looking for

I’m looking for a role where I can drive LLM/LAM/agent evaluation—design benchmarks, run auditing against rubrics, and improve model retraining with high-quality datasets, prompt iteration, and clear root-cause reporting.

I’m an AI Data Specialist who builds dependable evaluation pipelines for LLM, LAM, and CUA systems—turning raw model behavior into actionable findings. I run benchmarking audits against acceptance criteria and evaluation rubrics to surface misalignments, regulatory risks, and clear error ratings that teams can use immediately.

In my recent work, I designed and refined prompt templates for agentic AI workflows, iterating until instruction following, model grounding, and autonomous execution were consistent across test cases. I also validate video trajectory datasets for CUA training by cross-checking spatial and temporal annotations against labeling guidelines to keep training inputs clean.

My projects reflect the same focus on realism and rigor. For CUA-Bench, I built a benchmark dataset of 100+ prompts with conditional branching and multi-constraint scenarios to evaluate decision-making, fallback handling, and instruction following under conditions real users actually run. For my LLM and LAM Failure Analysis and Audit Framework, I used failure taxonomy mapping and behavioral pattern classification to produce annotated datasets and error-rated audit reports.

I take quality seriously: I’ve maintained data accuracy above 99% through shared audit checklists, labeling standards, and quality reviews for teams of annotation specialists. I’m motivated by improving model retraining cycles with reproducible benchmarking frameworks, root cause analysis, and measurable robustness.

Experience

Work history, roles, and key accomplishments

Current

AI Data Specialist

Current

Softage Information Technology

Jan 2025 - Present (1 year 6 months)

Ran benchmarking audits on LLM, LAM, and CUA outputs against acceptance criteria and evaluation rubrics, producing root-cause analysis reports with error ratings that fed model retraining cycles. Designed prompt templates for agentic AI workflows and managed annotation quality so data accuracy stayed above 99%, including video trajectory dataset validation using labeling guidelines.

LLM Evaluation CUA Benchmarking Evaluation Rubric Design Prompt Engineering Root Cause Analysis Data Validation Video Trajectory Validation

Education

Degrees, certifications, and relevant coursework

J.C. Bose University of Science & Technology

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Grade: CGPA: 8.24/10.0

Master of Computer Applications (MCA) at J.C. Bose University of Science & Technology (2023–2025), achieving CGPA 8.24/10.0.

J.C. Bose University of Science & Technology (YMCA)

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Grade: CGPA: 8.24/10.0

Pursued a Master of Computer Applications (MCA) degree at J.C. Bose University of Science & Technology (YMCA) in Faridabad, earning a CGPA of 8.24/10.0.