Open to opportunities

Yamalith Díaz

@yamalithdaz

I evaluate AI responses and ensure trust & safety with QA rigor.

Peru

What I'm looking for

I’m looking for AI evaluation and Trust & Safety work where I can run rubric-based quality checks, do pairwise comparisons, and contribute to calibration and policy improvements—ideally with RLHF annotation and data quality responsibilities.

I’m an AI evaluator and QA specialist with 4+ years in high-volume content review and trust & safety operations, focused on accuracy, judgment quality, and clear feedback. I’ve worked across real estate and leasing domain prompts as well as high-sensitivity moderation workflows.

In my AI Domain Expert role, I evaluate model responses using a 6-dimension rubric—Instruction Following, Truthfulness, Correctness, Writing Quality, Verbosity, and Overall Quality. I also run comparative pairwise evaluations, selecting preferred outputs and writing detailed justifications per dimension, then stress-test reasoning with domain-specific prompts.

Previously, I was promoted from line moderator to QA specialist at Teleperformance/ByteDance (TikTok), where I audited policy compliance, maintained 97–100% accuracy, and consistently ranked as a top performer. I analyzed edge cases and identified policy gaps in queues (including Hate Speech, Graphic Content, and Violent Behavior), and I delivered calibration feedback while serving as the SME point of contact for escalations.

I bring strong operational discipline from moderation and content quality roles, where I reviewed 1,000–1,300 items/day and supported onboarding and policy Q&A for team members. I’m open to AI evaluation, RLHF annotation, data quality, and Trust & Safety roles where rubric-based quality and continuous calibration make a measurable impact.

Experience

Work history, roles, and key accomplishments

Current

AI Domain Expert (Real Estate)

Mercor

Jan 2026 - Present (5 months)

Evaluate AI model responses for real estate and leasing using a 6-dimension rubric (instruction following, truthfulness, correctness, writing quality, verbosity, overall quality). Conduct comparative pairwise scoring with detailed justifications, and write domain-specific prompts to stress-test reasoning.

Education

Degrees, certifications, and relevant coursework

Yamalith hasn't added their education


