HimalayasHimalayas logo
Livia MaiaLM
Open to opportunities

Livia Maia

@livmaia

AI evaluation specialist with QA testing and language analysis background, designing frameworks to improve linguistic accuracy and LLM performance.

United States
Message

What I'm looking for

I’m looking to lead AI evaluation and human-judgment systems end-to-end—building rubrics, annotation workflows, and multilingual quality standards—while partnering with Engineering, Product, and Data Science to ship reliable, decision-ready model outputs.

I’m an AI evaluation specialist with 5+ years of experience building and scaling human judgment systems for AI/ML at production scale. With a background in QA testing—including mobile application testing—and language analysis, I design evaluation frameworks that translate complex model behavior into clear, actionable insights. My work focuses on linguistic quality evaluation, including rubric and rating scale development, defect taxonomy, labeling guidelines, and QA systems that ensure consistency and reliability.

I’ve contributed to evaluation strategy across 10+ AI/ML initiatives, including LLM output assessment and LLM-as-a-judge and human-in-the-loop workflows. I specialize in failure mode analysis, adversarial testing, and edge case evaluation, partnering with engineering and product teams to drive model improvements. I bring multilingual expertise in Brazilian Portuguese, English, and Spanish, and have led or supported large-scale data collection efforts with 6,000+ participants, implementing quality monitoring processes to ensure data integrity and detect drift over time.

Experience

Work history, roles, and key accomplishments

Google logoGO
Current

AI/ML Evaluation Specialist

Jan 2023 - Present (3 years 3 months)

Led end-to-end evaluation strategy across 10+ AI/ML initiatives, defining rubrics, rating scales, and defect taxonomies to produce consistent, decision-ready outputs at production scale. Orchestrated evaluation and QA workflows across 10+ projects, supported 6,000+ contributors through onboarding and calibration, and performed deep error analysis to drive engineering and product iteration prioriti

San Francisco State University logoSU

Creative Writing Graduate Instructor

San Francisco State University

Jan 2019 - Jan 2022 (3 years)

Designed evaluation rubrics for open-ended, subjective work and delivered evidence-based, structured feedback to support consistent assessment standards. Built an interdisciplinary curriculum with faculty to establish shared quality expectations across courses.

Education

Degrees, certifications, and relevant coursework

San Francisco State University logoSU

San Francisco State University

Master of Fine Arts, Creative Writing (Literary Translation)

Completed a Master of Fine Arts in Creative Writing with a focus on Literary Translation at San Francisco State University.

Hunter College, CUNY logoHC

Hunter College, CUNY

Bachelor of Arts, English Literature & Creative Writing

Earned a Bachelor of Arts in English Literature & Creative Writing from Hunter College (CUNY).

Universidade do Estado do Rio Grande do Norte, Brazil logoUB

Universidade do Estado do Rio Grande do Norte, Brazil

Bachelor of Arts, Communication Studies

Earned a Bachelor of Arts in Communication Studies from Universidade do Estado do Rio Grande do Norte.

Tech stack

Software and tools used professionally

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan