Livia Maia
@livmaia
AI evaluation specialist with QA testing and language analysis background, designing frameworks to improve linguistic accuracy and LLM performance.
What I'm looking for
I’m an AI evaluation specialist with 5+ years of experience building and scaling human judgment systems for AI/ML at production scale. With a background in QA testing—including mobile application testing—and language analysis, I design evaluation frameworks that translate complex model behavior into clear, actionable insights. My work focuses on linguistic quality evaluation, including rubric and rating scale development, defect taxonomy, labeling guidelines, and QA systems that ensure consistency and reliability.
I’ve contributed to evaluation strategy across 10+ AI/ML initiatives, including LLM output assessment, LLM-as-a-judge, and human-in-the-loop workflows. I specialize in failure mode analysis, adversarial testing, and edge case evaluation, partnering with engineering and product teams to drive model improvements. I bring multilingual expertise in Brazilian Portuguese, English, and Spanish, and have led or supported large-scale data collection efforts with 6,000+ participants, implementing quality monitoring processes to ensure data integrity and detect drift over time.
Experience
Work history, roles, and key accomplishments
Led end-to-end evaluation strategy across 10+ AI/ML initiatives, defining rubrics, rating scales, and defect taxonomies to produce consistent, decision-ready outputs at production scale. Orchestrated evaluation and QA workflows across 10+ projects, supported 6,000+ contributors through onboarding and calibration, and performed deep error analysis to drive engineering and product iteration prioriti
Creative Writing Graduate Instructor
San Francisco State University
Jan 2019 - Jan 2022 (3 years)
Designed evaluation rubrics for open-ended, subjective work and delivered evidence-based, structured feedback to support consistent assessment standards. Built an interdisciplinary curriculum with faculty to establish shared quality expectations across courses.
Localization QA Tester
Urban Apps
Jan 2014 - Present (12 years 3 months)
Tested multilingual mobile apps (EN/ES/PT) for translation quality, UX compliance, and cultural fit, documenting issues and recommending process changes to reduce turnaround time by 15%. Applied locale-specific quality standards to improve consistency across languages.
Education
Degrees, certifications, and relevant coursework
San Francisco State University
Master of Fine Arts, Creative Writing (Literary Translation)
Completed a Master of Fine Arts in Creative Writing with a focus on Literary Translation at San Francisco State University.
Hunter College, CUNY
Bachelor of Arts, English Literature & Creative Writing
Earned a Bachelor of Arts in English Literature & Creative Writing from Hunter College (CUNY).
Universidade do Estado do Rio Grande do Norte, Brazil
Bachelor of Arts, Communication Studies
Earned a Bachelor of Arts in Communication Studies from Universidade do Estado do Rio Grande do Norte.
Portfolio
bit.ly/liviamaia
