EMMAH GITHINJI
@emmahgithinji
AI Trainer and data specialist improving ML model quality through expert annotation.
What I'm looking for
I’m an AI Trainer and native Swahili linguist with advanced English proficiency (C2). I help teams build better machine learning systems by delivering reliable data annotation, collection, and quality evaluation that directly improves model outcomes.
In my current AWS re/Start Program, I engineered RLHF workflows and ran pairwise comparisons that helped uncover a 20% model failure rate for FinOps anomaly detection. I also built synthetic data pipelines with Python and LangChain to automate prompt evaluation across 500+ test cases, cutting manual review time by 40%.
I’ve created golden datasets with 98% alignment to quality rubrics by collecting and pre-processing raw data with strong attention to measurement and consistency. Previously, as a Data Annotation Specialist (freelance/contract), I transcribed sensitive audio and escalated critical cases with 100% accuracy, ensuring safety-critical information reached care coordinators.
I also digitized and structured data (like PDF invoices) to reduce retrieval time by 50%, and completed web research across 200+ companies with 98% accuracy through rigorous data validation. Across legal and general domains, I’ve transcribed, translated, and annotated audio/video content—reducing legal terminology error rates from 15% to 3%—and I apply the same quality discipline to translation and transcription work.
Experience
Work history, roles, and key accomplishments
Engineered RLHF workflows and ran pairwise comparisons, uncovering a 20% model failure rate for FinOps anomaly detection and improving data evaluation quality. Built synthetic data pipelines with Python and LangChain across 500+ test cases and created golden datasets achieving 98% alignment to quality rubrics.
Data Annotation Specialist
Cloudfactory
Jan 2020 - Jan 2022 (2 years)
Transcribed sensitive audio for elderly home calls, identifying medical/safety emergencies and escalating critical cases with 100% accuracy. Digitized PDF invoices into structured labeled data, cutting retrieval time by 50%, and conducted web research on 200+ companies with 98% accuracy through rigorous validation.
English–Swahili Translator
Freelance
Jan 2015 - Jan 2020 (5 years)
Provided real-time interpretation and transcription for community/church events, converting English sermons to Swahili across hundreds of hours with 100% accuracy. Organized translated content with quality rubrics to ensure cultural relevance and prepare materials for AI training datasets.
Education
Degrees, certifications, and relevant coursework
Dedan Kimathi University of Technology
Bachelor of Science, Geospatial Information Systems and Remote Sensing
Earned a BSc in Geospatial Information Systems and Remote Sensing from Dedan Kimathi University of Technology.
Availability
Location
Authorized to work in
Job categories
Interested in hiring EMMAH?
You can contact EMMAH and 90k+ other talented remote workers on Himalayas.
Message EMMAHFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
