Victor Mbithi
@victormbithi1
Detail-oriented linguist and data annotation specialist for high-quality ML training datasets.
What I'm looking for
I am a linguist and data annotation specialist with 4+ years of experience in transcription, machine translation post-editing (MTPE), and focused dataset collection to support NLP, ASR, and computer vision projects. I work reliably under detailed guidelines to produce consistent, high-quality labeled text, audio, and image data, and I contribute to QA cycles and guideline improvements to reduce inter-annotator disagreement.
I hold a BA in Linguistics and have self-studied core ML concepts and annotation best practices, with hands-on experience using Labelbox, CVAT, and Prodigy plus CSV/JSON and spreadsheet workflows. I am adaptable to remote, distributed teams, available for flexible schedules, and committed to secure dataset handling, metadata management, and reproducible dataset versioning.
Experience
Work history, roles, and key accomplishments
Data Annotation Specialist
Freelance
Jan 2019 - Present (6 years 9 months)
Deliver high-quality text, audio, and image annotation and MTPE services for ML projects, ensuring guideline adherence and reducing inter-annotator disagreement through QA and guideline improvements.
Translation & Linguistic QA
Freelance
Jan 2018 - Present (7 years 9 months)
Edit MT output and perform linguistic QA for web and technical content, producing glossaries and style guides to ensure domain-appropriate voice and dataset consistency.
Contract Annotator
Multiple Clients
Jan 2021 - Dec 2024 (3 years 11 months)
Executed short-turnaround image and text tagging projects (classification, bounding boxes, NER, sentiment) while completing qualification tests and reducing ambiguity via logged reports and guideline clarifications.
Education
Degrees, certifications, and relevant coursework
University of Nairobi
Bachelor of Arts, Linguistics
Activities and societies: Relevant coursework: Phonetics & Phonology, Syntax, Semantics, Sociolinguistics.
Completed a Bachelor of Arts in Linguistics with coursework in phonetics & phonology, syntax, semantics, and sociolinguistics, providing a strong foundation for language annotation and linguistic analysis.
