Himalayas logo
VM
Open to opportunities

Victor Mbithi

@victormbithi1

Detail-oriented linguist and data annotation specialist for high-quality ML training datasets.

Kenya
Message

What I'm looking for

I seek remote roles contributing to high-quality ML datasets where I can apply linguistic expertise, annotation, QA, and guideline development within collaborative teams.

I am a linguist and data annotation specialist with 4+ years of experience transcribing, MTPE/post-editing, and collecting focused datasets to support NLP, ASR, and computer vision projects. I work reliably under detailed guidelines to produce consistent, high-quality labeled text, audio, and image data, and I contribute to QA cycles and guideline improvements to reduce inter-annotator disagreement.

I hold a BA in Linguistics and have self-studied core ML concepts and annotation best practices, with hands-on experience using Labelbox, CVAT, and Prodigy plus CSV/JSON and spreadsheet workflows. I am adaptable to remote, distributed teams, available for flexible schedules, and committed to secure dataset handling, metadata management, and reproducible dataset versioning.

Experience

Work history, roles, and key accomplishments

FR
Current

Translation & Linguistic QA

Freelance

Jan 2018 - Present (7 years 9 months)

Edited MT output and performed linguistic QA for web and technical content, producing glossaries and style guides to ensure domain-appropriate voice and dataset consistency.

MC

Contract Annotator

Multiple Clients

Jan 2021 - Dec 2024 (3 years 11 months)

Executed short-turnaround image and text tagging projects (classification, bounding boxes, NER, sentiment) while completing qualification tests and reducing ambiguity via logged reports and guideline clarifications.

Education

Degrees, certifications, and relevant coursework

UN

University of Nairobi

Bachelor of Arts, Linguistics

Activities and societies: Relevant coursework: Phonetics & Phonology, Syntax, Semantics, Sociolinguistics.

Completed a Bachelor of Arts in Linguistics with coursework in phonetics & phonology, syntax, semantics, and sociolinguistics, providing a strong foundation for language annotation and linguistic analysis.

Tech stack

Software and tools used professionally

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Victor Mbithi - Data Annotation Specialist - Freelance | Himalayas