I'm seeking remote or hybrid roles in Clinical NLP, medical NER, or annotation QA. I aim to support ethical AI by building HIPAA/GDPR-compliant datasets and NLP pipelines. I value roles where I can apply my skills in data quality, clinical text processing, and cross-functional collaboration to improve model performance and trust.
Samuel Ngari
@samuelngari
Clinical NLP & QA Specialist | Medical NER, Annotation QA, HIPAA-compliant AI datasets
What I'm looking for
I'm a Clinical NLP Annotation & QA Specialist with over a decade of experience in transcription, medical NER, and structured dataset creation for AI/ML systems. I specialize in building high-accuracy, HIPAA/GDPR-compliant pipelines that support clinical text understanding, emotion-aware speech models, and video object tracking tasks.
My technical skill set includes spaCy, Label Studio, Python, and QA workflows for medical NER, negation detection, section classification, and temporal normalization. I’ve led and published multiple open-source projects including:
Clinical NER Pipeline – medication/condition/dosage extraction with spaCy + EntityRuler
Annotation QA Toolkit – machine vs. human evaluation with Precision, Recall & F1
Clinical Temporal Normalizer – ISO 8601 time expression normalizer from free text
Audio Emotion & Speaker Annotation – emotion-tagged speech datasets for AI
Sports Video Annotation & Player Tracking – object tracking datasets for vision models
With 10,000+ hours of transcription, post-editing, and annotation delivered, I’m driven by precision, trust & safety, and the ethical use of AI. I’m currently open to roles in Clinical NLP, data QA, and AI annotation — both freelance and long-term.
Experience
Work history, roles, and key accomplishments
Video Annotation Specialist
Independent
Jan 2025 - Present (5 months)
Created object-tracking and event-labeling datasets for sports video AI projects.
Audio Emotion & Speaker Annotator
Independent
Jan 2025 - Present (5 months)
Labeled audio for speaker turns and emotional tone (e.g., Neutral, Happy, Angry) for AI emotion recognition models.
Medical AI Data Annotator (NER)
Independent
Jan 2025 - Present (5 months)
Annotated clinical text with Label Studio for machine-readable NLP datasets. Built and published medical NER projects for open-source use.
Clinical NLP & Data Annotation QA Specialist
Independent Projects
Jan 2025 - Present (5 months)
Built Clinical NLP pipelines with spaCy + EntityRuler to extract medical entities (MEDICATION, DOSAGE, CONDITION). Developed QA tools to compare machine vs human annotations (Precision, Recall, F1).
AI Post-Editing & Transcription QA
Focus Forward
Jan 2020 - Jan 2025 (5 years)
Edited 500+ hours of AI-generated transcripts in medical and legal domains, improving accuracy to 99%+. Ensured speaker labeling, timestamp integrity, and compliance with HIPAA/GDPR standards.
Education
Degrees, certifications, and relevant coursework
University of Nairobi
Bachelor of Education (Arts), Education
2016 - 2018
Focused on language and communication, providing a strong foundation for clinical text annotation and transcription work.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
github.com/samuelnjerungariSalary expectations
Social media
Job categories
Interested in hiring Samuel?
You can contact Samuel and 90k+ other talented remote workers on Himalayas.
Message SamuelFind your dream job
Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
