Geoffrey Mwangi
@geoffreymwangi
AI training and data annotation specialist building high-quality ML datasets and full-stack software with cybersecurity rigor.
What I'm looking for
I’m a detail-oriented AI Training Specialist and Data Annotation Generalist with 5+ years of progressive experience across AI training, data labeling, full-stack development, and cybersecurity. I hold an MSc in Software Engineering (Distinction) and a BSc in Computer Science and Mathematics.
For the past 3 years, I’ve worked as a parallel contractor across Outlier, Remotasks (Scale AI), and Handshake, contributing to training and evaluation of next-generation machine learning and natural language processing models. I annotate and validate text, code, image, and structured datasets with strong attention to guideline interpretation, consistency, and data integrity.
I help improve model alignment through RLHF workflows—evaluating and ranking AI-generated responses for quality, factual accuracy, tone, and intent. I also review and correct peer annotations to maintain inter-annotator agreement, document edge cases, and provide actionable feedback to teams and project managers.
Alongside annotation work, I build and maintain software systems with a security-first mindset. As a Senior Software Engineer (Remote), I architect Django microservices at scale, automate CI/CD with Docker and GitHub Actions, and have hands-on experience with OWASP Top 10, penetration testing support, and Splunk SIEM monitoring.
Experience
Work history, roles, and key accomplishments
Senior Software Engineer
TechNova Solutions Ltd
Jan 2024 - Present (2 years 6 months)
Architected and maintained a Django microservices platform and improved reliability through caching and proactive monitoring. Designed RESTful APIs integrating Stripe and M-Pesa Daraja, automated CI/CD with GitHub Actions and Docker, and mentored junior developers through code reviews.
NLP Annotator
Handshake
Jun 2022 - Dec 2024 (2 years 6 months)
Contributed structured NLP and conversational AI annotation data covering intent classification, entity recognition, sentiment labeling, and dialogue quality assessment. Reviewed and validated model outputs against human-written references and documented annotation processes and edge case decisions.
Completed high-volume multimodal annotation tasks across text classification, image bounding boxes, semantic segmentation, and audio transcription, while maintaining platform quality targets. Documented annotation decisions and edge cases and participated in qualification and training sessions for new task types.
Labeled and annotated large-scale text, code, and instruction datasets for LLM training, following strict project guidelines for consistency and accuracy. Evaluated and ranked AI-generated responses for quality and alignment to support RLHF pipelines, and assessed code samples in Python and JavaScript.
Software Developer
DataVault Kenya
Mar 2022 - Dec 2023 (1 year 9 months)
Built internal data management tools and real-time KPI dashboards using Python with Plotly/Dash to replace manual reporting workflows. Migrated MySQL to PostgreSQL with query optimizations, productionized a churn prediction ML model with scikit-learn, and improved test coverage using pytest.
Delivered multiple client web projects using Django and front-end web technologies, including JavaScript, HTML5, and CSS3. Built e-commerce storefronts with PayPal and Stripe integrations and managed scope, timelines, and communications independently.
Cybersecurity Analyst Intern
Safaricom PLC
Jan 2021 - May 2021 (4 months)
Assisted with vulnerability assessments and penetration testing of internal web portals using OWASP Top 10 methodologies. Monitored Splunk SIEM dashboards, escalated security incidents within SLA windows, and contributed to remediation reports reviewed by the CISO team.
Education
Degrees, certifications, and relevant coursework
University of Nairobi
Master of Science (MSc) in Software Engineering, Software Engineering
2023 - 2025
Grade: Distinction (dissertation grade: A+)
Activities and societies: Dissertation: Adaptive Threat Detection in Distributed Web Systems (ML). Graduate Teaching Assistant for Python and database design labs (60+ students). Key modules included Advanced Software Architecture, Cloud-Native Development, ML for Developers, and Enterprise Security.
Completed an MSc in Software Engineering with a distinction, including a dissertation on adaptive threat detection in distributed web systems using machine learning. Served as a Graduate Teaching Assistant facilitating Python and database design labs for 60+ undergraduate students.
Zetech University
Bachelor of Science (BSc) in Computer Science and Mathematics, Computer Science and Mathematics
2022 - 2024
Grade: Final year semester average: B (Good); final project grade: A
Activities and societies: Final Year Project: School Notice Board System (Django/Python/MySQL) with role-based access control. Coursework included Cyber Security, Probability & Statistics I & II, Multivariate Data Analysis, Numerical Analysis, Calculus I, Ordinary Differential Equations, and Mathematical Programming.
Completed a BSc in Computer Science and Mathematics, finishing with a good final-year semester average and strong performance in multiple core courses. Completed a final year project building a School Notice Board System using Django/Python/MySQL with role-based access control.
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Geoffrey?
You can contact Geoffrey and 90k+ other talented remote workers on Himalayas.
Message GeoffreyGet matched with your dream remote job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
