Paul Mwangi
@paulmwangi2
AI Research Engineer and full-stack developer specializing in LLM evaluation, bias auditing, and AI-driven application delivery.
What I'm looking for
I’m an AI Research Engineer and full-stack developer specialized in end-to-end development of AI-driven applications and the evaluation of Large Language Models. I focus on making AI outputs accurate, safe, and dependable in real workflows.
I identify algorithmic bias, mitigate hallucinations, and audit complex codebases using Python, Java, and Dart. I also work with advanced NLP and LLM techniques like RLHF, Model Grounding, and LLM Benchmarking to test boundaries and improve reliability.
In my current role, I reduced unresolved discrepancies by 30% by investigating transaction variances with automated validation techniques, while maintaining 100% compliance with reporting deadlines. Previously, I developed Flutter mobile applications with robust Firebase backends and integrated Dialogflow with Telegram for conversational agents, including extensive flow testing and issue tracking.
I’m currently building my expertise through an MSc in Artificial Intelligence and hands-on projects spanning delivery workflows, inspection/maintenance tracking, merchandising monitoring, and a Smart Farm prototype combining mobile interaction with cloud data storage and sensor concepts.
Experience
Work history, roles, and key accomplishments
Night Auditor & Data Analyst
Village Hotel
Mar 2025 - Present (1 year 2 months)
Reduced unresolved discrepancies by 30% through investigation of transaction variances. Maintained 100% compliance with financial reporting deadlines using automated data validation techniques.
AI Technical Trainer & Evaluator
Moran Systems
Apr 2025 - Jan 2026 (9 months)
Applied advanced NLP to rank and refine LLM outputs for technical accuracy and safety. Performed Python/Java code-correction on frontier models and built multi-turn prompts to test model reasoning and problem-solving boundaries.
Mobile Application Developer
Moran Systems
Feb 2023 - Mar 2025 (2 years 1 month)
Developed Flutter applications with robust Firebase backends, focusing on cloud-based operational reporting. Integrated Dialogflow with Telegram to deliver conversational agents, and architected a merchandising monitoring app with efficient implementations of business logic.
Education
Degrees, certifications, and relevant coursework
Liverpool John Moores University
Master of Science, Artificial Intelligence
2024 - 2025
MSc in Artificial Intelligence covering Deep Learning, Machine Learning Theory, Natural Language Processing, Ethics in AI, and Data Mining.
Jomo Kenyatta University of Agriculture and Technology
Bachelor of Science, Information Technology
2019 - 2022
BSc in Information Technology from Jomo Kenyatta University of Agriculture and Technology.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Paul?
You can contact Paul and 90k+ other talented remote workers on Himalayas.
Message PaulFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
