Open to opportunities

H O

@hideyuki001

I’m a Senior AI Evaluator specializing in LLM, ASR, and multimodal QA to improve reliability.

Japan

What I'm looking for

I’m looking to join teams that value reproducible, audit-ready AI QA: LLM, ASR, and multimodal evaluation built on clear rubrics, decision traceability, and human-in-the-loop workflows. I also want to help improve dataset quality, reliability, and privacy-aware relevance.

I am a Senior AI Evaluator specializing in LLM evaluation, Japanese ASR validation, multimodal assessment, and translation QA. My work focuses on identifying hallucinations, reasoning failures, instruction-following issues, and semantic inconsistencies to improve AI quality, reliability, and auditability.

I have completed 1,500+ multimodal AI evaluations, 1,200+ Japanese ASR validation tasks, and 730,000+ words of technical translation and localization. Using structured rubrics (30+ evaluation criteria), I assess semantic fidelity, prompt alignment, safety, visual consistency, and overall model quality while producing structured English rationales for reviewer alignment and traceable QA.

I specialize in reproducible AI evaluation through structured decision frameworks, Red Flag detection, pattern-based QA, and YAML/JSON logging to support scalable Human-in-the-Loop (HITL) workflows. My experience spans multimodal evaluation, LLM assessment, translation QA, MTPE, and cross-lingual quality assurance across enterprise AI projects.

Beyond day-to-day evaluation, I develop reusable quality frameworks including Kernel Core, Translation OS Runtime, ModelRefiner, and Unified Cognitive OS to improve evaluation consistency, traceability, and governance across AI systems.

Experience

Work history, roles, and key accomplishments

Current

Senior AI Evaluator

Hansem Global

Jan 2024 - Present (2 years 6 months)

Conducted 1,500+ multimodal AI evaluations and 1,200+ Japanese ASR validation tasks using structured rubrics and enterprise QA standards.

Produced structured English rationales while identifying hallucinations, omissions, instruction-following issues, and other quality risks to improve dataset reliability.

Education

Degrees, certifications, and relevant coursework


Vocational Technical College

Diploma in Information Technology

Completed a three-year diploma program in Information Technology at a vocational technical college in Japan.

