H O
@hideyuki001
I’m a Senior AI Evaluator specializing in LLM, ASR, and multimodal QA to improve reliability.
What I'm looking for
Senior AI Evaluator specializing in LLM evaluation, Japanese ASR validation, multimodal assessment, and translation QA. My work focuses on identifying hallucinations, reasoning failures, instruction-following issues, and semantic inconsistencies to improve AI quality, reliability, and auditability.
I have completed 1,500+ multimodal AI evaluations, 1,200+ Japanese ASR validation tasks, and 730,000+ words of technical translation and localization. Using structured rubrics (30+ evaluation criteria), I assess semantic fidelity, prompt alignment, safety, visual consistency, and overall model quality while producing structured English rationales for reviewer alignment and traceable QA.
I specialize in reproducible AI evaluation through structured decision frameworks, Red Flag detection, pattern-based QA, and YAML/JSON logging to support scalable Human-in-the-Loop (HITL) workflows. My experience spans multimodal evaluation, LLM assessment, translation QA, MTPE, and cross-lingual quality assurance across enterprise AI projects.
Beyond day-to-day evaluation, I develop reusable quality frameworks including Kernel Core, Translation OS Runtime, ModelRefiner, and Unified Cognitive OS to improve evaluation consistency, traceability, and governance across AI systems.
Experience
Work history, roles, and key accomplishments
Senior AI Evaluator
Hansem Global
Jan 2024 - Present (2 years 6 months)
Conducted 1,500+ multimodal AI evaluations and 1,200+ Japanese ASR validation tasks using structured rubrics and enterprise QA standards.
Produced structured English rationales while identifying hallucinations, omissions, instruction-following issues, and other quality risks to improve dataset reliability.
Technical Translator & Localization
Gengo / TransPerfect / NICT
Jan 2019 - Present (7 years 6 months)
Delivered 730,000+ words of English/Chinese-to-Japanese technical translation, MTPE, and translation QA across IT, engineering, patents, and policy documentation. Applied structured QA to ensure semantic fidelity, terminology consistency, and reproducible quality.
Evaluated LLM responses for reasoning quality, factual accuracy, safety, and instruction following using structured rubrics. Produced structured English rationales for cross-lingual preference ranking and model quality assessment.
Education
Degrees, certifications, and relevant coursework
Vocational Technical College
Diploma in Information Technology, Information Technology
Completed a 3-year Diploma program in Information Technology at a Vocational Technical College in Japan.
Availability
Location
Authorized to work in
Social media
Job categories
Skills
Interested in hiring H?
You can contact H and 90k+ other talented remote workers on Himalayas.
Message HGet matched with your dream remote job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
