Looking for a job

Jeorge Johns

@jeorgejohns

High-throughput AI evaluation specialist bridging frontier LLMs and engineering rigor.

United Kingdom

What I'm looking for

I’m looking for high-trust work evaluating frontier AI with rigorous rubrics, red-teaming, and failure-mode taxonomy, ideally where technical accuracy, adversarial testing, and clear documentation directly improve model safety and real-world performance.

I’m a high-throughput AI evaluation specialist with 250,000+ evaluations and a 98%+ approval rating across OpenAI (Feather), Alphabet, Hugging Face, and Microsoft FlightAcademy.ai. I specialize in adversarial red-teaming, failure-mode taxonomy, and rubric design, and I’m routinely staffed on reasoning, technical, and long-form workstreams concurrently across labs.

Alongside evaluation, I run a parallel career as a design engineer and documentation lead, with mechanical product ownership from concept through commercial entry using PTC Creo and Windchill PLM. I also publish on AI evaluation methodology and bring uncommon depth from production engineering, technical writing, and independent first-principles study in plasma physics, sputter deposition, and ultra-high vacuum engineering.

Experience

Work history, roles, and key accomplishments

OpenAI (Feather)
Current

Frontier Model Evaluator

Apr 2024 - Present (2 years 2 months)

Performed high-throughput evaluations of frontier AI models, completing 250,000+ evaluations with a 98%+ approval rating across OpenAI, Alphabet, Hugging Face, and Microsoft FlightAcademy.ai. Conducted adversarial red-teaming, failure-mode taxonomy, and rubric design for engineering and technical reasoning tasks.

Education

Degrees, certifications, and relevant coursework

University of Liverpool

MSc(Eng), Product Design & Management

Grade: Merit

Earned an MSc(Eng) in Product Design & Management with a Merit.

University of Liverpool

Bachelor of Engineering (BEng), Aerospace Engineering

Grade: 2:1 (Hons)

Earned a BEng in Aerospace Engineering with a 2:1 (Hons).

