Open to opportunities

Kyle Bailey

@kylebailey

Senior AI/ML engineer specializing in LLM reliability, evaluation, and production systems.

United States

What I'm looking for

I seek a role owning production LLM reliability and evaluation, emphasizing scalable inference, observability, and cross-team impact in an engineering-driven culture.

I am a Senior AI and Machine Learning Engineer with 12+ years designing, launching, and operating large-scale production AI systems, focused on LLM reliability, evaluation, and inference behavior under real-world traffic.

I've been technical owner of end-to-end LLM platforms, including retrieval pipelines, online and offline evaluation, observability, and incident response, supporting many product teams and external customers.

My work reduced P95 latency from ~2.3s to under 1s, scaled retrieval-augmented generation systems to millions of documents and hundreds of thousands of daily queries, and introduced traffic-derived evaluation, edge-case mining, and online shadow testing to detect real-world failures.

I pair hands-on engineering with data engineering and analytics expertise (building dbt analytics layers, Spark/Airflow pipelines, and human-in-the-loop feedback loops) while mentoring engineers on production LLM debugging, evaluation methodology, and reliability best practices.

Experience

Work history, roles, and key accomplishments

Scale AI
Current

Senior AI / ML Engineer

Feb 2020 - Present (6 years 2 months)

Technical owner for LLM reliability and evaluation across production inference platforms; reduced P95 latency from ~2.3s to <1s and scaled RAG systems supporting 10M+ documents and 100k+ daily queries while improving observability and rollback controls.

Databricks

Machine Learning Engineer

May 2016 - Jan 2020 (3 years 8 months)

Built large-scale data and feature pipelines with Spark, Python, and Airflow; developed evaluation frameworks for precision, recall, calibration, and drift; and implemented human-in-the-loop labeling and quality controls.

Education

Degrees, certifications, and relevant coursework

U.S. Air Force Institute of Technology

Master of Science, Cyberspace Operations

2011 - 2013

Completed a Master of Science in Cyberspace Operations focusing on advanced topics in cybersecurity and operations from 09/2011 to 05/2013.

U.S. Air Force Academy

Bachelor of Science, Computer Science

2007 - 2011

Earned a Bachelor of Science in Computer Science with coursework supporting software engineering and systems from 04/2007 to 08/2011.
