Skip to main content
Kyle BaileyKB
Open to opportunities

Kyle Bailey

@kylebailey

Senior AI/ML engineer specializing in LLM reliability, evaluation, and production systems.

United States
Message

What I'm looking for

I seek a role owning production LLM reliability and evaluation, emphasizing scalable inference, observability, and cross-team impact in an engineering-driven culture.

I am a Senior AI and Machine Learning Engineer with 12+ years designing, launching, and operating large-scale production AI systems, focused on LLM reliability, evaluation, and inference behavior under real-world traffic.

I've been technical owner of end-to-end LLM platforms, including retrieval pipelines, online and offline evaluation, observability, and incident response, supporting many product teams and external customers.

My work reduced P95 latency significantly, scaled retrieval-augmented generation systems to millions of documents and hundreds of thousands of daily queries, and introduced traffic-derived evaluation, edge-case mining, and online shadow testing to detect real-world failures.

I pair strong hands-on engineering with data engineering and analytics expertise—building dbt analytics layers, Spark/ Airflow pipelines, and human-in-the-loop feedback loops—while mentoring engineers on production LLM debugging, evaluation methodology, and reliability best practices.

Experience

Work history, roles, and key accomplishments

Scale AI logoSA
Current

Senior AI / ML Engineer

Feb 2020 - Present (6 years 4 months)

Technical owner for LLM reliability and evaluation across production inference platforms; reduced P95 latency from ~2.3s to <1s and scaled RAG systems supporting 10M+ documents and 100k+ daily queries while improving observability and rollback controls.

Databricks logoDA

Machine Learning Engineer

May 2016 - Jan 2020 (3 years 8 months)

Built large-scale data and feature pipelines with Spark, Python and Airflow; developed evaluation frameworks for precision, recall, calibration and drift and implemented human-in-the-loop labeling and quality controls.

Education

Degrees, certifications, and relevant coursework

U.S. Air Force Institute of Technology logoUT

U.S. Air Force Institute of Technology

Master of Science, Cyberspace Operations

2011 - 2013

Completed a Master of Science in Cyberspace Operations focusing on advanced topics in cybersecurity and operations from 09/2011 to 05/2013.

U.S. Air Force Academy logoUA

U.S. Air Force Academy

Bachelor of Science, Computer Science

2007 - 2011

Earned a Bachelor of Science in Computer Science with coursework supporting software engineering and systems from 04/2007 to 08/2011.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan