Kyle Bailey
@kylebailey
Senior AI/ML engineer specializing in LLM reliability, evaluation, and production systems.
What I'm looking for
I am a Senior AI and Machine Learning Engineer with 12+ years designing, launching, and operating large-scale production AI systems, focused on LLM reliability, evaluation, and inference behavior under real-world traffic.
I've been technical owner of end-to-end LLM platforms, including retrieval pipelines, online and offline evaluation, observability, and incident response, supporting many product teams and external customers.
My work reduced P95 latency significantly, scaled retrieval-augmented generation systems to millions of documents and hundreds of thousands of daily queries, and introduced traffic-derived evaluation, edge-case mining, and online shadow testing to detect real-world failures.
I pair strong hands-on engineering with data engineering and analytics expertise—building dbt analytics layers, Spark/ Airflow pipelines, and human-in-the-loop feedback loops—while mentoring engineers on production LLM debugging, evaluation methodology, and reliability best practices.
Experience
Work history, roles, and key accomplishments
Technical owner for LLM reliability and evaluation across production inference platforms; reduced P95 latency from ~2.3s to <1s and scaled RAG systems supporting 10M+ documents and 100k+ daily queries while improving observability and rollback controls.
Built large-scale data and feature pipelines with Spark, Python and Airflow; developed evaluation frameworks for precision, recall, calibration and drift and implemented human-in-the-loop labeling and quality controls.
Developed backend services and telemetry/data pipelines on Azure to transform raw application data into analytical datasets, improving observability and on-call incident response for downstream ML consumers.
Education
Degrees, certifications, and relevant coursework
U.S. Air Force Institute of Technology
Master of Science, Cyberspace Operations
2011 - 2013
Completed a Master of Science in Cyberspace Operations focusing on advanced topics in cybersecurity and operations from 09/2011 to 05/2013.
U.S. Air Force Academy
Bachelor of Science, Computer Science
2007 - 2011
Earned a Bachelor of Science in Computer Science with coursework supporting software engineering and systems from 04/2007 to 08/2011.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Kyle?
You can contact Kyle and 90k+ other talented remote workers on Himalayas.
Message KyleFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
