Open to opportunities

Ian Juch

@ianjuch

Staff AI Systems Engineer building production LLM platforms that balance reliability, cost, and latency.

United States

What I'm looking for

I’m looking to lead end-to-end LLM systems work (retrieval, agent orchestration, and evaluation) where I can own reliability and observability while optimizing latency, cost, correctness, and safety in production.

I’m a Staff AI Systems Engineer with 10+ years building production LLM platforms and distributed systems at Adobe, Cohere, and Netflix. I focus on systems that work under real constraints (latency, cost, correctness, and reliability) across retrieval, agent orchestration, and evaluation pipelines.

At Adobe, I build the backend orchestration powering Firefly generative workflows, designing multi-stage execution pipelines with explicit state tracking and context grounding from user assets and project state. I also implement tool-execution layers for deterministic services, plus rollout and gating strategies that balance latency, cost, and content safety.

Earlier, at Cohere and Netflix, I built enterprise LLM and retrieval infrastructure, as well as high-scale experimentation and traffic control systems. I own systems end to end, making them observable, debuggable, and reliable through request-level tracing, targeted failure-mode evaluation, and staged rollouts with kill switches and fallback paths.

Experience

Work history, roles, and key accomplishments

Current

Staff AI Systems Engineer

Adobe

Jan 2023 - Present (3 years 6 months)

Built backend orchestration systems powering Firefly generative workflows, including multi-stage execution with explicit state tracking across retrieval, inference, and post-processing. Implemented tool-augmented execution, rollout gating, evaluation pipelines for failure modes, and request-level observability for end-to-end debugging of multi-step behavior.

Orchestration, Tool Augmented Execution, State Tracking, Model Rollout Gating, Evaluation Pipelines, RAG

Senior Applied AI / Platform Engineer

Cohere

May 2020 - Dec 2022 (2 years 7 months)

Built a developer-facing platform for generation and embedding APIs, enabling enterprises to integrate LLM capabilities into production systems. Designed retrieval and orchestration pipelines (retrieve → rank → prompt → generate → validate) using hybrid retrieval and evaluation tooling to debug retrieval and output correctness.

Embeddings, Vector Search, Reranking, Orchestration, Inference Batching and Queuing, Evaluation Pipelines

Senior Applied AI Engineer

Netflix

Jun 2019 - Apr 2020 (10 months)

Built backend services within the Netflix Experimentation Platform to enable safe product evaluations across global traffic. Developed traffic routing, staged rollout mechanisms (progressive ramp-up, kill switches, fallbacks), dynamic experiment configuration APIs, and observability for detecting anomalies and regressions.

Experimentation Platforms, Routing, Kill Switches, Fallbacks, Experiment Configuration APIs, Observability, Anomaly Detection, Reliability Engineering

Software Engineer

Netflix

Jul 2016 - May 2019 (2 years 10 months)

Built distributed services for Traffic Routing and Experimentation APIs, supporting high-throughput request routing across global regions. Implemented low-latency routing, production observability (metrics/logging/tracing), resilience mechanisms (retries, circuit breakers, graceful degradation), and on-call incident diagnostics for critical infrastructure.

Distributed Systems, High Throughput, API Design, Low Latency, Request Routing, Microservices, Resilience (Retries, Circuit Breakers), Graceful Degradation

Software Engineering Intern

Netflix

Jun 2015 - Aug 2015 (2 months)

Contributed backend code supporting Streaming Platform infrastructure, with a focus on reliability and service coordination. Built internal tooling to improve deployment workflows and service monitoring, and strengthened logging and instrumentation to make distributed interactions easier to debug.

Backend Development, Monitoring, Reliability Engineering, Service Coordination, Debugging