HimalayasHimalayas logo
JS
Open to opportunities

Justin Stoecker

@justinstoecker

Staff Software Engineer specializing in hyperscale AI infrastructure and low-latency production systems.

United States
Message

What I'm looking for

I seek roles building production AI/ML infrastructure—low-latency, scalable systems with strong observability, mentorship, and technical leadership opportunities.

I am a Staff Software Engineer with 14+ years of experience at Google, OpenAI, and Meta, focused on architecting hyperscale AI infrastructure, scaling ChatGPT, and optimizing Llama 4 inference for production. I bridge research-to-production, mentor teams, and deliver reliable, low-latency systems under extreme concurrency, achieving substantial performance and cost improvements.

My work includes designing production-grade inference stacks (Python, PyTorch, vLLM), building autoscaling and observability systems (Kubernetes, Ray, Prometheus/Grafana), and driving quantization and parallelism innovations that produced 2–4× efficiency gains. I prioritize robust deployment pipelines, fault tolerance, and measurable impact across multimodal, long-context, and high-traffic environments.

Experience

Work history, roles, and key accomplishments

Meta logoME
Current

Staff Software Engineer

Meta

May 2023 - Present (2 years 10 months)

Architected production-grade inference serving stack for Llama 4 family, achieving 2–4× improved performance-to-cost and supporting billions of daily multimodal interactions while delivering sub-second generation with >99.9% uptime.

OpenAI logoOP

Senior Software Engineer

Dec 2019 - May 2023 (3 years 5 months)

Engineered production deployment and inference infrastructure for ChatGPT, scaling to hundreds of millions of daily queries and reducing latency up to 40% via optimized KV cache management, batching, and autoscaling on large GPU clusters.

Education

Degrees, certifications, and relevant coursework

University of Miami logoUM

University of Miami

Master of Science, Computer Science

2009 - 2011

Completed a Master of Science in Computer Science focusing on advanced topics in AI, machine learning, and systems from 2009 to 2011.

Availability

Open to opportunities

Location

United States

Authorized to work in

Salary expectations

180k-250k USD

Interested in hiring Justin?

You can contact Justin and 90k+ other talented remote workers on Himalayas.

Message Justin

People also viewed

View all talent

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan