Austin Cai
@austincai
Senior AI/ML engineer specializing in scalable LLMs, vision, and production ML systems.
What I'm looking for
I am a Senior AI Engineer with proven experience building and deploying end-to-end AI solutions across LLMs, computer vision, and reinforcement learning, driving measurable business impact in production environments. I have led technical teams, launched 0-to-1 SaaS products, and reduced operational costs through real-time AI decisioning and voice AI automation.
My background spans research and applied ML at Stanford, Adobe, DoorDash, and startups, with strengths in model optimization, cloud-native deployments (AWS/GCP/Kubernetes), and human-in-the-loop systems. I focus on scalable, fault-tolerant architectures and continuous improvement through A/B testing, CI/CD, and interpretability to build trustworthy AI products.
Experience
Work history, roles, and key accomplishments
Senior AI Engineer
Lumos
Jan 2021 - Present (4 years 8 months)
Spearheaded transformer-based models and real-time AI decisioning for SaaS spend management and support automation, reducing response time by 50%, increasing engagement by 25%, and cutting support costs within six months.
Adapted CLIP for multilingual support and deployed scalable image retrieval pipelines, improving image search precision by 22% and reducing inference latency by 30% via ONNX conversion.
Machine Learning Researcher
Stanford Intelligent Systems Lab
Jan 2020 - Dec 2020 (11 months)
Researched autonomous vehicle decision-making using POMDPs and RL, improving pedestrian detection accuracy by 20% and reducing decision latency via Julia-based POMDP stacks.
Machine Learning Fellow
Lumos
Jan 2020 - Dec 2020 (11 months)
Supported AI/ML initiatives by developing models, evaluating performance, and contributing insights that shaped the company's AI strategy.
Machine Learning Researcher
Stanford University
Jan 2020 - Dec 2020 (11 months)
Collaborated on image classification research, developing data augmentation techniques and experiments that improved model robustness and supported publications.
Built end-to-end pipelines for predicting Dasher sign-off behavior and demand quantiles, increasing driver retention by 12% and improving delivery forecasting accuracy by 20% during peak periods.
Software Engineer Intern
Lark Health
Jan 2019 - Dec 2019 (11 months)
Migrated server codebase to Spring Boot, added Prometheus metrics, and implemented caching to reduce latency in conversation generation and improve scalability.
Developed tools to parse historical labor data for NASA mission timeline and cost evaluation, automating processes and saving ~10 maintenance hours per week.
Software Engineer Intern
ProLabs.com
Jan 2017 - Dec 2018 (1 year 11 months)
Built applications for international staff to parse warehouse data, implemented admin access controls, and streamlined supply chain operations across global locations.
Education
Degrees, certifications, and relevant coursework
Stanford University
Bachelor of Arts, Symbolic Systems (Artificial Intelligence)
Completed a Bachelor of Arts in Symbolic Systems with a focus on artificial intelligence and related interdisciplinary coursework.
Stanford University
Master of Science, Computer Science (Computer Systems)
Completed a Master of Science in Computer Science with a specialization in computer systems.
University of Oxford
Bachelor of Arts, Modern Chinese History
Completed a Bachelor of Arts in Modern Chinese History focusing on historical and cultural studies of China.
