Skip to main content
ML
Open to opportunities

Matthew Liu

@matthewliu

Senior AI Engineer specializing in agentic AI, multi-agent orchestration, and production-scale LLM systems.

United States
Message

What I'm looking for

I’m looking for a team building production-grade agentic AI and LLM platforms. I want to own multi-agent orchestration, RAG quality, and real-time inference reliability—shipping fast with strong evaluation, observability, and performance engineering.

I’m a Senior AI Engineer focused on agentic AI systems, multi-agent orchestration, and large-scale LLM platforms. I build and scale production AI that turns unstructured data into automated workflows and actionable insights—prioritizing reliability, performance, and production readiness.

At Notion, I spearheaded NotionAI meeting notes, creating a production-grade agentic AI pipeline (capture → transcript → reasoning → action generation) with GPT-4/4o, Whisper, and streaming. I also designed and scaled multi-agent orchestration with LangGraph and LlamaIndex, improving task completion rates by 40%, and re-architected real-time inference to cut median latency from 3.8s to 1s under production load.

I’ve strengthened quality and observability end-to-end—building RAG pipelines (pgvector, hybrid BM25 + embeddings) that improved factual accuracy by 35%, implementing an LLM evaluation framework that reduced hallucinations by 28%, and adding agent observability with OpenTelemetry to reduce debugging time by 50%. Previously at CourseHero and earlier core platform roles, I engineered scalable AI microservices, TB-scale ETL/data pipelines, and search/recommendation systems that supported multi-million user platforms.

Experience

Work history, roles, and key accomplishments

Notion logoNO

Software Engineer - NotionAI

Mar 2024 - May 2026 (2 years 2 months)

Spearheaded the NotionAI meeting-notes agent, improving AI feature adoption 3x and increasing task completion rates 40% with LangGraph/LlamaIndex multi-agent orchestration. Reduced LLM inference median latency from 3.8s to 1s using model routing, semantic caching (Redis), and streaming, while boosting factual accuracy 35% and reducing hallucinations 28% through RAG reasoning loops and offline eval

CourseHero logoCO

Staff Software Engineer

Mar 2021 - Mar 2024 (3 years)

Redefined AI tutoring and content generation systems, improving content creation efficiency 60% at multi-million user scale using GPT-3.5/4 and NLP pipelines. Built early RAG with Pinecone/Elasticsearch/pgvector, shipped scalable FastAPI/Express microservices, and established MLOps with MLflow/Weights & Biases and offline evaluation to accelerate iteration and improve model quality.

Education

Degrees, certifications, and relevant coursework

Stanford University logoSU

Stanford University

Master's Degree in Computer Science, Computer Science

2016 - 2017

Earned a master's degree in Computer Science at Stanford University.

Stanford University logoSU

Stanford University

Bachelor's Degree in Computer Science, Computer Science

2012 - 2016

Earned a bachelor's degree in Computer Science at Stanford University.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan