Long Yang
@longyang
AI/LLM agent architect and senior software engineer building production ML systems with RAG, tool-calling, and measurable impact.
What I'm looking for
I’m an AI Architect and Senior Software Engineer with 14+ years building and scaling production ML systems for enterprise and consumer products, from Google and Uber to Foursquare. I specialize in LLM/GenAI and agentic platforms—especially multimodal systems, RAG/grounded retrieval, and tool/function-calling workflows—deployed on cloud-native infrastructure.
At Google, I built production agent services using grounded RAG (Vertex AI Search datastores) to deliver reliable, citation-able answers for operational decision support. I architected agent orchestration with tool/function calling, structured outputs, and workflow state management using LangChain plus internal frameworks, delivering production-grade NestJS APIs with retries, clear interfaces, and observability.
I lead high-ambiguity initiatives end-to-end and consistently drive measurable outcomes. For example, I architected an enterprise multimodal agent platform that automated manufacturing QC and operator workflows, improving accuracy by 87% and reducing manual inspection time by 65%, while establishing a GenAI quality loop (golden datasets, regression suites, retrieval precision/recall checks, hallucination triage) and governance guardrails for regulated workflows.
Previously, I engineered ranking, discovery, and personalization at Foursquare, improving engagement 23% and relevance/accuracy while shipping scalable pipelines (Kafka → Spark → MongoDB). At Uber, I delivered rider churn prediction and real-time payment fraud detection services, reducing churn 10% and fraudulent transactions 27% through robust ETL, feature pipelines, and monitoring; I also design production-ready data services with correctness, auditing, and operational reliability in mind.
Experience
Work history, roles, and key accomplishments
Built production RAG-grounded agent services using Vertex AI Search datastores to deliver reliable, citation-able answers. Architected grounded, tool-using multimodal agent workflows for manufacturing QC, improving accuracy by 87% and reducing manual inspection time by 65%, while adding GenAI quality loops and guardrails for regulated use.
Designed ranking and recommendation systems to improve venue discovery by leveraging POI attributes and behavioral signals. Built a Kafka→Spark pipeline for near real-time personalization and shipped ML/NLP models that increased engagement by 23%, improved search performance by 31%, and achieved 87% categorization accuracy.
Junior AI Engineer
WINLAB, Rutgers University
Jan 2013 - Dec 2017 (4 years 11 months)
Built predictive analytics models to identify at-risk cohorts and support earlier interventions, improving retention by 12%. Designed ETL pipelines and analytics dashboards, improving administrative decision-making and student satisfaction by 15%.
Developed risk models combining ML scores with rule execution patterns to power investigator workflows. Built ETL and production services for churn prediction and real-time payment fraud detection, reducing churn by 10%, fraud by 27% (preventing ~$1.2M/year), and cutting preprocessing time by 50%.
Education
Degrees, certifications, and relevant coursework
University of Electronic Science and Technology of China
Bachelor of Science
Earned a Bachelor of Science (completed in 2011) at the University of Electronic Science and Technology of China, with coursework including machine learning, deep learning, and natural language processing.
Rutgers University
Ph.D., Computer Engineering
2016 -
Pursued a Ph.D. in Computer Engineering at Rutgers University starting in 2016.
Tech stack
Software and tools used professionally
D3.js
Kubernetes
MySQL
PostgreSQL
MongoDB
Hadoop
Gmail
Rollout
Node.js
Django
NestJS
Databricks
Redis
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
DataRobot
H2O
Kafka
Milvus
Airflow
GuardRails
SQL
Hugging Face
Seldon
Dagster
Temporal
Qdrant
LangChain
LlamaIndex
Weaviate
Weights & Biases
Evidently AI
AutoGen
BentoML
Pinecone
Feast
Ray
DVC (Data Version Control)
KServe
Anthropic Claude API
JAX
Langfuse
ClearML
Haystack
Agentic
Faiss
Determined AI
Loops
Metaflow
Task
Matrix
Check
Sentence Transformers
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Long?
You can contact Long and 90k+ other talented remote workers on Himalayas.
Message LongFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
