Michael Zhu
@michaelzhu
Senior Full Stack AI Engineer building end-to-end LLM, NLP, and RAG systems that ship reliably to production.
What I'm looking for
I’m an innovative Full Stack AI Engineer with 8+ years of experience building and deploying AI-powered applications, LLM-based systems, and scalable full-stack platforms in fast-paced environments. I bring a strong ownership mindset and focus on delivering end-to-end solutions, from ideation to production, with an emphasis on performance, reliability, and user experience.
In my current role, I designed and deployed end-to-end AI-powered applications integrating LLMs, NLP, and agentic workflows using LangChain and LangGraph. I built scalable RAG-based systems with FAISS, Pinecone, and pgvector, improving retrieval accuracy by 40%, and reduced manual workload by 55% through intelligent automation for summarization, search, and decision-making.
I’ve also delivered real-time NLP and enterprise conversational AI using LangChain and backend APIs, improving model performance and application reliability by 25% through optimization, debugging, and evaluation pipelines. Across teams, I build reusable, modular AI components, production-ready backend APIs, and frontend integrations, and I have improved throughput by 30% using asynchronous processing and a microservices architecture.
Experience
Work history, roles, and key accomplishments
Senior Software Engineer
Alluxii
Jun 2023 - Present (3 years)
Designed and deployed end-to-end AI-powered applications using LangChain and LangGraph, building RAG systems with FAISS, Pinecone, and pgvector to improve retrieval accuracy by 40%. Reduced manual workload by 55% via intelligent automation and increased throughput by 30% using asynchronous processing and microservices architecture.
Senior AI Engineer
DivIHN
Mar 2019 - Jun 2023 (4 years 3 months)
Built and deployed NLP-driven applications and enterprise conversational AI using LangChain and backend APIs, enabling real-time AI interactions and automation workflows. Improved model performance and application reliability by 25% through optimization, debugging, and evaluation pipelines.
Software Engineer Intern
DivIHN
Sep 2018 - Feb 2019 (5 months)
Developed LLM, semantic search, and multimodal AI pipelines, building real-time backend systems with sub-200ms latency using Redis Streams, WebRTC, and distributed architecture. Implemented FAISS-based semantic search to improve recommendation quality and user experience.
Education
Degrees, certifications, and relevant coursework
University of Utah Graduate School
Master of Science, Applied Mathematics
Earned an M.S. in Applied Mathematics from the University of Utah Graduate School.
