Xin Zhang
@xinzhang1
Senior AI engineer specializing in LLM, RAG, and scalable production AI infrastructure.
What I'm looking for
I’m a Senior AI Engineer with 10+ years of experience building LLM, Generative AI, RAG, recommendation systems, and cloud-native ML platforms. I focus on distributed training, model optimization, AI infrastructure, and NLP, with scalable backends that deliver production AI supporting millions of users.
At Apple, I developed AI-powered cloud services for large-scale products, built LLM applications and RAG pipelines using enterprise knowledge bases, and optimized distributed AI services for low latency and high availability. I also improved LLM inference performance through GPU acceleration and model optimization, delivering AI infrastructure serving millions of daily requests and leading features from design to production.
Previously at Oracle, I built deep learning recommendation systems improving CTR by 22%, and developed NLP document intelligence and semantic search pipelines. I deployed distributed Spark training pipelines with feature stores and MLOps workflows, and reduced model deployment time from weeks to hours through CI/CD automation, while earlier roles covered fraud detection, predictive analytics, and ML APIs on containerized cloud infrastructure.
Experience
Work history, roles, and key accomplishments
Developed AI-powered cloud services for large-scale Apple products, building LLM applications and RAG pipelines using enterprise knowledge bases. Optimized distributed AI services for low latency and high availability and improved inference performance via GPU acceleration and model optimization.
Built deep learning recommendation systems and improved CTR by 22%. Developed NLP document intelligence and semantic search pipelines, and deployed distributed Spark training pipelines with feature stores and MLOps workflows.
Machine Learning Engineer
Oracle NetSuite
Jan 2018 - Jan 2021 (3 years)
Developed fraud detection and predictive analytics for enterprise SaaS, including building ML features for ERP finance and transaction analytics. Improved real-time anomaly detection and engineered features for large-scale datasets, and built ML APIs for containerized cloud deployments.
Developed Java/Python microservices on Kubernetes and built ML models and REST APIs for enterprise analytics. Optimized backend performance using caching and database systems, and productionized ML models using Agile and DevOps practices.
Education
Degrees, certifications, and relevant coursework
University of Southern California
Master of Science in Computer Science, Computer Science
2017 - 2018
Earned an M.S. in Computer Science at the University of Southern California (2017–2018).
Shanxi University of Finance and Economics
Bachelor of Science in Computer Science, Computer Science
2012 - 2016
Earned a B.S. in Computer Science at Shanxi University of Finance and Economics (2012–2016).
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Xin?
You can contact Xin and 90k+ other talented remote workers on Himalayas.
Message XinGet matched with your dream remote job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
