chen xie
@chenxie
Senior LLM engineer building reliable multi-agent, RAG, and coding-agent systems.
What I'm looking for
Working Experience
Snorkel AI
Sep 2025 – Present
Part-time Contractor (Python + Docker + Agent Framework + LoRA SFT)
• Contributing to Terminus Bench, an open-source coding agent evaluation benchmark providing sandboxed environments for
safely executing and testing AI-generated code.
• Designed and implemented new coding agent eval benchmarks to expand task coverage and improve assessment reliability
across diverse agent frameworks.
• Fine-tuned DeepSeek V4 Flash using LoRA SFT on curated domain-specific instruction datasets; implemented the end-to-end
training pipeline with ms-swift framework, managed data preprocessing and tokenisation, and integrated the fine-tuned model
into the benchmark evaluation pipeline to assess performance gains on coding agent tasks.
Oct 2024 – Jan 2026
Senior LLM Engineer (Python + Pytorch + Verl + RAG)
Microsoft
Dec 2022 – Oct 2024
Software Engineer (Azure + Python + Pytorch + RAG)
Kafang Tech
Sep 2019 – May 2021
Software Engineer (Go/C++ + Microservice + Kubernetes + gRPC + Vue.js)
Open Source ProjectsTerminus Bench | Coding Agent Evaluation Benchmark
Education
Carnegie Mellon University – Pittsburgh, PA
Jun 2021 – Dec 2022
Master of Science in Computer Engineering (Software Engineering track)
Dalian University of Technology – Dalian, China
Sep 2015 – Sep 2019
Bachelor of Engineering in Chemical Engineering (Computation track)
Experience
Work history, roles, and key accomplishments
Designed and implemented coding agent evaluation benchmarks to expand task coverage and improve assessment reliability. Fine-tuned DeepSeek V4 Flash with LoRA SFT using ms-swift and integrated the model into the benchmark pipeline to measure performance gains.
Contributed to Terminus Bench, an open-source coding agent evaluation benchmark with sandboxed Docker environments for safely executing and testing AI-generated code. Designed new agent-eval benchmarks and integrated a LoRA fine-tuned DeepSeek V4 Flash model into the benchmark pipeline to improve assessment performance on coding agent tasks.
Senior LLM Engineer
Oct 2024 - Jan 2026 (1 year 3 months)
Built a multi-agent, multi-turn report generation system using RAG with short- and long-term memory. Implemented SFT plus DPO-style RL optimization to improve dialogue quality and increased key engagement metrics (transition rate and CTR), and developed an iterative Search-R1-inspired RL framework within a modular multi-LLM search pipeline.
Implemented fine-tuning pipelines on Azure ML Studio for multiple base models (including Llama3, GPT-4o, and Mistral) and managed training data via Azure Blob Storage. Built multilingual training pipelines, generated SBOM artifacts for data and model training/deployment steps, and evaluated fine-tuned models for Microsoft Teams meeting summaries.
Software Engineer (ML Platform)
Kafang Tech
Sep 2019 - May 2021 (1 year 8 months)
Designed and implemented a cloud-native ML training platform deployed across two Kubernetes clusters, including user authentication and REST APIs for training-data management. Improved performance using Redis caching (reduced data retrieval time by 60%) and built task-queue architecture with Redis and gRPC, plus monitoring/alerting in Prometheus and Grafana that reduced false alerts from 46.6% to
Education
Degrees, certifications, and relevant coursework
Carnegie Mellon University
Master of Science in Computer Engineering, Computer Engineering
2021 - 2022
Master of Science in Computer Engineering (Software Engineering track) at Carnegie Mellon University from 2021 to 2022.
Dalian University of Technology
Bachelor of Engineering in Chemical Engineering, Chemical Engineering
2015 - 2019
Bachelor of Engineering in Chemical Engineering (Computation track) at Dalian University of Technology from 2015 to 2019.
Dalian University of Technology
Bachelor of Engineering, Chemical Engineering
2015 - 2019
Earned a Bachelor of Engineering in Chemical Engineering (Computation track) at Dalian University of Technology.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Social media
Job categories
Skills
Interested in hiring chen?
You can contact chen and 90k+ other talented remote workers on Himalayas.
Message chenFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
