Skip to main content
chen xieCX
Open to opportunities

chen xie

@chenxie

Senior LLM engineer building reliable multi-agent, RAG, and coding-agent systems.

United States
Message

What I'm looking for

I’m looking for a role where I can build and evaluate reliable LLM systems—RAG, multi-agent workflows, and coding-agent benchmarks—then iterate quickly with measurable improvements in quality, retrieval efficiency, and user outcomes.

Working Experience

Snorkel AI

Sep 2025 – Present

Part-time Contractor (Python + Docker + Agent Framework + LoRA SFT)

• Contributing to Terminus Bench, an open-source coding agent evaluation benchmark providing sandboxed environments for

safely executing and testing AI-generated code.

• Designed and implemented new coding agent eval benchmarks to expand task coverage and improve assessment reliability

across diverse agent frameworks.

• Fine-tuned DeepSeek V4 Flash using LoRA SFT on curated domain-specific instruction datasets; implemented the end-to-end

training pipeline with ms-swift framework, managed data preprocessing and tokenisation, and integrated the fine-tuned model

into the benchmark evaluation pipeline to assess performance gains on coding agent tasks.

Weibo

Oct 2024 – Jan 2026

Senior LLM Engineer (Python + Pytorch + Verl + RAG)

Microsoft

Dec 2022 – Oct 2024

Software Engineer (Azure + Python + Pytorch + RAG)

Kafang Tech

Sep 2019 – May 2021

Software Engineer (Go/C++ + Microservice + Kubernetes + gRPC + Vue.js)

Open Source ProjectsTerminus Bench | Coding Agent Evaluation Benchmark

Education

Carnegie Mellon University – Pittsburgh, PA

Jun 2021 – Dec 2022

Master of Science in Computer Engineering (Software Engineering track)

Dalian University of Technology – Dalian, China

Sep 2015 – Sep 2019

Bachelor of Engineering in Chemical Engineering (Computation track)

Experience

Work history, roles, and key accomplishments

Snorkel AI logoSA
Current

Coding Agent Eval Contractor

Sep 2025 - Present (9 months)

Contributed to Terminus Bench, an open-source coding agent evaluation benchmark with sandboxed Docker environments for safely executing and testing AI-generated code. Designed new agent-eval benchmarks and integrated a LoRA fine-tuned DeepSeek V4 Flash model into the benchmark pipeline to improve assessment performance on coding agent tasks.

Weibo logoWE

Senior LLM Engineer

Weibo

Oct 2024 - Jan 2026 (1 year 3 months)

Built a multi-agent, multi-turn report generation system using RAG with short- and long-term memory. Implemented SFT plus DPO-style RL optimization to improve dialogue quality and increased key engagement metrics (transition rate and CTR), and developed an iterative Search-R1-inspired RL framework within a modular multi-LLM search pipeline.

Microsoft logoMI

Software Engineer (Azure ML)

Dec 2022 - Oct 2024 (1 year 10 months)

Implemented fine-tuning pipelines on Azure ML Studio for multiple base models (including Llama3, GPT-4o, and Mistral) and managed training data via Azure Blob Storage. Built multilingual training pipelines, generated SBOM artifacts for data and model training/deployment steps, and evaluated fine-tuned models for Microsoft Teams meeting summaries.

KT

Software Engineer (ML Platform)

Kafang Tech

Sep 2019 - May 2021 (1 year 8 months)

Designed and implemented a cloud-native ML training platform deployed across two Kubernetes clusters, including user authentication and REST APIs for training-data management. Improved performance using Redis caching (reduced data retrieval time by 60%) and built task-queue architecture with Redis and gRPC, plus monitoring/alerting in Prometheus and Grafana that reduced false alerts from 46.6% to

Education

Degrees, certifications, and relevant coursework

Carnegie Mellon University logoCU

Carnegie Mellon University

Master of Science in Computer Engineering, Computer Engineering

2021 - 2022

Master of Science in Computer Engineering (Software Engineering track) at Carnegie Mellon University from 2021 to 2022.

Dalian University of Technology logoDT

Dalian University of Technology

Bachelor of Engineering in Chemical Engineering, Chemical Engineering

2015 - 2019

Bachelor of Engineering in Chemical Engineering (Computation track) at Dalian University of Technology from 2015 to 2019.

Dalian University of Technology logoDT

Dalian University of Technology

Bachelor of Engineering, Chemical Engineering

2015 - 2019

Earned a Bachelor of Engineering in Chemical Engineering (Computation track) at Dalian University of Technology.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan