Steven Chen
@stevenchen1
Senior machine learning engineer building scalable LLM and RAG systems for real-world impact.
What I'm looking for
I’m a Senior Machine Learning Engineer and Data Scientist with 10+ years building scalable AI systems and deploying production-grade machine learning and generative AI solutions. I specialize in LLM systems, multi-agent architectures, retrieval-augmented generation, and cloud-native ML platforms, with a track record of delivering reliable, high-impact AI products across healthcare and fintech.
I lead end-to-end LLMOps and MLOps implementations—prompt versioning, evaluation workflows for factuality and safety, hallucination detection, and token/cost monitoring—while building robust safety and compliance guardrails. At Truveta, I developed fine-tuned clinical LLMs (SFT, PEFT/LoRA, RLHF), deployed hybrid RAG pipelines over large-scale EHR data, and engineered MCP-style tool orchestration for automated clinical research workflows.
Experience
Work history, roles, and key accomplishments
Led clinical LLM development using SFT, PEFT/LoRA, and RLHF, improving factual accuracy and guideline alignment by 31%. Built LangGraph multi-agent workflows and hybrid RAG over large-scale EHR data, reducing research cycle time by 35% and improving answer relevance by 24%.
Built and fine-tuned transformer models (BERT/RoBERTa/DistilBERT) for NLP tasks, improving accuracy by up to 20% on benchmark datasets. Delivered low-latency inference with FastAPI and deployed embedding-based retrieval using Sentence Transformers and FAISS, improving retrieval precision by 25%.
Built demand forecasting models using DeepAR and LSTM time-series methods to improve inventory planning for volatile and long-tail retail categories. Implemented large-scale Spark/EMR pipelines for feature generation and backtesting, and added monitoring for data quality and forecast stability to reduce undetected model degradation.
Built data preprocessing pipelines with Hive/EMR/SQL to aggregate product view and clickstream signals for Amazon search and homepage experimentation. Conducted exploratory data analysis and prototyped predictive models in Python/NumPy to estimate user re-engagement and purchase intent.
Education
Degrees, certifications, and relevant coursework
Texas A&M University
Bachelor of Science, Computer Science
2010 - 2014
Earned a Bachelor of Science in Computer Science at Texas A&M University, attending from 2010 to 2014.
Tech stack
Software and tools used professionally
Azure Synapse
GitHub
GitLab
Kubernetes
Azure Kubernetes Service
Jenkins
GitHub Actions
GitLab CI
NumPy
Pandas
Gmail
Databricks
Java
Amazon Machine Learning
Azure Machine Learning
TensorFlow
PyTorch
MLflow
scikit-learn
Keras
Kubeflow
Neptune
Kafka
FastAPI
Windows
Milvus
Airflow
GuardRails
SQL
Hugging Face
Apache Arrow
Temporal
LangChain
Weaviate
Weights & Biases
Polars
AutoGen
Pinecone
CrewAI
Ray
Delta Lake
Trino
JAX
Evidence
Bash
Faiss
LangGraph
Loops
PEFT
Bridge
Remote
Jan
Sentence Transformers
Falcon
