Jack Wang
@jackwang
Production-focused ML engineer specializing in LLMs, RAG, MLOps, and low-latency systems.
What I'm looking for
I am a production-focused machine learning engineer with 7+ years shipping scalable ML systems across fintech and AI platforms, specializing in adapting foundation models through fine-tuning and RAG architectures.
I build MLOps infrastructure, real-time data pipelines, and low-latency model serving that power customer-facing LLM applications. I have delivered sub-200ms p95 inference latency and production RAG systems for enterprise document repositories.
I combine deep knowledge of ML frameworks (PyTorch, TensorFlow, JAX), vector stores, and cloud infrastructure with systems engineering practices: mentoring engineers, establishing automated deployment and monitoring, and driving reproducible model evaluation in production.
Experience
Work history, roles, and key accomplishments
Architected production LLM inference infrastructure delivering sub-200ms p95 latency and built end-to-end RAG systems and real-time embedding services powering customer-facing LLM applications.
Machine Learning Engineer
Two Sigma
Jul 2019 - Dec 2022 (3 years 5 months)
Developed gradient-boosted and neural models and built real-time feature pipelines with millisecond latency for quantitative trading, plus distributed GPU training and automated hyperparameter optimization.
Software Engineer Intern
Uber
May 2018 - Aug 2018 (3 months)
Built TensorFlow demand-prediction models and feature pipelines on Spark, implemented A/B testing frameworks, and optimized inference latency via ONNX conversion and batching.
Education
Degrees, certifications, and relevant coursework
University of California, Berkeley
Bachelor of Computer Science, Computer Science
2015 - 2019
Completed a Bachelor of Computer Science focusing on core computer science principles and practical software and machine learning applications.
Tech stack
Software and tools used professionally
Apache Spark
Apache Flink
Kubernetes
NumPy
Pandas
dbt
PostgreSQL
Gmail
Databricks
Redis
Terraform
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
DeepSpeed
Neptune
Kafka
RabbitMQ
FastAPI
Grafana
Kibana
Prometheus
GraphQL
gRPC
Elasticsearch
Milvus
pytest
Airflow
Apache Beam
XGBoost
Hugging Face
LightGBM
CatBoost
Seldon
Temporal
Qdrant
LangChain
DuckDB
LlamaIndex
Weaviate
ChromaDB
Weights & Biases
Evidently AI
Polars
BentoML
Pinecone
Ray
Delta Lake
vLLM
Great Expectations
ArgoCD
JAX
Stable Diffusion
ONNX Runtime
Scale AI
Ruff
Faiss
Loops
PEFT
Dynamic
