Jack Wang
@jackwang1
Senior software engineer building scalable Python backends, data platforms, and LLM-powered workflows that ship reliably.
What I'm looking for
I’m a Senior Software Engineer with 10+ years of experience designing and building scalable backend systems, data platforms, and machine learning workflows using Python. I’m passionate about leveraging Python and modern technologies to solve complex problems and deliver reliable, high-impact systems.
At Scale AI, I built a Python-based data processing platform with FastAPI to replace brittle batch scripts, adding consistent validation, transformation, and versioning for large annotation datasets. I implemented LLM-powered workflows using OpenAI APIs, scaled distributed task execution with Celery and Redis, and designed a real-time ingestion pipeline with Apache Kafka.
I also strengthened production quality and performance by enforcing schemas and validation with Pydantic, redesigning PostgreSQL queries and indexing strategies to reduce latency, and deploying microservices on AWS using Terraform. Previously, at Two Sigma, I built end-to-end ML pipelines with scikit-learn, XGBoost, PySpark, TensorFlow, and MLflow, turning research into scalable production systems.
Experience
Work history, roles, and key accomplishments
Senior Software Engineer
Scale AI
Apr 2022 - Present (4 years 2 months)
Built a Python-based data processing platform with FastAPI to replace brittle batch scripts, adding consistent validation, transformation, and versioning of annotation datasets for model training. Implemented LLM-powered workflows with OpenAI APIs, added Celery + Redis for asynchronous processing, and introduced Kafka-based real-time ingestion to keep downstream systems updated without batch delays.
Machine Learning Engineer
Two Sigma
Feb 2019 - Mar 2022 (3 years 1 month)
Built end-to-end machine learning pipelines using scikit-learn and XGBoost to transform raw financial data into predictive models for trading strategies. Developed time-series deep learning models with TensorFlow, created PySpark feature engineering pipelines for large datasets, and improved reliability with validation/backtesting plus MLflow experiment tracking.
Built Python (Flask) backend services supporting internal ride-data analysis across multiple regions. Improved API performance and reliability by introducing Memcached for frequently accessed metrics, designing REST APIs for trip-level aggregation, and adding PyTest unit tests to reduce regression issues.
Software Engineer Intern
Kloudless
Sep 2017 - Dec 2017 (3 months)
Built React components to visualize data from multiple cloud storage providers in a unified interface. Implemented data normalization for inconsistent third-party formats and improved data consistency using MongoDB/Mongoose and Mocha/Chai validation, alongside Selenium-based cross-platform testing.
Created internal operational dashboards using HTML/CSS/JavaScript and automated repetitive data-processing tasks with Python scripts. Built basic backend APIs with Django and added unit tests to ensure correctness and catch edge cases early.
Education
Degrees, certifications, and relevant coursework
University of California, Berkeley
Bachelor's degree in Computer Science
2015 - 2018
Earned a computer science bachelor’s degree at the University of California, Berkeley from 2015 to 2018.
Tech stack
Software and tools used professionally
Google Tag Manager
Postman
Apache Spark
GitHub
Kubernetes
Cloudflare
GitHub Actions
Jupyter
NumPy
Pandas
PySpark
MySQL
PostgreSQL
MongoDB
Memcached
Hadoop
Node.js
Django
Google Analytics
Redis
Terraform
Jira
Mocha
Chai
React
JavaScript
Python
HTML5
Java
CSS 3
TensorFlow
PyTorch
MLflow
scikit-learn
Kafka
FastAPI
asyncio
SQLAlchemy
Linux
TypeScript
pytest
Docker
Kloudless
s3-lambda
SQL
XGBoost
Pydantic
OpenAI API
Score
Bash
Scale AI
Task
