Skip to main content
HimalayasHimalayas logo
JW
Open to opportunities

Jack Wang

@jackwang1

Senior software engineer building scalable Python backend, data platforms, and LLM-powered workflows that ship reliably.

United States
Message

What I'm looking for

I’m looking to lead backend and data/ML platform work—designing production APIs, real-time pipelines, and LLM automation—at a team that values reliability, scalable architecture, and hands-on engineering impact.

I’m a Senior Software Engineer with 10+ years of experience designing and building scalable backend systems, data platforms, and machine learning workflows using Python. I’m passionate about leveraging Python and modern technologies to solve complex problems and deliver reliable, high-impact systems.

At Scale AI, I built a Python-based data processing platform with FastAPI to replace brittle batch scripts, adding consistent validation, transformation, and versioning for large annotation datasets. I implemented LLM-powered workflows using OpenAI APIs, and scaled distributed task execution with Celery + Redis while designing a real-time ingestion pipeline with Apache Kafka.

I also strengthen production quality and performance through schema enforcement and validation with Pydantic, redesigned PostgreSQL queries and indexing strategies to reduce latency, and deployed microservices on AWS using Terraform. Previously, at Two Sigma, I built end-to-end ML pipelines with scikit-learn, XGBoost, PySpark, TensorFlow, and MLflow—turning research into scalable production systems.

Experience

Work history, roles, and key accomplishments

Scale AI logoSA
Current

Senior Software Engineer

Scale AI

Apr 2022 - Present (4 years 2 months)

Built a Python-based data processing platform with FastAPI to replace brittle batch scripts, improving consistent validation, transformation, and versioning of annotation datasets for model training. Implemented LLM-powered workflows with OpenAI APIs and added Celery + Redis for asynchronous processing, plus Kafka-based real-time ingestion to keep downstream systems updated without batch delays.

Two Sigma logoTS

Machine Learning Engineer

Two Sigma

Feb 2019 - Mar 2022 (3 years 1 month)

Built end-to-end machine learning pipelines using scikit-learn and XGBoost to transform raw financial data into predictive models for trading strategies. Developed time-series deep learning models with TensorFlow, created PySpark feature engineering pipelines for large datasets, and improved reliability with validation/backtesting plus MLflow experiment tracking.

Kloudless logoKL

Software Engineer Intern

Kloudless

Sep 2017 - Dec 2017 (3 months)

Built React components to visualize data from multiple cloud storage providers in a unified interface. Implemented data normalization for inconsistent third-party formats and improved data consistency using MongoDB/Mongoose and Mocha/Chai validation, alongside Selenium-based cross-platform testing.

Education

Degrees, certifications, and relevant coursework

University of California, Berkeley logoUB

University of California, Berkeley

Bachelor's degree in Computer Science, Computer Science

2015 - 2018

Earned a computer science bachelor’s degree at the University of California, Berkeley from 2015 to 2018.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan