John Wang
@johnwang1
Senior Data Engineer building trusted data platforms, pipelines, and analytics systems for real business impact.
What I'm looking for
I’m a Senior Data Engineer with 9+ years of experience building data platforms, pipelines, and analytics systems across ad tech, consulting, digital media, and healthcare. I focus on batch and near real-time data pipelines, data modeling, and warehouse/lakehouse systems that help teams trust their data.
At Chewy, I led development of the core data platform for Sponsored Ads, unifying impression, click, conversion, catalog, and campaign data into trusted datasets for advertiser analytics, ML feature generation, and closed-loop attribution. I designed scalable ELT workflows using medallion architecture to process 1TB+ of daily ad event and commerce data into curated bronze, silver, and gold layers.
I build feature-ready datasets by joining campaign configuration, catalog, pricing, and purchase signals, and I implement data quality checks, backfill patterns, and freshness monitoring to reduce reporting drift. I orchestrate batch and near real-time pipelines using Airflow, handling dependencies, scheduling, and failure recovery across large-scale Spark and SQL workflows.
Earlier, at Slalom and Admiral, I delivered cloud data platform modernization with Snowflake, Databricks, dbt, Azure Data Factory, and AWS/Azure, and built event-driven pipelines for high-volume publisher monetization. I’m comfortable partnering with product, engineering, data science, and business teams to turn complex data needs into practical, reliable solutions.
Experience
Work history, roles, and key accomplishments
Led development of Chewy Sponsored Ads core data platform, unifying impression, click, conversion, catalog, and campaign data into trusted datasets for advertiser analytics and closed-loop attribution across a platform with 20M+ active customers and billions of ad impressions annually. Designed scalable ELT medallion workflows processing 1TB+ of daily ad and commerce data and implemented data qual
Delivered cloud data platform modernization for enterprise clients by replacing fragmented legacy ETL with governed, cloud-native warehouse and lakehouse pipelines using Snowflake, Databricks, dbt, Azure Data Factory, and AWS/Azure. Built ingestion and transformation pipelines for millions of ERP and operational records per day and migrated reporting logic into reusable dimensional models, improvi
Data Engineer
Admiral
May 2018 - Oct 2019 (1 year 5 months)
Built event-driven data pipelines for Admiral’s publisher monetization platform using Python and SQL to transform pageview, adblock, consent, registration, and conversion signals into analytics-ready datasets for customer reporting and product decisioning. Implemented validation, deduplication, and anomaly checks in a noisy browser-event environment to reduce false reporting and distinguish teleme
Data Analyst / Data Engineer
Exactech
Sep 2017 - May 2018 (8 months)
Built ETL pipelines and analytical datasets for orthopedic and surgical systems by organizing procedure logs, implant metadata, software data, and de-identified case records into trusted reporting models. Applied HIPAA-compliant data engineering practices (governed access, de-identified reporting, reproducible transformations, and schema control) to improve traceability and maintain compliance in
Education
Degrees, certifications, and relevant coursework
University of Florida
Bachelor of Science, Computer Science
2013 - 2017
Earned a Bachelor of Science in Computer Science at the University of Florida from 2013 to 2017.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Social media
Job categories
Interested in hiring John?
You can contact John and 90k+ other talented remote workers on Himalayas.
Message JohnFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
