HimalayasHimalayas logo
Bryan SchaeferBS
Looking for a job

Bryan Schaefer

@bryanschaefer1

Lead Data Engineer building scalable data platforms for analytics and machine learning.

United States
Message

What I'm looking for

I seek a senior data engineering role building reliable, scalable data platforms that support analytics and ML, with strong CI/CD, observability, and collaborative teams.

I am a Lead Data Engineer specializing in architecting and operating scalable data platforms that power analytics, reporting, and machine learning workloads. I design and implement both batch and streaming pipelines, cloud-native architectures, and distributed processing frameworks using modern data stack technologies.

At Flatiron Health I directed development of 30 batch and streaming pipelines processing 5TB daily and integrated 12 healthcare data sources to deliver curated analytics datasets that support BI and ML teams. I established data reliability and observability frameworks across 50 production pipelines, reducing failure rates by 35% and delivered feature-ready datasets for 20 ML models.

Previously, I engineered ETL and streaming pipelines at Flare and Uber, optimizing multi-terabyte workflows, improving query performance and pipeline throughput, and strengthening data quality monitoring to reduce incidents. I mentor engineers, introduced CI/CD and modular pipeline standards, and translate complex business and clinical requirements into production-grade, maintainable data solutions.

I bring strong foundations in data modeling, orchestration, observability, and automated data quality controls, and I collaborate closely with engineering, analytics, and data science teams to ensure platform scalability, reliability, and long-term sustainability.

Experience

Work history, roles, and key accomplishments

FH
Current

Lead Data Engineer

Flatiron Health

Sep 2021 - Present (4 years 6 months)

Directed development of 30 batch and streaming pipelines processing 5TB daily to enable analytics and ML, established observability and reliability frameworks that reduced pipeline failures by 35%, and mentored a team of 6 engineers while improving deployment efficiency by 40%.

UB

Data Engineer

Uber

Feb 2018 - Oct 2018 (8 months)

Built high-throughput Spark pipelines processing billions of ride and event records, improved pipeline throughput by 25% via partitioning strategies, and maintained reliability across 20 production workflows supporting operational analytics.

MI

Data Analyst Intern

MindEase

Jan 2017 - Jan 2018 (1 year)

Analyzed operational datasets with SQL and Python to support 10 reporting dashboards, implemented validation checks that improved reporting accuracy by 20%, and produced BI dashboards tracking 15 KPIs.

Education

Degrees, certifications, and relevant coursework

Texas Tech University logoTU

Texas Tech University

Bachelor of Science, Computer Science

2013 - 2016

Grade: 3.8

Completed a Bachelor of Science in Computer Science with coursework in algorithms, data structures, distributed systems, and database systems; applied Python, Java, and SQL in academic projects.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan