Bob H
@bobh
Senior data engineer specializing in large-scale ML data platforms, pipelines, and research enablement.
What I'm looking for
I am a senior data engineer with deep experience building and operating large-scale data platforms for ML and research. I have supported high-throughput training, evaluation, and production analytics for conversational models and helped scale data systems for mRNA vaccine research.
At OpenAI I worked on the data systems behind ChatGPT, owning pipelines that ingest multi-terabyte daily language datasets and building dataset versioning, validation, and lineage to ensure trustworthy training data. I partnered closely with researchers and ML teams to deliver feature-ready datasets, embedding and vector pipelines, and near-real-time streaming signals for safety and performance monitoring.
At Moderna I helped integrate experimental, genomics, and clinical data to enable reproducible research and predictive modeling in a regulated environment. I implemented data lineage, versioning, and auditability, and optimized availability and processing performance to accelerate analysis for vaccine R&D.
Earlier, I built enterprise healthcare data lakes and ETL pipelines with strong data quality and HIPAA-compliant controls while helping migrate systems toward cloud platforms. I focus on reliability, observability, cost efficiency, and enabling teams to trust and act on their data.
Experience
Work history, roles, and key accomplishments
Built and maintained large-scale data systems for ChatGPT training, evaluation, and production analytics, owning multi-terabyte ingestion pipelines and improving dataset trust through versioning and validation.
Developed data platforms and pipelines for COVID-19 mRNA vaccine R&D and clinical analytics, enabling reproducible research through lineage, versioning, and auditability in a regulated environment.
Data Engineer
Sentara Health
Aug 2017 - Mar 2020 (2 years 7 months)
Built enterprise data lake and ETL pipelines integrating EHR, claims, and operational data, improving data quality and implementing HIPAA-compliant access controls while migrating toward Azure.
Education
Degrees, certifications, and relevant coursework
Florida International University
Bachelor of Science, Computer Science
2013 - 2017
Completed a Bachelor of Science in Computer Science, supporting preparation for roles in data engineering and analytics.
Availability
Location
Authorized to work in
Social media
Job categories
Interested in hiring Bob?
You can contact Bob and 90k+ other talented remote workers on Himalayas.
Message BobFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
