Overview / Objective
We are seeking a Data Engineer to join our Sports Analytics & Engineering Practice. This role sits at the heart of our cloud-native data ecosystem and is responsible for bringing data into the platform reliably, securely, and at scale.
You will design and operate ingestion pipelines that power downstream analytics, fan engagement, marketing, and reporting use cases across the league ecosystem. Your work ensures that raw data—batch and streaming—arrives on time, complete, governed, and analytics-ready, forming the backbone of a world-class customer data platform.
Key Responsibilities
- Design, build, and operate robust ingestion pipelines for batch and near-real-time data using AWS-native services
- Implement CDC-based ingestion patterns for databases, SaaS platforms, and external partners
- Standardize ingestion frameworks for files, APIs, event streams, and cross-account data sharing
- Define and maintain raw and staging data models that preserve source fidelity and lineage
- Partner with source system owners to define ingestion SLAs, contracts, schemas, and change management strategies
- Ensure ingestion pipelines meet data quality, observability, and reliability standards
- Implement metadata capture, schema evolution handling, and data validation at ingestion time
- Automate infrastructure using AWS CDK and integrate CI/CD pipelines via CodeCommit and CodePipeline
- Optimize ingestion workflows for scalability, cost efficiency, and fault tolerance
- Support Agile delivery and collaborate closely with offshore engineering teams
