Bitdeer AI Lab is looking for a Research Engineer to help build the data foundation for frontier AI models. The role involves designing and implementing large-scale data processing pipelines, acquiring training data from open-source corpora, and validating data quality through proxy training runs and downstream evaluations.
Requirements
- Strong Python engineering skills
- Experience with distributed data processing frameworks
- Solid experience designing and implementing large-scale data processing pipelines
- Experience acquiring training data from open-source corpora
- Experience designing training data mixtures
- Experience validating data quality through proxy training runs and downstream evaluations
- Experience with synthetic data generation and data augmentation
Benefits
- Attractive welfare benefits
- Professional development opportunities, such as training and mentoring
