We're building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments.
Requirements
- Degree in Computer Science, Software Engineering, or related fields
- 5+ years in software development, primarily Python
- Background in full-stack development, with experience building React-based interfaces and robust back-end systems
- Experience writing tests (functional, integration)
- Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis)
- CI/CD understanding (GitHub Actions as a user)
- English proficiency - B2
Benefits
- Opportunity to work on a project-based AI platform
- Chance to earn up to $17 per hour equivalent
- Flexible part-time work schedule
