We are looking for a freelance AI Evaluation Engineer to create challenging coding test cases for leading tech companies, focused on testing, evaluating, and improving AI systems.
Requirements
- Degree in Computer Science, Software Engineering or related fields
- 5+ years in software development, primarily Python
- Background in Full-Stack development
- Experience writing tests (functional, integration)
- Docker containers (running evaluations locally in containers)
- CI/CD understanding (GitHub Actions as a user)
- English proficiency - B2
Benefits
- Flexible working hours
- Potential to earn up to $17 per hour equivalent
