We are looking for a Freelance AI Evaluation Engineer to create challenging coding test cases for leading tech companies. The role involves reviewing and refining realistic coding tasks, writing comprehensive functional tests, and analyzing AI failures. The ideal candidate has a degree in Computer Science, Software Engineering, or a related field, and 5+ years of experience in software development, primarily in Python.
Requirements
- Degree in Computer Science, Software Engineering, or a related field
- 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
- Background in full-stack development, with an equal focus on building React-based interfaces and robust back-end systems
- Experience writing functional and integration tests, not just running them
- Experience with Docker (running evaluations locally in containers)
- Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency: B2
Benefits
- Flexible work schedule
- Opportunity to work on various projects
- Competitive compensation (up to $30 per hour equivalent)
