Join Mindrift as a Freelance Agent Evaluation Engineer and work on challenging AI coding agent tasks. Create tasks, evaluation criteria, and virtual environments to test and improve AI systems.
Requirements
- Degree in Computer Science, Software Engineering, or a related field
- 5+ years in software development, primarily Python
- Background in full-stack development, with experience building React-based interfaces and robust back-end systems
- Experience writing tests (functional and integration), not just running them
- Experience with Docker containers and familiarity with infrastructure tools (Postgres, Kafka, Redis)
- Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results)
Benefits
- Up to $50 per hour equivalent
- An estimated 20 hours of work per task
