Gramian Consultancy is seeking an AI Evaluation Engineer to design benchmark tasks for complex data analysis workflows. The ideal candidate has 5+ years of experience in data analysis and strong proficiency in Python and SQL.
Requirements
- 5+ years of experience in data analysis or analytics-heavy roles
- Strong proficiency in Python (pandas, NumPy) and SQL
- Experience working with real-world, messy datasets (CSV, JSON, logs, reports)
- Ability to design analytical problems with clear, verifiable answers
- Solid understanding of statistics (distributions, correlations, outliers)
- Familiarity with AI benchmarks or evaluation environments (e.g., SWE-bench or similar)
- Hands-on experience with Docker (Dockerfiles, image builds, debugging)
Benefits
- Flexible work arrangements
