Join Cohere's mission to scale intelligence to serve humanity by developing synthetic data pipelines for advanced language models. As a Member of Technical Staff, you will bridge research and engineering to drive innovation in natural language processing.
Requirements
- Strong software engineering skills, with proficiency in Python and experience building data pipelines
- Familiarity with data processing frameworks such as Apache Spark, Apache Beam, Pandas, or similar tools
- Experience working with LLMs through work projects, open-source contributions or personal experimentation
- Familiarity with LLM inference frameworks such as vLLM and TensorRT
- Experience working with large-scale datasets, including web data, code data, and multilingual corpora
- A passion for bridging research and engineering to solve complex data-related challenges in AI model training
Benefits
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)
