Poolside is building a world where AI is the engine behind economically valuable work and scientific progress. As a Member of Engineering (Reinforcement Learning), you'll work on improving reasoning and coding abilities of Large Language Models through reinforcement learning.
Requirements
- Research and experiment on ways to improve reasoning and code generation for LLMs.
- Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation.
- Design, analyze, and iterate on data generation and training of LLMs.
- Implement and iterate on RL training pipelines that scale reliably across domains.
- Diagnose training instabilities and failures, debug RL runs and propose mitigation methods.
- Write high-quality, reproducible and maintainable code.
Benefits
- Fully remote work & flexible hours
- 37 days/year of vacation & holidays
- Health insurance allowance for you and dependents
- Company-provided equipment
- Wellbeing, always-be-learning and home office allowances
- Frequent team get togethers
