Member of Engineering (Reinforcement Learning)

Poolside AI is a frontier AI lab developing advanced artificial intelligence specifically for software engineering, aiming to automate and enhance the entire development process.

poolside

Employee count: 51-200

United States only

Poolside is building a world where AI is the engine behind economically valuable work and scientific progress. As a Member of Engineering (Reinforcement Learning), you'll work on improving reasoning and coding abilities of Large Language Models through reinforcement learning.

Requirements

Research and experiment on ways to improve reasoning and code generation for LLMs.
Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation.
Design, analyze, and iterate on data generation and training of LLMs.
Implement and iterate on RL training pipelines that scale reliably across domains.
Diagnose training instabilities and failures, debug RL runs and propose mitigation methods.
Write high-quality, reproducible and maintainable code.

Benefits

Fully remote work & flexible hours
37 days/year of vacation & holidays
Health insurance allowance for you and dependents
Company-provided equipment
Wellbeing, always-be-learning and home office allowances
Frequent team get-togethers

About the job

Apply before

Jun 27, 2026

Posted on

Apr 28, 2026

Hiring timezones

United States +/- 0 hours

Job categories

Reinforcement Learning Engineer, Machine Learning Engineer, Reinforcement Learning

Skills

Reinforcement Learning, Large Language Models, Code Generation, LLM Training Pipelines, Data Generation, LLM Training, Research, Code Reproducibility, Keep Poolside Remote

About poolside

At Poolside AI, we are at the vanguard of a technological revolution, pioneering the development of the world's most capable artificial intelligence for software engineering. Our mission is to fundamentally reshape the landscape of software creation, moving beyond simple code generation to build sophisticated AI systems that possess a deep and nuanced understanding of the entire software development lifecycle. Through groundbreaking, first-principles research, we are creating foundation models from the ground up, engineered to not just write code, but to reason, plan, and solve complex engineering problems with a level of intelligence that will ultimately surpass human capabilities in this domain. Our proprietary techniques, such as Reinforcement Learning from Code Execution Feedback (RLCEF), enable our models to learn iteratively, navigating ambiguity and discovering optimal solutions through trial and error, much like the most seasoned developers.

This relentless pursuit of innovation is driven by our core belief that the fastest path to Artificial General Intelligence (AGI) runs directly through the complex and multifaceted world of software. By focusing our efforts on this strategic beachhead, we are not only accelerating developer productivity and enjoyment but also building the foundational intelligence that will unlock unprecedented advancements across all sectors of the global economy. We empower enterprises by providing a full-stack solution that can be deployed entirely within their own secure environments, ensuring data sovereignty and protecting their competitive edge. Our AI models are designed to be fine-tuned on a company's unique codebase, internal documentation, and development processes, creating a bespoke intelligence that compounds in value and transforms how businesses operate at every level. Poolside is not just building tools; we are forging the future of intelligence itself, paving the way for a world where anyone can bring their ideas to life through the power of AI-driven software creation.

Tech stack

Dremio

Redpanda

ClickHouse

Employee benefits

Home office allowance

Allowance to set up your home office.

Company-provided equipment

Company-provided equipment for your work.

Wellbeing allowance

Allowance for wellbeing-related expenses.

Health insurance allowance

Health insurance allowance for you and dependents.

About the job

Job type

Full Time

Experience level

Entry-level

Location requirements

United States

poolside

Company size

51-200 employees

Founded in

2023

Chief executive officer

Jason Warner, Eiso Kant

Markets

Artificial Intelligence, Software Engineering, Developer Tools, Machine Learning, AI Model Training, Enterprise Software, Software Development Lifecycle Automation, Artificial General Intelligence (AGI), Code Generation, AI Infrastructure

Employees live in

France
