Member of Engineering (Reinforcement Learning)

Poolside AI is a frontier AI lab developing advanced artificial intelligence specifically for software engineering, aiming to automate and enhance the entire development process.

poolside

Employee count: 51-200

United States only

Poolside is building a world where AI is the engine behind economically valuable work and scientific progress. As a Member of Engineering (Reinforcement Learning), you'll work on improving reasoning and coding abilities of Large Language Models through reinforcement learning.

Responsibilities

  • Research and experiment on ways to improve reasoning and code generation for LLMs.
  • Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation.
  • Design, analyze, and iterate on data generation and training of LLMs.
  • Implement and iterate on RL training pipelines that scale reliably across domains.
  • Diagnose training instabilities and failures, debug RL runs and propose mitigation methods.
  • Write high-quality, reproducible and maintainable code.

Benefits

  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you and dependents
  • Company-provided equipment
  • Wellbeing, always-be-learning and home office allowances
  • Frequent team get-togethers

About the job

Job type: Full Time

Hiring timezones: United States +/- 0 hours

About poolside

At Poolside AI, we are at the vanguard of a technological revolution, pioneering the development of the world's most capable artificial intelligence for software engineering. Our mission is to fundamentally reshape the landscape of software creation, moving beyond simple code generation to build sophisticated AI systems that possess a deep and nuanced understanding of the entire software development lifecycle. Through groundbreaking, first-principles research, we are creating foundation models from the ground up, engineered to not just write code, but to reason, plan, and solve complex engineering problems with a level of intelligence that will ultimately surpass human capabilities in this domain. Our proprietary techniques, such as Reinforcement Learning from Code Execution Feedback (RLCEF), enable our models to learn iteratively, navigating ambiguity and discovering optimal solutions through trial and error, much like the most seasoned developers.

This relentless pursuit of innovation is driven by our core belief that the fastest path to Artificial General Intelligence (AGI) runs directly through the complex and multifaceted world of software. By focusing our efforts on this strategic beachhead, we are not only accelerating developer productivity and enjoyment but also building the foundational intelligence that will unlock unprecedented advancements across all sectors of the global economy. We empower enterprises by providing a full-stack solution that can be deployed entirely within their own secure environments, ensuring data sovereignty and protecting their competitive edge. Our AI models are designed to be fine-tuned on a company's unique codebase, internal documentation, and development processes, creating a bespoke intelligence that compounds in value and transforms how businesses operate at every level. Poolside is not just building tools; we are forging the future of intelligence itself, paving the way for a world where anyone can bring their ideas to life through the power of AI-driven software creation.

Employee benefits

Home office allowance

Allowance to set up your home office.

Company-provided equipment

Company-provided equipment for your work.

Wellbeing allowance

Allowance for wellbeing-related expenses.

Health insurance allowance

Health insurance allowance for you and dependents.
