HimalayasHimalayas logo
FluidstackFL

VP, Engineering

What started as an ambitious initiative in the tech sector has blossomed into Fluidstack, a premier AI cloud platform dedicated to providing unparalleled compute power for leading AI laboratories across the globe.

Fluidstack

Employee count: 51-200

Salary: 280k-450k USD

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

About Fluidstack

At Fluidstack, we’re building the infrastructure for abundant intelligence. We partner with top AI labs, governments, and enterprises—including Mistral, Poolside, Black Forest Labs, Meta, and more—to unlock compute at the speed of light. We’re working with urgency to make AGI a reality. As such, our team is highly motivated and committed to delivering world‑class infrastructure. We treat our customers’ outcomes as our own, taking pride in the systems we build and the trust we earn. If you’re motivated by purpose, obsessed with excellence, and ready to work very hard to accelerate the future of intelligence, join us in building what’s next.

About The Role

As VP of Software Engineering, you will own the full software and SRE organizations responsible for our managed orchestration (Kubernetes and SLURM) offerings as well as our managed inference services. You will set the technical direction, build and scale the team, and personally drive architectural decisions that determine how the world’s leading AI organizations train and serve their models. You still ship production systems at scale and can go deep on a kernel scheduler, NCCL collective, or KV cache implementation when it matters. You think in terms of systems boundaries, failure modes, and second‑order effects. You know how to grow engineering organizations without losing velocity. You ensure we strike the right balance between fast delivery and reliable operation.

You Will

  • Own and scale the engineering organization across managed Kubernetes and SLURM, as well as our managed inference product, including Software Engineers and SREs across all three product areas.
  • Set the technical and architectural roadmap for cluster orchestration and AI inference serving, from bare‑metal provisioning through control‑plane design and developer‑facing APIs.
  • Drive reliability, performance, and scalability standards across the stack, owning SLAs for customers running production AI training and inference workloads on Fluidstack infrastructure.
  • Partner closely with Product, Sales, and Customer Success to translate customer needs from top AI labs and enterprises into concrete engineering investments and prioritization decisions.
  • Establish engineering culture, hiring bar, and operational practices that attract and retain exceptional talent in a competitive market.
  • Remain hands‑on at the level of design reviews, architecture decisions, and critical incident response, maintaining deep technical credibility with the team.
  • Build and maintain a high‑trust, high‑accountability team environment where engineers own outcomes end‑to‑end, from design through production operations.

Basic Qualifications

  • 10+ years of software engineering or systems engineering experience, with at least 4 years managing engineering teams including both Software Engineers and SREs.
  • Deep hands‑on experience with Kubernetes and SLURM in production environments, including scheduling internals, resource management, and multi‑tenant cluster operations.
  • Strong background in bare‑metal infrastructure and GPU/accelerator systems, including server imaging, networking (InfiniBand/RoCE), firmware, and hardware lifecycle management.
  • Demonstrated ability to build and scale AI inference serving infrastructure, including familiarity with inference optimization techniques (quantization, continuous batching, speculative decoding, KV cache management).
  • Track record of building and growing high‑performing engineering organizations of 40+ engineers across complex, cross‑functional domains.
  • Strong communicator who can represent technical strategy to executive leadership, customers, and board‑level stakeholders.

Preferred Qualifications

  • Prior experience in an AI infrastructure neocloud, hyperscaler (AWS, GCP, Azure), or AI lab (OpenAI, Anthropic, DeepMind) in a senior technical or engineering leadership role.
  • Hands‑on experience with large‑scale GPU cluster operations: multi‑node training job scheduling, collective communication tuning, topology‑aware placement, and fault recovery.
  • Familiarity with frontier model inference serving frameworks (vLLM, TensorRT‑LLM, SGLang) and the systems‑level tradeoffs involved in latency, throughput, and cost optimization.
  • Experience with GPU NPI processes, cluster bring‑up, and hardware qualification at scale.
  • Exposure to agentic inference workloads and the distinct systems requirements they impose relative to batch or streaming inference.
  • Contributions to open‑source infrastructure projects in the Kubernetes, SLURM, or MLOps ecosystems.

Salary And Benefits

The base salary range for this role is $280,000 to $450,000. Starting salary will be determined based on relevant experience, skills, and market location. In addition to base salary, this role includes a meaningful equity package, performance bonus, and the following benefits:

  • Competitive total compensation package (cash + equity)
  • Health, dental, and vision insurance
  • Retirement plan
  • Generous PTO policy

We are committed to pay equity and transparency.

Equal Employment Opportunity

Fluidstack is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Fluidstack will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

#J-18808-Ljbffr

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Salary

Salary: 280k-450k USD

Location requirements

Hiring timezones

United States +/- 0 hours

About Fluidstack

Learn more about Fluidstack and their company culture.

View company profile

What started as an ambitious initiative in the tech sector has blossomed into Fluidstack, a premier AI cloud platform dedicated to providing unparalleled compute power for leading AI laboratories across the globe. Founded with a vision to democratize access to top-tier GPU resources, Fluidstack has quickly positioned itself as a trusted partner for companies requiring substantial computational resources for AI training and inference.

Fluidstack's offerings are centered around instant access to thousands of NVIDIA GPUs, including cutting-edge models such as the H100 and A100. Organizations can deploy large-scale GPU clusters that can exceed 4,096 GPUs, made possible through their fully managed infrastructure utilizing Slurm and Kubernetes. This deployment capability is complemented by impressive storage solutions, featuring over 1PB of shared storage and high-speed InfiniBand for optimal data handling. With a commitment to customer satisfaction, Fluidstack promises a remarkable 99% uptime and industry-leading 15-minute response times, making it an ideal choice for companies needing robust support while focusing on their groundbreaking AI projects. Trusted by major players in the AI sphere, Fluidstack continues to expand its services, launching new GPU instances and enhancing its infrastructure to meet the demanding needs of AI businesses worldwide.

Claim this profileFluidstack logoFL

Fluidstack

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

8 remote jobs at Fluidstack

Explore the variety of open remote roles at Fluidstack, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Fluidstack

Remote companies like Fluidstack

Find your next opportunity by exploring profiles of companies that are similar to Fluidstack. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan