Himalayas logo
NebiusNE

Senior Technical Product Manager Token Factory - Inference

Nebius is a cutting-edge AI cloud platform that offers scalable infrastructure for developing and deploying AI solutions.

Nebius

Employee count: 201-500

Salary: 204k-255k USD

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Why work at NebiusNebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.

Where we workHeadquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with RD hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI RD team.

The role

In this role, you will lead the definition, development, and delivery of Nebius Token Factory’s inference capabilities, focusing on highly scalable, production-grade machine learning systems. You will be responsible for shaping the direction of our inference platform, driving product decisions that balance performance, reliability, and real-world customer needs. This includes working closely with engineering and research teams to design and optimize real-time and batch inference workflows, supporting customer PoCs, and translating technical challenges into clear product requirements.

You will work directly with customers and internal stakeholders to understand ML workflows at scale, identify bottlenecks, and define features that improve latency, throughput, orchestration, and deployment efficiency. You will also guide product adoption by delivering intuitive tools and robust infrastructure that solve complex inference problems across diverse use cases. This role requires a strong technical foundation in ML systems and a product mindset oriented toward execution, clarity, and long-term scalability.

You are welcome to work remotely from the US.

Your responsibilities will include:

  • Own the product roadmap for Nebius Token Factory inference capabilities, focusing on high-load, production-grade ML scenarios.
  • Be involved in customer PoCs involving distributed ML model deployment, inference orchestration, and optimization.
  • Work closely with engineering and research teams to shape scalable infrastructure for real-time and batch inference.
  • Act as the technical voice in customer conversations, translating ML workflows into product requirements.
  • Drive product adoption by delivering tools and features that solve real-world inference problems at scale.

We expect you to have:

  • 3–5 years of product management experience, ideally in cloud infrastructure, ML platforms, or developer tools.
  • Strong technical foundation (e.g. Computer Science or Engineering degree) with ability to dive deep into model architectures and serving systems.
  • Familiarity with modern ML inference tools and frameworks (e.g., Triton Inference Server, vLLM, SGLang, TensorRT-LLM, Dynamo, KServe, Ray Serve).
  • Proven track record of delivering technically complex products that support distributed and high-throughput ML pipelines.
  • Strong communicator with experience working across engineering, research, and customer-facing teams.

It will be an added bonus if you have:

  • Deep understanding of modern ML architectures, including transformer-based models and their inference characteristics.
  • Experience delivering or supporting ML solutions in production as part of a customer-facing or solutions role.
  • Knowledge of MLOps or AIOps cycles, including observability, performance optimization, and continuous delivery of ML systems.

Key employee benefits in the US:

  • Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) plan: Up to 4% company match with immediate vesting.
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote work reimbursement: Up to $85/month for mobile and internet.
  • Disability life insurance: Company-paid short-term, long-term and life insurance coverage.

Compensation

We offer competitive salaries, ranging from $204k - $255k OTE + equity based on your experience

What we offer

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Salary

Salary: 204k-255k USD

Location requirements

Hiring timezones

United States +/- 0 hours

About Nebius

Learn more about Nebius and their company culture.

View company profile

At Nebius, we offer an advanced AI cloud platform designed for those who wish to develop, tune, and deploy their AI models with the most efficient infrastructure available. Our platform utilizes cutting-edge NVIDIA GPU clusters, including the H100 and H200, optimized for maximum performance with InfiniBand. One of the standout features of Nebius is our comprehensive fine-tuning ecosystem that includes on-demand GPUs and tools necessary for robust dataset processing, ensuring that AI teams can efficiently manage their computational resources according to demand.

We recognize the importance of AI inference in deploying real-world applications. Hence, we provide a resilient and cost-effective infrastructure that has been optimized for rapid deployment of Generative AI applications. Our services span the entire lifecycle of AI solutions, from model training to inference, making Nebius not just a GPU cloud but a full-stack AI platform. Additionally, we pride ourselves on supporting our clients with 24/7 expert guidance, offering resources to help architects and engineers harness our AI-optimized data centers to build scalable solutions.

Claim this profileNebius logoNE

Nebius

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

23 remote jobs at Nebius

Explore the variety of open remote roles at Nebius, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Nebius

Remote companies like Nebius

Find your next opportunity by exploring profiles of companies that are similar to Nebius. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Nebius hiring Senior Technical Product Manager Token Factory - Inference • Remote (Work from Home) | Himalayas