About
Learn more about the company and their company culture.
Stay safe on Himalayas
Never send money to companies. Jobs on Himalayas will never require payment from applicants.
Please let Nebius know you found this job on Himalayas. This helps us grow!
Learn more about the company and their company culture.
Here are other jobs you might want to apply for.
Nebius is a cutting-edge AI cloud platform that offers scalable infrastructure for developing and deploying AI solutions.
Why work at NebiusNebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we workHeadquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with RD hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI RD team.
In this role, you will lead the definition, development, and delivery of Nebius Token Factory’s inference capabilities, focusing on highly scalable, production-grade machine learning systems. You will be responsible for shaping the direction of our inference platform, driving product decisions that balance performance, reliability, and real-world customer needs. This includes working closely with engineering and research teams to design and optimize real-time and batch inference workflows, supporting customer PoCs, and translating technical challenges into clear product requirements.
You will work directly with customers and internal stakeholders to understand ML workflows at scale, identify bottlenecks, and define features that improve latency, throughput, orchestration, and deployment efficiency. You will also guide product adoption by delivering intuitive tools and robust infrastructure that solve complex inference problems across diverse use cases. This role requires a strong technical foundation in ML systems and a product mindset oriented toward execution, clarity, and long-term scalability.
You are welcome to work remotely from the US.
We offer competitive salaries, ranging from $204k - $255k OTE + equity based on your experience
We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!
Full Time
Salary: 204k-255k USD
Learn more about Nebius and their company culture.
At Nebius, we offer an advanced AI cloud platform designed for those who wish to develop, tune, and deploy their AI models with the most efficient infrastructure available. Our platform utilizes cutting-edge NVIDIA GPU clusters, including the H100 and H200, optimized for maximum performance with InfiniBand. One of the standout features of Nebius is our comprehensive fine-tuning ecosystem that includes on-demand GPUs and tools necessary for robust dataset processing, ensuring that AI teams can efficiently manage their computational resources according to demand.
We recognize the importance of AI inference in deploying real-world applications. Hence, we provide a resilient and cost-effective infrastructure that has been optimized for rapid deployment of Generative AI applications. Our services span the entire lifecycle of AI solutions, from model training to inference, making Nebius not just a GPU cloud but a full-stack AI platform. Additionally, we pride ourselves on supporting our clients with 24/7 expert guidance, offering resources to help architects and engineers harness our AI-optimized data centers to build scalable solutions.
201-500 employees
Employee count: 51-200
Employee count: 5000+
Salary: 113k-150k USD
Employee count: 1001-5000
Salary: 134k-194k USD
Explore the variety of open remote roles at Nebius, offering flexible work options across multiple disciplines and skill levels.
Employee count: 201-500
Salary: 98k-140k USD
Find your next opportunity by exploring profiles of companies that are similar to Nebius. Compare culture, benefits, and job openings on Himalayas.
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Find your next opportunity by exploring profiles of companies that are similar to Nebius. Compare culture, benefits, and job openings on Himalayas.
Lambda Labs is an AI infrastructure company providing GPU cloud services, servers, and workstations designed to accelerate deep learning and machine learning processes.
CoreWeave is a specialized AI cloud provider delivering a massive scale of GPU compute resources on the industry's fastest and most flexible infrastructure, purpose-built for AI, machine learning, and VFX rendering workloads.
Aethir is a decentralized cloud infrastructure (DCI) provider focused on delivering enterprise-grade GPU-as-a-Service for AI and cloud gaming applications.
Northern Data Group is a leading provider of high-performance computing (HPC) solutions that empower organizations to harness the immense potential of technology for innovation and growth.
TensorWave is a pioneering AI-focused cloud platform that leverages AMD's MI300X accelerators, enabling organizations to optimize AI workloads with enhanced performance and lower costs.
DataCrunch is a cloud service provider specializing in high-performance GPU servers and clusters for machine learning, powered by renewable energy.
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Join the remote work revolution
Join over 100,000 job seekers who get tailored alerts and access to top recruiters.