HimalayasHimalayas logo
NexGen CloudNC

HPC Cluster Architect

NexGen Cloud is a global leader in sustainable AI Cloud solutions, providing high-performance GPU infrastructure and on-demand computing to empower businesses and innovators.

NexGen Cloud

Employee count: 51-200

United Kingdom only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

HPC Cluster Architect

Location: UK / Remote

Reporting to: Head of Infrastructure

Department: Infrastructure

ABOUT NEXGEN CLOUD:

NexGen Cloud is the company behind Hyperstack, a full-stack AI cloud serving tens of thousands of customers from AI researchers to enterprises running the world's most compute-intensive workloads. We deliver on-demand and private GPU infrastructure to teams who treat performance as a requirement, not a feature.

We're a tight-knit, fast-moving team working at the cutting edge of AI cloud infrastructure. We practice what we preach, equipping our people with AI at every level so we can solve harder problems, ship faster, and keep raising the bar for what enterprise GPU infrastructure looks like.

THE ROLE: HPC Cluster Architect

This role exists because NexGen Cloud is winning large-scale dedicated GPU cluster contracts and needs someone who can own the full architecture cycle — from first customer conversation to production deployment. This is a capability that doesn’t exist yet in a dedicated role; we’re building it now because the pipeline demands it.. You’ll have direct ownership over cluster architecture across compute, networking, storage, and physical design — translating customer requirements into production-ready, commercially optimised GPU deployments.

Role positioning: This is a senior hands-on role for someone who has lived and breathed HPC cluster design — and who wants to be the technical authority, not one voice in a committee. You’ll own designs end-to-end and see them go live.

WHAT YOU’LL BE DOING

Rather than a long checklist, here’s what success in this role looks like:

  • Own end-to-end cluster architecture for large-scale NVIDIA GPU deployments — from customer requirement through rack layouts, BOM, power and cooling design, to production handover
  • Design high-performance network fabrics across compute (InfiniBand, RDMA, NVLink/NVSwitch), storage, and WAN — defining topology, oversubscription models, and scaling strategies
  • Engage directly with OEMs and vendors — validating hardware configurations, reviewing quotes, and ensuring designs are both technically sound and commercially optimised
  • Provide technical oversight during deployment and bring-up — supporting hardware validation, performance testing, and acting as escalation point for complex integration issues
  • Act as a senior technical leader across Solutions Architecture, Cloud Engineering, and data centre partners — contributing to standardised reference designs and building out the HPC engineering function

ABOUT YOU:

We’re more interested in how you think and work than in a perfect CV. You’ll likely bring a combination of the following:

Essential

  • Proven experience designing and delivering GPU-based HPC or AI clusters at scale — covering the full lifecycle from design through procurement, deployment, and validation
    • Deep hands-on knowledge of NVIDIA GPU platforms (H100/H200/B-series) and NVIDIA reference architectures
    • Strong InfiniBand/RDMA design experience — topology, performance tuning, and high-performance Ethernet fabrics
    • Solid grounding in Linux systems, PCIe topology, NUMA alignment, and server-level performance considerations
    • Background from an OEM, hyperscaler, neo-cloud, or enterprise/research HPC environment — with demonstrable exposure to the full design-to-deployment lifecycle
    • Confident engaging with customers, vendors, OEMs, and internal engineering teams as a technical authority — able to translate complex design trade-offs into clear decisions

Nice to Have

  • Experience with Spectrum-X or next-generation Ethernet fabrics
  • Prior involvement in large-scale cluster deployments (1,000+ GPUs) and performance benchmarking (NCCL, MLPerf)
  • Exposure to both air-cooled and liquid-cooled HPC environments, and/or automation/infrastructure-as-code

WHAT WE OFFER

  • Competitive salary and annual discretionary bonus scheme
  • Employee wellbeing benefits
  • 25 days of holiday, plus public holidays
  • Flexible working arrangements (remote or hybrid, depending on role and location)
  • Real ownership and autonomy, with the trust to take initiative and experiment
  • The opportunity to make a visible, meaningful impact as we scale
  • Clear career progression and growth opportunities in a fast-growing company
  • A collaborative, international culture built on trust, transparency, and ownership
  • The chance to help shape NexGen Cloud’s team, culture, and future alongside ambitious, mission-driven colleagues

MORE INFORMATION

Head over to our NexGen Cloud careers page to view current opening and follow us on LinkedIn and X to learn more about our journey, newest releases and hear exciting news in the neocloud space.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Location requirements

Hiring timezones

United Kingdom +/- 0 hours

About NexGen Cloud

Learn more about NexGen Cloud and their company culture.

View company profile

For too long, businesses and innovators in Europe have faced significant hurdles in accessing the high-performance computing resources necessary to compete on a global scale. Many have been reliant on US-based hyperscalers, which can create challenges around data sovereignty, compliance with regional regulations, and cost-effectiveness. Our customers, ranging from burgeoning AI startups to large-scale enterprises, require powerful, scalable, and secure AI infrastructure that doesn't compromise on their data's integrity or their budget. They need a partner who understands the European landscape and is committed to fostering technological innovation within the region.

At NexGen Cloud, we address these challenges head-on by providing a leading-edge, sustainable AI Cloud infrastructure. Founded in 2020, we are on a mission to democratize access to accelerated computing. Our customers benefit from our AI Supercloud, a bespoke environment designed for large-scale, compute-intensive AI projects, and Hyperstack, our on-demand GPU cloud platform that delivers enterprise-grade GPU access in minutes. We understand that our clients' success depends on having access to the latest technology, which is why we are proud to be an NVIDIA Elite Partner, offering access to the most advanced GPUs, including the H100 and the upcoming Blackwell platform. A core tenet of our service is our commitment to sustainability; all our data centers are powered by 100% renewable energy. This allows our customers to scale their AI workloads and innovate responsibly. By offering sovereign cloud solutions within Europe and North America, we empower our clients to execute sensitive AI applications and research while maintaining full control over their data, ensuring they can stay ahead in the next evolution of technology.

Employee benefits

Learn about the employee benefits and perks provided at NexGen Cloud.

View benefits

Flexible work hours

Offers flexible work hours and remote work options.

Paid Time Off

NexGen Cloud offers paid time off to its employees.

Career development

Opportunity to work with cutting-edge technologies in cloud computing and GPU infrastructure with opportunities for growth and career development.

View NexGen Cloud's employee benefits
Claim this profileNexGen Cloud logoNC

NexGen Cloud

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

3 remote jobs at NexGen Cloud

Explore the variety of open remote roles at NexGen Cloud, offering flexible work options across multiple disciplines and skill levels.

View all jobs at NexGen Cloud

Remote companies like NexGen Cloud

Find your next opportunity by exploring profiles of companies that are similar to NexGen Cloud. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan