Lambda is seeking a Senior HPC Systems Architect to design and architect advanced HPC systems optimized for large-scale computational workloads and AI applications. The ideal candidate will have extensive experience designing, developing, and testing large-scale high-performance computing (HPC) infrastructures.
Requirements
- 8+ years of experience designing and architecting large-scale HPC and distributed computing systems
- Expert-level knowledge of HPC hardware including GPU clusters, compute nodes, high-speed networking (InfiniBand, Ethernet), and distributed storage
- Hands-on experience with direct-to-chip liquid cooling systems
- Proven expertise in creating robust performance benchmarks, capacity planning, and system validation
- Exceptional skills in system architecture, design documentation, and technical specifications
- Ability to work collaboratively across teams, ensuring alignment of technical solutions with business objectives
- Self-motivated, strategic thinker with strong analytical and problem-solving capabilities
Benefits
- Health, dental, and vision coverage for you and your dependents
- Wellness and Commuter stipends for select roles
- 401k Plan with 2% company match (USA employees)
- Flexible Paid Time Off Plan that we all actually use