We are hiring for a Software Engineer position in Compute Infrastructure to build the platform behind OpenAI's research and products. The role involves designing and operating infrastructure across accelerators, CPUs, NICs, switches, networking protocols, storage, data centers, cluster orchestration, scheduling, and fleet health. The ideal candidate will have strong software engineering skills, experience with distributed systems, and the ability to debug complex system behavior.
Requirements
- Strong software engineering skills and experience building, operating, or improving production infrastructure systems
- Experience in one or more relevant areas such as distributed systems, operating systems, networking protocols, RDMA, NCCL or collective communication, storage, Kubernetes, scheduling, observability, reliability engineering, high-performance computing, GPU infrastructure, CaaS, agent infrastructure, hardware-aware performance optimization, benchmarking, developer experience, or infrastructure tooling
- Ability to debug complex system behavior across software, hardware, networking, and workload layers, then turn findings into robust improvements
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance
