DigitalOcean is seeking a Senior Engineer 2 to play a key technical role in our AI Inference Optimization team. You will be responsible for the architectural decisions that maximize throughput and minimize latency for the world’s most advanced large models.
Requirements
- 5+ years of experience in high-performance computing or AI infrastructure
- Proven track record of solving compute utilization and memory bandwidth bottlenecks
- Deep familiarity with the generative AI landscape (LLMs, VLMs, LMMs)
- Hands-on experience with attention-layer optimizations and parallelization strategies across distributed GPU environments
- Comprehensive understanding of NVIDIA and AMD GPU architectures and their respective software ecosystems
- Extensive experience integrating, building with, and contributing to open-source software projects
- Excellent system design skills, particularly in low-level GPU programming: optimization, memory access patterns, and parallel execution
- Experience acting as a technical lead, driving design and delivery through cross-functional alignment and expert-level delegation
Benefits
- Competitive salary range ($167,200 to $209,000)
- Bonus opportunities
- Equity compensation
- Employee Stock Purchase Program
- Flexible time off policy
- Employee Assistance Program
- Local Employee Meetups
- Reimbursement for relevant conferences, training, and education
- Access to LinkedIn Learning's 10,000+ courses
