We're looking for a Manager of Reliability Operations to lead how we detect, respond to, and learn from failures across our platform ecosystem. The role sits at the intersection of Operations and Engineering, bringing structure to incident response, accountability to follow-through, and clarity to reliability insights.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
- 7+ experience in systems operations, site reliability, or platform engineering
- 2+ years experience leading teams or major operational functions
- Proven experience managing incidents in a 24/7 production environment
- Strong background in troubleshooting, root cause analysis, and operational improvement
- Experience with change management practices
- Ability to translate complex technical data into clear insights
- Strong communication skills, especially in high-pressure situations
Benefits
- Traditional and Roth 401k with company matching
- A collaborative team culture
- Consistent/set work hours
- Challenging non-redundant daily duties
- A voice in how things get done
