We are looking for a Senior Site Reliability Engineer - Observability to build the world's best AI cloud at Lambda. The position requires 8+ years of experience in software engineering and 5+ years of experience in Site Reliability Engineering practices. The successful candidate will deploy and operate observability platforms, automate their deployment and operation, and lead members of other engineering teams in developing solutions to their monitoring challenges.
Requirements
- 8+ years of experience in software engineering
- 3+ years of experience in Go
- 5+ years of experience in Site Reliability Engineering practices
- Proven understanding of observability tools and practices
- Experience with application deployment and monitoring using Kubernetes
- Strong experience with modern DevOps practices
- Experience collaborating across team boundaries to help engineering teams meet their observability needs
Benefits
- Generous cash & equity compensation
- Health, dental, and vision coverage
- Wellness and commuter stipends
- 401(k) plan with 2% company match
