DigitalOcean is seeking a highly skilled Staff Software Engineer to join its Customer Observability/Insights team. The role focuses on architecting, building, and maintaining the large-scale distributed systems that power DigitalOcean's customer-facing observability ecosystem. The ideal candidate will collaborate with cross-functional teams to deliver reliable and scalable solutions for cloud infrastructure monitoring and optimization. The company values a growth mindset, teamwork, and innovation.
Requirements
- 15+ years of relevant industry experience building and operating large-scale cloud services or distributed systems.
- Strong programming experience in Go (Golang) and a deep understanding of distributed systems fundamentals.
- Solid understanding of observability, monitoring, and alerting systems (e.g., Prometheus, Grafana).
- Experience with the OpenTelemetry (OTel) Collector.
- Proven experience designing and implementing scalable event-driven architectures using Kafka or similar technologies.
- Experience with gRPC, Terraform, and Ansible for service communication and infrastructure automation.
- Working knowledge of SQL, Redis, and NoSQL databases.
- Demonstrated ability to drive operational excellence and improve system reliability.
- Excellent communication and collaboration skills, especially with geographically distributed teams.
Benefits
- Competitive salary
- Equity compensation
- Learning and development opportunities
- Employee Assistance Program
- Local Employee Meetups
