Deel is the all-in-one payroll and HR platform for global teams. Our vision is to unlock global opportunity for every person, team, and business. We're not just building software; we're creating the infrastructure for the future of work, enabling a more diverse and inclusive global economy.
Requirements
- Design, implement, and maintain scalable observability solutions for cloud-native environments
- Own monitoring across AWS and Kubernetes (EKS) environments, covering clusters and workloads
- Operate and maintain self-hosted monitoring stacks (e.g., Prometheus, Grafana, Mimir, Loki, Tempo)
- Manage and optimize DataDog (metrics, logs, APM, alerts, cost monitoring)
- Improve observability architecture to support high availability, scalability, and fault tolerance
- Implement monitoring cost optimization strategies (log/trace sampling, retention policies, storage optimization)
- Automate observability infrastructure using Infrastructure as Code (Terraform, Helm, etc.)
- Integrate monitoring and alerting into CI/CD pipelines (GitHub Actions is an advantage)
- Support capacity planning and performance tuning initiatives
- Collaborate with DevOps, SRE, and Engineering teams to embed observability best practices
- Drive continuous improvement of monitoring standards, tooling, and reliability practices
Benefits
- Stock grant opportunities dependent on your role, employment status, and location
- Additional perks and benefits based on your employment status and country
- The flexibility of remote work, including optional WeWork access
