We are looking for a Senior SRE to join our team in Brazil, with experience in monitoring Java backend applications, FinOps, and cloud computing. The candidate must have excellent communication and analytical skills to manage the backlog and propose effective solutions.
Requirements
- Experience as a Site Reliability Engineer (SRE) and with SRE metrics
- Experience monitoring Java backend applications
- Solid experience in FinOps and cost management practices in cloud environments
- Experience working with observability tools such as Datadog, Grafana, Prometheus, Thanos
- Experience with AWS-based platforms (ECS, EKS) and/or Kubernetes and Docker
- Experience with Linux
- Technical knowledge of GitHub, Jenkins, and Splunk (desired)
- Experience with CI/CD pipelines (GitHub Actions, AWS CodeBuild, AWS CodePipeline)
- Experience with Infrastructure as Code (Terraform)
- Analytical skills and problem-solving capacity, with a desire to learn and adapt in a dynamic environment
- Experience with performance testing and stress testing
- Understanding of chaos engineering (what to test, what to validate, which failures to inject into the application, such as taking down a database, and what happens to the application)
- Ability to troubleshoot problems efficiently and propose continuous improvements (Splunk, dashboards, tracing)
- Nice to have: knowledge of mobile application monitoring (Android and iOS); knowledge of Google Analytics and Firebase Crashlytics; knowledge of some programming languages such as Java, Shell Script, Golang, and Python
Benefits
- Health and dental plan
- Food and meal allowance
- Childcare allowance
- Extended parental leave
- Partnerships with gyms and health and well-being professionals via Wellhub (Gympass) and TotalPass
- Profit and results sharing (PLR)
- Life insurance
- Continuous learning platform (CI&T University)
- Discount club
- Free online platform dedicated to promoting physical and mental health and well-being
- Pregnancy and parenting education course
- Partnership with online course platforms
- Language learning platform
