We're looking for a Site Reliability Engineer to join our team at ScalePad, a market-leading software-as-a-service (SaaS) company. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, scalability, and efficiency of our infrastructure and development platforms. You will support developer experience, automate operational tasks, and optimize system performance to maintain high availability and seamless deployments.
Requirements
- Strong proficiency in system operations, observability, and infrastructure monitoring
- Full understanding of AWS offerings, including core compute, networking, storage, and IAM
- Experience with Infrastructure as Code (IaC) tools such as Terraform
- Proficiency in scripting and automation using Python, Bash, or equivalent languages
- Basic knowledge of Java, Go, and Python is a strong plus
- Knowledge of CI/CD pipelines and best practices for continuous integration and delivery
- Experience with containerization and orchestration technologies such as Kubernetes and Docker
- Strong understanding of SLOs, SLAs, and incident management best practices
Benefits
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Flexible Time Off
- 100% employer-paid medical and dental coverage
- RRSP matching after one year of employment
- Monthly stipend to help offset the costs of hybrid work
