This is a remote position.
Responsibilities:
- Develop and maintain automation tools for building, testing, and deploying software applications and services.
- Deploy all web, mobile, and API applications in production, plan their releases, ensure consistency, and follow up on testing.
- Work closely with developers, QA, and product teams to ensure timely and high-quality releases.
- Develop and maintain monitoring and alerting systems to ensure high availability and performance of applications and services.
- Monitor metrics and logs from all infrastructure and app components, writing integrations if necessary, and creating dashboards to observe the production systems.
- Create alert triggers and monitor performance for all components to identify bottlenecks and modify auto-scaling rules if necessary.
- Upgrade infrastructure resources and act on cloud vendor recommendations, such as rotating secrets and upgrading databases and machine clusters.
- Continuously evaluate the cost of cloud services and ensure we are not incurring unnecessary expenses.
- Troubleshoot and resolve issues related to infrastructure, deployment, and application performance.
- Work with third-party vendors to integrate with their services for observability, security, monitoring, and error reporting.
- Oversee the implementation, operation, and monitoring of information security tools and processes in customer production environments.
- Conduct IT risk assessments, document identified threats, and maintain the risk register.
- Communicate information security risks to executive leadership.
- Report information security risks annually to company leadership and gain approval to bring risks to acceptable levels.
- Develop disaster recovery plans and participate in their execution during disaster recovery events.
Requirements:
- Bachelor's degree in Computer Science or a related field.
- At least 5 years of experience in a DevSecOps/SRE or related role.
- Strong experience in deploying web, mobile, and API applications in production.
- Strong experience with monitoring and observability tools such as New Relic, Datadog, or Prometheus/Grafana.
- Strong experience with CI/CD pipelines and associated tools such as Azure Pipelines, Jenkins, or CircleCI.
- Strong experience with containerization technologies such as Docker, Kubernetes, and Helm.
- Experience with cloud infrastructure such as AWS, Azure, or GCP.
- Experience with scripting languages such as Bash.
- Experience with incident response and disaster recovery planning.
- Excellent communication and collaboration skills.
