This is a remote position.
- Coordinate and collaborate with multiple teams to investigate problems and provide root cause analysis during major and critical outages.
- Carry out deployments as part of Business Flexible Releases and Infrastructure Maintenance Releases to implement application/infra enhancements for Web/Cloud hosted applications.
- Drive and improve performance, scalability and availability of application resources.
- Implement the organization's continuous delivery mechanism more effectively.
- Work with internal customers to understand requirements and implement configuration changes. Should be able to debug and understand logs.
- Prepare and maintain clear documentation of the organization’s infrastructure and architecture.
- Optimize scaling/performance of application resources.
- Automate manual processes using a suitable scripting language.
- Implement preventative and corrective measures.
Requirements
- Bachelor’s degree in Computer Science, Computer Engineering or a related field, plus 4-8 years of progressive professional experience.
- Minimum of three years of hands-on experience with production AWS systems.
- Experience with GitHub, Jenkins and CI/CD.
- Hands-on experience with EC2, Docker and container services, AWS Lambda, Step Functions, AWS CLI, CloudFormation, Terraform, Route53, CloudFront.
- Experience in deployment and configuration management tools such as Chef or Puppet.
- Experience in supporting Apache, Tomcat, Jetty or similar servers.
- Experience working with APIs and the Postman tool.
- Ability to write and manage CloudFormation stacks.
- Effective use of Auto Scaling, monitoring and logging.
- Knowledge of AWS security features and best practices.
- Experience in design of self-healing and fault-tolerant services.
- Knowledge of techniques and strategies for maintaining high availability.
- Strong system administration experience (Linux/Unix or Windows).
- Experience in one or more scripting languages (Python, Shell, Perl, Ruby) and an understanding of web services.
- Experience with multi-tier architectures (load balancers, caching, web servers, application servers, databases, and networking) and with templates and other configurable items that enable automation.
- Understanding of monitoring techniques in a dynamic environment.
- Preferred: AWS Certified SysOps Administrator – Associate.
- Strong communication skills. Ability to communicate at all levels of the organization.
- Ability to work comfortably in a fast-paced, dynamic and event-driven environment.
- Availability for shift work.
- Approachable/Personable/Team Player.
