The selected candidate will be responsible for engineering and managing server infrastructure in a large-scale, multi-datacenter environment, developing solutions to enable internal customers' success, and ensuring the environment runs optimally.
Requirements
- Actively design, implement and support HPC compute resources, and related infrastructure
- Assist application teams with optimizing workflows for the environment
- Develop and support automation and scripts (Python, Perl, bash, Ansible) and the servers related to those automations (Satellite, Ansible Automation Platform)
- Monitor and troubleshoot system failures (occasional on-call)
