At GDIT, we're looking for an HPC Systems Admin to join our team and support the National Oceanic and Atmospheric Administration (NOAA) Weather and Climate Operational Supercomputer System (WCOSS). This is a remote position with working hours aligned to the Eastern time zone.
Requirements
- Bachelor’s degree or equivalent and 10+ years of experience with Linux-based HPC systems operations
- Experience working in a 24x7 operational environment
- Linux system administration experience (e.g., SLES, Red Hat or CentOS)
- Batch management/scheduling systems (Slurm, PBS Pro, LSF) experience, PBS Pro preferred
- Parallel filesystem configuration and monitoring experience (e.g., Lustre, NFS), Lustre preferred
- High-speed network interconnect configuration and monitoring experience (InfiniBand, OPA, Ethernet, Slingshot)
- Programming or scripting in at least two languages (e.g., Bash, Perl, Python, C)
- Strong writing skills for technical documents, system procedures, user wikis and FAQs
- Ability to work both independently and as part of a team
- Knowledge of or experience managing computer systems under Service Level Agreements (SLAs)
Benefits
- medical plan options
- dental plan options
- vision plan
- 401(k) plan
- full flex work weeks
- paid time off plans
- short- and long-term disability benefits
- life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance