This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Datacenter Hardware Operations Technician, AI Compute Infrastructure - Stargate in the United States.
This role is a key on-site position supporting advanced AI compute infrastructure within a large-scale datacenter environment. You will collaborate closely with partner teams to ensure that high-density compute hardware operates at peak performance. Your work will include coordinating maintenance, repairs, and lifecycle activities while translating software-detected issues into actionable on-site solutions. The position emphasizes problem-solving, technical alignment, and documentation of best practices to guide future operations. You will play a critical role in maintaining system reliability, operational efficiency, and scalability, while shaping standards for subsequent AI infrastructure deployments.
Accountabilities
In this role, you will:
- Serve as the primary on-site hardware contact, collaborating with partner teams and vendors to plan and coordinate maintenance, repairs, and lifecycle activities.
- Ensure that all hardware work supports compute requirements, quality targets, and operational goals.
- Track hardware performance trends and recommend improvements to optimize reliability and efficiency.
- Translate software-detected issues into actionable on-site hardware interventions in partnership with engineering teams.
- Prepare documentation, runbooks, and operational playbooks for current and future datacenter projects.
- Coordinate spare parts, schedules, and issue escalation to minimize downtime.
- Provide technical guidance to partner personnel while respecting their operational responsibilities.
Requirements
Candidates should have:
- 7+ years of experience in datacenter hardware operations, hardware engineering, or large-scale server maintenance, with at least 2 years in a senior or lead technician capacity.
- Deep knowledge of high-density server hardware, including x86 servers, GPUs, storage devices, and power/cooling systems.
- Strong problem-solving skills, with the ability to diagnose complex hardware issues and coordinate repairs.
- Proven ability to collaborate effectively across partner teams, vendors, and internal stakeholders without direct management authority.
- Willingness to be on-site full-time at a partner-operated datacenter campus in Abilene, Texas.
- Excellent communication skills and attention to detail.
Preferred Skills:
- Familiarity with cluster management and monitoring tools (IPMI, BMC, Prometheus, Nagios).
- Experience with GPU-accelerated compute clusters or high-performance computing hardware.
- Knowledge of Linux/Unix system administration and command-line diagnostic tools.
- Relevant certifications such as CompTIA Server+, OEM hardware certifications, or equivalent.
- Experience applying Environmental Health and Safety best practices in critical operations environments.
Benefits
This role offers:
- Competitive base salary of $144K – $228K, with equity and performance-based bonuses.
- Comprehensive medical, dental, and vision coverage for you and your family.
- Health Savings Accounts, FSA, Dependent Care FSA, and commuter pre-tax accounts.
- 401(k) retirement plan with employer match.
- Paid parental leave, medical leave, caregiver leave, and flexible PTO.
- 13+ paid holidays and company-wide office closures for focus and recharge.
- Mental health and wellness support, plus employer-paid life and disability coverage.
- Annual learning and development stipend for professional growth.
- Relocation support and additional fringe benefits such as charitable donation matching.
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.