This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer (SRE) in the United States.
In this role, you will be responsible for ensuring the reliability, performance, and scalability of critical infrastructure and cloud-based services. You will work across engineering and operations teams to deploy, monitor, and optimize systems, while leading initiatives to improve operational processes and automation. This position offers the opportunity to solve complex technical challenges, support high-availability environments, and contribute to building a robust multi-cloud platform. You will also provide technical guidance to internal teams and customers, ensuring smooth deployments, rapid incident resolution, and top-tier service quality. The role requires a hands-on approach, strong problem-solving skills, and a customer-focused mindset in a fast-paced, collaborative environment.
Accountabilities
- Deploy, maintain, and optimize software for cloud and on-premises environments.
- Respond to and resolve system incidents efficiently to minimize downtime and user impact.
- Collaborate with engineers to identify root causes and implement long-term solutions.
- Develop and maintain automated operational tools and scripts to improve efficiency.
- Monitor system health and performance proactively, implementing improvements where needed.
- Manage on-call rotations and document post-incident analyses for continuous learning.
- Provide tier 2/3 technical support, including troubleshooting, onboarding, and best-practice guidance for customers.
- Create and maintain runbooks, technical documentation, and knowledge base articles.
- Partner with cross-functional teams to ensure smooth customer experiences and rapid issue resolution.
Requirements
- Bachelor’s degree in Computer Science or a related field.
- 3+ years of experience in Site Reliability Engineering.
- 2+ years experience with cloud platforms and automation tools, particularly AWS.
- Strong expertise in Kubernetes, Linux, AWS networking (VPC), Terraform, and GitOps deployment models.
- Experience with monitoring and alerting tools such as Prometheus and Grafana; Bazel and Helm experience is a plus.
- Familiarity with software configuration best practices.
- Ability to work independently and manage multiple priorities in a fast-paced environment.
- Excellent communication skills, capable of explaining technical concepts to technical and non-technical audiences.
- Strong customer service orientation, with patience and empathy for resolving complex issues.
Preferred Qualifications:
- Experience leading small technical teams and providing technical leadership.
- Exposure to multi-cloud and distributed systems environments.
- Hands-on approach with high intellectual curiosity and low ego.
Benefits
- Competitive base salary, equity, and bonus potential.
- Comprehensive healthcare including medical, dental, and vision coverage.
- Flexible remote work opportunities and a collaborative work environment.
- Opportunities for career growth, mentorship, and technical leadership development.
- Employee-friendly policies supporting work-life balance and professional development.
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching. When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias—focusing solely on your fit for the role. Once the shortlist is completed, it is shared directly with the company. The final decision and next steps (such as interviews or additional assessments) are made by their internal hiring team.
