Founding Site Reliability Engineer (Remote - US)

Jobgether
United States only

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Founding Site Reliability Engineer in the United States.

This is a unique opportunity to join a rapidly growing AI company as the first SRE hire in the San Francisco office. In this role, you will define and scale the Site Reliability Engineering discipline, ensuring the platform is reliable, secure, and performant at enterprise scale. You will work closely with engineering leads, product teams, and company founders to build infrastructure, establish best practices, and drive the organization’s reliability culture. The role involves hands-on system design, automation, and observability work, while providing leadership and strategic input to shape long-term operational excellence. Ideal candidates are technically strong, highly collaborative, and motivated by building world-class systems from the ground up.

Accountabilities

  • Establish and scale the SRE discipline, including best practices, tooling, and culture.
  • Ensure >99.9% uptime of production systems and maintain global platform reliability.
  • Architect, automate, and manage AWS infrastructure using Terraform, CI/CD pipelines, and Infrastructure as Code.
  • Design and implement observability systems across microservices, APIs, and vector workloads, including metrics, tracing, and logging.
  • Lead incident management, reducing MTTR through runbooks, alerts, and postmortems.
  • Collaborate with engineering teams to embed reliability principles into the software development lifecycle.
  • Influence organizational strategy and culture as a founding voice in the engineering team.

Requirements

  • 5+ years of experience in SRE, DevOps, or infrastructure roles, ideally in enterprise SaaS environments.
  • Expertise in AWS services (EC2, ECS/EKS, Lambda, RDS, VPC, IAM).
  • Proven experience with Infrastructure as Code (Terraform, Kubernetes/EKS, CDK, or CloudFormation).
  • Hands-on experience with observability and monitoring stacks (CloudWatch, Grafana, Prometheus, Datadog).
  • Experience in incident management, on-call responsibilities, and postmortem-driven reliability improvements.
  • Bonus: exposure to AI/ML platforms, data-heavy systems, or multi-agent workloads.
  • Strong problem-solving, communication, and collaboration skills.

Benefits

  • Competitive salary and equity options.
  • Health, dental, and vision insurance, including dependent coverage.
  • Paid time off and holidays, with parental leave benefits.
  • 401(k) plan and other financial perks.
  • Opportunity to shape company culture and systems at a high-growth AI startup.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

When you apply, your profile goes through our AI-powered screening process, designed to identify top talent efficiently and fairly.

🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.

This process is transparent, skills-based, and free of bias, focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.

Thank you for your interest!

About the job

Job type: Full Time
Experience level: Senior
Hiring timezones: United States +/- 0 hours