Second Front Systems

Site Reliability Engineer - Observability

Second Front Systems (2F) is a public-benefit software company that fast-tracks government access to disruptive, commercially-proven software-as-a-service (SaaS) applications for national security missions.

Employee count: 51-200

United States only

About the Role

Second Front Systems' (2F) Product team is seeking a highly skilled and motivated Senior Site Reliability Engineer to join our Observability team. We are a small team working to accelerate the deployment of emerging technology into national security use cases. We are seeking technical professionals who want to operate on the front lines of an exciting and disruptive mission.
As a Senior SRE for Second Front Systems, you'll be responsible for deploying, maintaining, and scaling our observability infrastructure across multiple DoD networks. You'll work with Kubernetes-based platforms and BigBang charts from DoD Platform One, and you'll build automation to make our monitoring stack easier to deploy for new customers. You'll be empowered to collaborate with others to implement infrastructure that delivers unique capabilities for our commercial and government customers, including the Department of Defense.
The Observability team is looking for a strong SRE with deep DevSecOps and Kubernetes experience: someone who has deployed and maintained monitoring infrastructure at scale, with an eye for security in highly regulated environments. Experience with DoD software deployments, Platform One, and single-tenant architectures is highly valued.
We are a fast-growing entrepreneurial team working at the convergence of technology and national security. If this type of effort interests you, come join us!
Note: This position requires U.S. citizenship due to government contract requirements.

What You’ll Do

  • Deploy and maintain the observability stack (Grafana, Mimir, Prometheus) across multiple customer clusters and DoD networks
  • Build Helm chart abstractions and automation to streamline monitoring deployments for new customers
  • Troubleshoot and debug complex Kubernetes issues, networking problems, and monitoring stack failures
  • Configure and maintain BigBang charts and DoD Platform One integrations
  • Design and implement infrastructure automation using tools like Pulumi, ArgoCD, and Flux
  • Work with Istio service mesh and Keycloak for authentication in secure environments
  • Monitor and optimize performance of monitoring infrastructure across multiple environments
  • Collaborate with security teams to ensure compliance with NIST requirements and DoD standards
  • Participate in on-call rotation and incident response for production environments

Skills You’ll Bring to Our Team

  • 5+ years of Site Reliability Engineering or DevOps experience
  • Deep experience with Kubernetes administration, troubleshooting, and scaling
  • Hands-on experience deploying and maintaining observability tools (Prometheus, Grafana, Mimir/Cortex)
  • Strong understanding of Helm charts, GitOps practices, and CNCF tooling
  • Experience with service mesh technologies (Istio preferred)
  • Proven ability to debug complex distributed systems and networking issues
  • Understanding of authentication systems and security in regulated environments
  • Ability to work independently and collaborate with team members in a remote environment

Preferred Qualifications

  • Active security clearance or ability to obtain a Secret-level security clearance
  • Previous experience with DoD software deployments and Platform One
  • Experience with BigBang charts and Iron Bank containers
  • Experience working in national security or highly regulated environments
  • Familiarity with compliance frameworks (NIST, FedRAMP, etc.)
  • Experience with infrastructure as code (Pulumi, Terraform)

Technologies We Use

  • Observability: Grafana stack, Prometheus, custom alerting tools
  • Kubernetes: Helm, ArgoCD, Flux, Tekton, BigBang charts
  • Security: Istio, Keycloak, Kyverno
  • Infrastructure: AWS/GCP/Azure, Pulumi, Git/GitLab
  • Languages: YAML, Bash, Go

About the job

Job type: Full Time

Experience level: Senior

Hiring timezones: United States +/- 0 hours

About Second Front Systems

Second Front Systems (2F) is a public-benefit software company dedicated to accelerating the delivery of emerging technologies to U.S. and allied government agencies for national security missions. Founded by former U.S. Marines who witnessed firsthand the critical need for rapid technology adoption in defense, the company aims to bridge the gap between commercial innovation and government requirements. 2F's core mission is to equip defense and national security professionals with the cutting-edge tools necessary to maintain a strategic advantage. The company's flagship product, Game Warden, is a DevSecOps platform designed to streamline the often lengthy and complex process of software accreditation and deployment within government environments, including highly secure and classified networks. This platform enables commercial software-as-a-service (SaaS) providers to configure, secure, and deploy their applications to Department of Defense (DoD) customers with a fully managed and compliant production environment, significantly reducing the time and cost associated with traditional Authority to Operate (ATO) processes.

Second Front Systems positions itself as a crucial enabler for both commercial technology companies seeking to enter the government market and government agencies needing to rapidly leverage innovative software solutions. The company emphasizes its commitment to public benefit, guiding its operations with principles such as transparent pricing to ensure its work centers on the public good. By simplifying and accelerating every step of the software development, compliance, delivery, and operations process, 2F empowers its clients to focus on their core missions. Their offerings include the 2F Suite, which encompasses 2F Workshop for secure development, 2F Game Warden for streamlined compliance and accreditation, and 2F Frontier for deploying software to edge devices in remote or disconnected environments. Through these solutions, Second Front Systems plays a vital role in modernizing government technology infrastructure and enhancing national security capabilities by ensuring that the latest commercial software innovations can be securely and efficiently utilized by those on the frontlines.
