Site Reliability Engineer
Company
Orcrist is building a next generation data intelligence platform using cutting-edge technologies. We're handling petabyte-scale data with sub-second queries. Our product is a Kubernetes-based platform delivered as B2B SaaS or as a self-hosted on-prem solution, including air-gapped deployments. We enable customers across defense, law enforcement, and enterprise to turn mission-critical data into actionable intelligence.
Role
Join our team to deploy and operate our data intelligence platform in agency-controlled environments. You’ll build and run secure, highly available Kubernetes clusters on-prem and in hybrid setups, act as a forward-deployed SRE during incidents and upgrades, and ensure that our systems meet stringent privacy, audit, and legal evidence requirements for law-enforcement use cases.
What you'll do
- Deploy, install, and manage Kubernetes clusters for OIP in on-prem and hybrid environments.
- Configure and maintain GitOps workflows, Helm/Kustomize, and artifact registries in restricted networks.
- Design, operate, and lead incident response for the observability stack (Prometheus, Grafana) and enforce disaster recovery.
- Harden environments with network segmentation, mTLS, IAM, and vulnerability remediation.
- Produce compliance documentation, runbooks, and train agency and Orcrist teams on operations.
About You
- 5+ years SRE/DevOps experience, on-call ownership, and operating production systems.
- Deep hands-on experience with Kubernetes (on-prem/hybrid), GitOps (Argo CD/Flux), and infrastructure automation (Ansible, Terraform).
- Strong background in observability (Prometheus, Grafana, Loki) and complex incident response/troubleshooting.
- Proficiency in German and English (C1+), authorized to work in Germany, with willingness to travel (20–30%).
Nice‑to‑haves
- Deep knowledge of law-enforcement or wider public-sector IT and governance structures.
- Relevant certifications such as CKA/CKAD, ISO 27001 Lead Implementer, CISSP, or GDPR practitioner.
- Proven experience integrating with key enterprise systems, including Identity and Access Management (SAML, LDAP), Security Information and Event Management (SIEM) platforms (Splunk/Elastic), and forensic logging or digital-evidence systems.
- Familiarity with digital evidence workflows and supporting judicial proceedings.
- Previous experience managing sensitive environments, such as air-gapped systems, sensitive investigative tooling, or mission-critical public-safety systems.
What We Offer
- High-impact role ensuring reliable, lawful operations for agencies protecting communities.
- Modern architecture & stack.
- Remote-first in Germany with occasional team events in Berlin.
- Home office budget and great equipment.
- 30 days vacation.
- Direct impact on critical missions across private and public-sector customers.
