We're hiring a Site Reliability Engineer to own and evolve deepset's cloud and customer infrastructure end to end, shaping how our AI platform is built, deployed, and scaled for our own cloud and for customers running it in their own environments.
Requirements
- 2-5 years of experience working with large-scale production infrastructure
- Fluent German language skills
- Experience with distributed or service-oriented architectures
- Hands-on expertise with: AWS, Kubernetes, CI/CD and GitOps (e.g. ArgoCD), Working knowledge of Infrastructure as Code (Terraform preferred)
- Solid troubleshooting skills - you can debug across systems, not just within one layer
- A pragmatic mindset: you balance speed, simplicity, and reliability
- Ownership and accountability - you take responsibility for systems end-to-end
- Ability to work independently while staying aligned with the team's goals
Benefits
- Remote-first setup with flexible hours & tech of your choice
- 30 days vacation + extra days for family sick leave
- Competitive salary & stock options for every team member
- Monthly sports & mental health support allowance with Oliva
- Annual learning & development budget
- Monthly team socials & in-person meetups
