We're building the "Datadog" of the LLM engineering category. We're a small, engineering-heavy team in Berlin and San Francisco, and we're hiring engineers in EU timezones. We're looking for a strong infrastructure or SRE engineer to own uptime, performance, and cost efficiency across our entire cloud footprint.
Requirements
- Strong infrastructure or SRE background, with experience operating production workloads on AWS or a comparable hyperscale cloud provider.
- Comfortable with container orchestration — Kubernetes and/or ECS, Helm charts, Docker.
- Experience with infrastructure-as-code (Terraform, Pulumi, CloudFormation, or similar).
- Strong monitoring and observability instincts — you've built dashboards and alerts that actually caught problems.
- Interest in open source software and genuine enjoyment in helping users debug their self-hosted deployments.
Benefits
- Competitive salary
- Generous Paid Time Off
- 401k Matching
- Retirement Plan
- Visa Sponsorship
- Four Day Work Week
- Generous Parental Leave
- Tuition Reimbursement
- Relocation Assistance
