Datavant is a data platform company seeking a Senior Site Reliability Engineer to join their high-performing team. The role involves managing and supporting 3rd party tools and platforms, increasing cloud efficiency, and leading a team of peers.
Requirements
- Platform Management: Expertise in managing Kubernetes (EKS), CI/CD tools (e.g., ArgoCD, GitHub Actions), and observability platforms (e.g., Datadog).
- Automation and IaC: Proficiency in automating platform deployment and maintenance tasks (e.g., cluster upgrades, CI/CD workflows).
- Third-Party Tools: Familiarity with integrating tools like Terraform, Elasticsearch, Kafka, Cassandra, and Databricks into the broader platform.
- Reliability Engineering: Knowledge of scaling, failover, and platform reliability best practices.
- Cross-Team Collaboration: Ability to work with Embedded Teams to meet workload-specific needs.
- Leadership: The ability to work with and influence engineers, product, and other SREs is critical.
Benefits
- Comprehensive health insurance
- 401(k) matching
- Flexible PTO
- Professional development opportunities
