About Intermedia
About the role: Design, build, operate, and support the company’s Kubernetes platform across on-premises and multiple cloud environments. This role ensures a reliable, standardized Kubernetes runtime that integrates with the Internal Developer Platform (IDP), enabling application teams to deploy and operate services independently. This is a generalist platform role with shared operational responsibility and a strong focus on automation, standardization, and reliability
Key Responsibilities
- Build, operate, and evolve Kubernetes clusters:
- On-premises (VM-based)
- Public cloud Kubernetes (AWS EKS, GCP GKE, Azure AKS, Oracle OKE)
- Own cluster lifecycle management:
- Provisioning, upgrades, patching, and decommissioning
- Develop and maintain Infrastructure as Code:
- Terraform modules
- Cluster bootstrap and configuration automation
- Implement and operate GitOps workflows for platform components
- Integrate Kubernetes capabilities into the Internal Developer Platform (IDP):
- Standard cluster and namespace patterns
- Approved ingress, secrets, and observability integrations
- Participate in a rotational on-call/support model for platform-level incidents
- Troubleshoot Kubernetes platform issues and improve reliability
- Create and maintain documentation, runbooks, and operational standards
- Collaborate with IDP, application support, infrastructure, and security team
Skills, Knowledge and Expertise
- Strong hands-on experience operating production Kubernetes environments
- Experience with on-premises, VM-based infrastructure
- Solid understanding of:
- Kubernetes internals
- Linux systems and networking
- Experience with Infrastructure as Code, preferably Terraform
- Experience working with multiple cloud environments (at least one deeply)
- Familiarity with Git-based workflows and GitOps tools
- Proven ability to troubleshoot distributed systems and production issues
- Experience with multiple managed Kubernetes services (EKS, GKE, AKS, OKE)
- Experience running Kubernetes on-prem (bare metal or VM-based)
- Exposure to:
- Observability stacks (Prometheus, Grafana, logging systems, Open Telemetry)
- Ingress and traffic management
- Secrets and certificate management
- Prior experience integrating infrastructure platforms into an Internal Developer Platform (IDP)
- Understanding of SRE concepts (SLOs, error budgets, incident response)
Soft Skills
- Ownership mindset: accountable for platform outcomes
- Generalist approach: comfortable across infrastructure, Kubernetes, and operations
- Strong problem solver: able to handle ambiguous and complex issues
- Clear communicator: explains technical topics effectively
- Collaborative: works well across teams and shares responsibility
- Calm under pressure: effective during incidents and outages
- Documentation-driven: values clarity and knowledge sharing
