Our client is a fast‑growing technology company building modern infrastructure for AI‑driven applications. To strengthen their cloud foundation, were looking for a Senior Software Engineer to join the Cloud Operations team — a group responsible for the reliability, scalability, and evolution of their cloud‑native platform.
This role is ideal for engineers who enjoy working at the intersection of software engineering, distributed systems, and cloud infrastructure. You'll design and operate core platform components, shape the future of their managed cloud offering, and help ensure the system runs smoothly at scale.
Your Role
Platform Engineering
- Design, build, and operate foundational cloud platform components
- Develop and maintain Kubernetes clusters, including custom operators
- Write production‑grade Go and Python code for platform services and automation
Reliability & Scalability
- Improve the stability, performance, and cost efficiency of cloud environments across AWS, GCP, and Azure
- Strengthen observability through metrics, logging, alerting, and monitoring frameworks
- Participate in incident response, root‑cause analysis, and long‑term system hardening
Automation & Operations
- Automate operational workflows, integrations, and infrastructure processes
- Reduce operational overhead (KTLO) through engineering‑driven improvements
- Collaborate closely with Platform, Regions & Clusters, and Feature teams to ensure seamless delivery
What You Bring
- 5–7+ years in platform engineering, infrastructure, or SRE‑focused roles
- Strong proficiency in Go and Python (expertise in one with willingness to use both)
- Hands‑on experience running Kubernetes in production
- Solid understanding of distributed systems and cloud‑native architectures
- Experience with major cloud providers (AWS, GCP, or Azure)
- Familiarity with CI/CD, infrastructure‑as‑code, and automation tooling
- Comfortable participating in on‑call rotations and managing production incidents
- Strong ownership mindset and clear communication skills
Nice to Have
- Experience building Kubernetes operators or control‑plane components
- Background in SaaS, database, or systems‑level products
- Exposure to Prometheus, Grafana, OpenTelemetry, or similar observability tools
- Knowledge of networking, load balancing, or service meshes
- Contributions to open‑source projects
What's on Offer
- Competitive salary, equity, and benefits
- Fully remote role with flexible working hours
- High‑impact position within a core cloud engineering team
- Opportunity to work on large‑scale Kubernetes and multi‑cloud systems
- Clear growth path
