Do you have the passion to architect and lead the next generation of public cloud infrastructure?
Would you like to lead modernization initiatives while building a public cloud platform from scratch?
Join our IaaS Site Reliability Engineering (SRE) team.
We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a public cloud from the ground up.
Partner with the best
As Senior II SRE, you will provide technical leadership, strategic direction across Linux-based infrastructure. You will architect solutions for scale, drive modernization in critical services like DNS, PKI, and secure access. You will be trusted to solve the most complex problems, set technical direction across multiple systems. Your leadership will shape the reliability, automation, observability of new public cloud platform.
As a Senior II Site Reliability Engineer, you will be responsible for:
- Architecting Linux infrastructure and core platform services for reliability and scale
- Influencing architectural decisions to ensure performance, reliability, and cost-efficiency at scale
- Driving automation frameworks with Python, Terraform, and Ansible/Salt and defining observability strategies and enforcing metrics and SLO's.
- Leading CI/CD best practices: GitOps workflows, progressive rollouts, automated canary analysis, and rollback strategies
- Modernizing services including DNS, PKI, secure access, time sync, and package repositories
- Leading OS lifecycle management and large-scale migrations and operating, evolving service discovery, service mesh & diagnostics platforms.
- Leading high-impact incident response and ensuring long-term reliability improvements
- Mentoring engineers across levels and setting technical standards, providing strategic leadership for the design & delivery.
Do what you love
To be successful in this role you will:
- Have 10+ years of Linux, SRE, or infrastructure engineering experience
- Possess deep expertise in Linux internals, troubleshooting, and performance tuning
- Have advanced Python programming skills
- Be proven with Terraform and Ansible/Salt at scale
- Have led infrastructure modernization initiatives across OS, virtualization, and cloud-native layers
- Demonstrate the ability to mentor, influence, and provide technical leadership at a global scale
Apply only if you have the skills mentioned above. Without hands-on Linux expertise, an automation mindset, and curiosity, this role will not be a fit.
Work in a way that works for you
Wherever you are in India is your office. We are a 100% remote-first team. We will support you, take care of you, and give you unmatched flexibility, along with employee-friendly perks.
Join us
This is a once-in-a-career opportunity to architect and deliver a new public cloud platform, driving reliability at internet scale. If you are the best at what you do, passionate about solving problems, and curious to grow, this is your role. If you prefer excuses over solutions, this is not the opportunity for you.
Career Path
This role is part of a clear career path in our SRE team: SRE I → SRE II → Senior → Senior II → Principal → Senior Principal. Each step builds deeper expertise in Linux, automation, reliability, and leadership. At the Senior II level, you are a trusted technical authority, and the natural progression from here is into Principal and Senior Principal roles, where you will help set strategy, guide large-scale initiatives, and influence the future of our cloud platform.
Learn more
Not sure if this job is the right match for you or want to learn more about the job before you apply? Schedule a 15-minute exploratory call with the Recruiter and they would be happy to share more details.