Himalayas logo
ConcentrixCO

Lead Site Reliability Engineer

Concentrix Corporation is a leading global provider of technology and services solutions, helping brands enhance customer experiences.

Concentrix

Employee count: 5000+

India only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Job Title:

Lead Site Reliability Engineer

Job Description

As a Lead Site Reliability Engineer, you’ll play a strategic role in shaping and scaling our DevSecOps ecosystem. You’ll lead the design and implementation of automated CI/CD pipelines, enforce enterprise-grade security and compliance standards, and drive reliability across the entire software delivery lifecycle.

Partnering closely with development and operations teams, you’ll define best practices, optimize deployment workflows, and ensure our applications are resilient, observable, and continuously improving. Your expertise will be key to accelerating innovation while maintaining the highest levels of quality and performance.

Additionally, you will be expected to extensively use and lead the group to adopt AI within the SRE role and domain. The ideal candidate will have a "builder" mindset with strong software engineering skills that can a be "force-multiplier" - you will generate automation and platform code daily, and always looking to improve and build upon what can be imagined, leveraging the latest tools to deliver faster, more efficient, more effective, and more autonomous solutions.

About the Role

As a Lead Site Reliability Engineer, you will own the reliability and availability of our production systems. You will champion SRE principles across engineering teams — defining SLOs, managing error budgets, and leading a culture of blameless incident response. This is a hands-on leadership role where you will partner closely with product and engineering teams to balance the pace of innovation with the stability our customers depend on.

Key Responsibilities

Reliability Ownership

  • Define, implement, and own Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets across critical services.
  • Use error budget policies to drive data-informed conversations between engineering and product on release velocity vs. reliability trade-offs.
  • Conduct capacity planning and proactive risk assessments to prevent incidents before they occur.

Incident Management

  • Lead incident response as incident commander — coordinating teams, driving resolution, and maintaining clear stakeholder communication during outages.
  • Facilitate thorough, blameless postmortems and ensure action items are tracked, prioritized, and resolved.
  • Develop and continuously improve runbooks, escalation paths, and on-call practices to reduce MTTD and MTTR.

Observability & Monitoring

  • Design and maintain observability strategies using modern tooling (Prometheus, Grafana, OpenTelemetry, ELK) to ensure full visibility into system health.
  • Define intelligent alerting that is actionable and minimizes alert fatigue.
  • Drive adoption of distributed tracing and structured logging across services.

Toil Reduction & Automation

  • Identify and measure toil across the engineering organization and lead initiatives to eliminate it through automation.
  • Build internal tooling and self-service capabilities that improve developer productivity and system reliability.

Infrastructure & Platform Reliability

  • Collaborate with platform and infrastructure teams on cloud-native patterns for fault tolerance, auto-scaling, and disaster recovery.
  • Provide SRE input into CI/CD pipelines and deployment strategies (e.g., canary releases, blue/green deployments) to minimize production risk.
  • Manage infrastructure using IaC practices (Terraform or equivalent) with a focus on reliability and consistency.

Leadership & Culture

  • Mentor and grow junior SREs, fostering a culture of ownership, curiosity, and continuous improvement.
  • Act as an SRE advocate across engineering — embedding reliability thinking into the software development lifecycle.
  • Partner with key stakeholders to align SRE strategy with broader organizational goals.
  • Conduct regular 1:1s with direct reports and participate in team rituals.

AI Expectations

As with all engineers at our organization, this role requires an AI-native mindset. Specifically, you will be expected to:

  • Embed AI tools and practices into how we build and run our platform — deploying AI-powered capabilities and shipping real AI features into production.
  • Support engagement and solutioning for AI-powered offerings, translating technical capabilities into tangible business value.
  • Collaborate with cross-functional partners — including Product, Data, Security, and Legal — to ensure AI is delivered safely, effectively, and in compliance with relevant standards.

What We're Looking For

Must-Haves

  • 7+ years of experience in SRE, platform engineering, or a related discipline.
  • Proven experience defining and managing SLOs, SLIs, and error budgets in a production environment.
  • Strong incident management experience, including leading postmortems and driving reliability improvements.
  • Hands-on experience with observability tooling (Prometheus, Grafana, OpenTelemetry, or similar).
  • Solid understanding of cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes).
  • Proficiency in at least one scripting or programming language (Python, Go, or Bash).

Nice to Have

  • Experience with chaos engineering tools (e.g., Chaos Monkey, Gremlin, LitmusChaos).
  • Familiarity with IaC tooling such as Terraform or Pulumi.
  • Knowledge of DevSecOps practices and security tooling.
  • Experience with GitOps workflows and CI/CD pipelines.
  • Bilingual proficiency (English & Spanish).

Location:

IND Work-at-Home

Language Requirements:

Time Type:

Full time

If you are a California resident, by submitting your information, you acknowledge that you have read and have access to the Job Applicant Privacy Notice for California Residents

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Experience

7 years minimum

Location requirements

Hiring timezones

India +/- 0 hours

About Concentrix

Learn more about Concentrix and their company culture.

View company profile

Concentrix is a global technology and services leader that powers the world's best brands, today and into the future. We are a human-centered, tech-powered, and intelligence-fueled company. Every day, we design, build, and run fully integrated, end-to-end solutions at speed and scale across the entire enterprise. Our innovative culture focuses on delivering exceptional value for our clients and their customers through tailored solutions.

Established in 1983, Concentrix has transformed the way organizations engage with their customers, helping them to achieve their business goals through comprehensive services in the realm of customer experience, business process outsourcing, and digital services. By leveraging advanced analytics and data-driven insights, we empower brands to enhance their customer journeys, increase loyalty, and drive revenue growth. Our team is dedicated to driving continuous improvement and leveraging technology to make businesses more efficient and effective.

Claim this profileConcentrix logoCO

Concentrix

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

176 remote jobs at Concentrix

Explore the variety of open remote roles at Concentrix, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Concentrix

Remote companies like Concentrix

Find your next opportunity by exploring profiles of companies that are similar to Concentrix. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Concentrix hiring Lead Site Reliability Engineer • Remote (Work from Home) | Himalayas