HimalayasHimalayas logo
Sushma KashyapSK
Looking for a job

Sushma Kashyap

@sushmakashyap

Senior Cloud Operations Engineer specializing in Azure reliability, AKS, and CI/CD at scale.

India
Message

What I'm looking for

I’m looking for a role where I can own cloud operations and reliability on Azure—running incident management, monitoring, and CI/CD for scalable containerized workloads—while continuously improving runbooks, automation, and SLA-focused production stability.

I’m a Senior Cloud Operations Engineer with over 6 years of experience in Microsoft Azure and modern cloud infrastructure environments. I bring hands-on knowledge of DevOps technologies like Kubernetes, Ansible, Kafka, Jenkins, Git, and Terraform, with shell and Python scripting concepts.

In my current role as a Senior CloudOps Engineer at OneTrust, I administer and monitor Microsoft Azure cloud infrastructure supporting 60+ production workloads, sustaining 99.9%+ platform availability for enterprise privacy and compliance applications. I troubleshoot infrastructure issues across 60+ AKS Kubernetes clusters and Kafka messaging systems, resolving 30–40 production incidents per month while minimizing service disruption. I also manage configuration across 1000+ cloud resources using Terraform and Ansible to improve environment consistency across development and production.

I support scalable cloud systems through CI/CD pipelines and deployments for 50+ microservices using Jenkins and Git, enabling frequent releases without compromising production stability. I monitor performance and operational alerts across 200+ metrics dashboards using Prometheus and Grafana, and I focus on incident management with SLA response targets under 30 minutes for critical incidents.

I strengthen reliability through documentation and continuous improvement—developing and maintaining 40+ technical documents and troubleshooting playbooks in Jira and Confluence to reduce recurring operational issues. Previously, as an Engineer | Azure Administrator at Mindtree Ltd., I provisioned and managed Azure infrastructure across 80+ virtual machines, implemented monitoring with LogicMonitor and New Relic, and supported incident and problem management using Jira and Slack.

Experience

Work history, roles, and key accomplishments

OneTrust logoON

Senior CloudOps Engineer

Dec 2021 - Mar 2026 (4 years 3 months)

Administered and monitored Microsoft Azure infrastructure supporting 60+ production workloads, maintaining 99.9%+ availability. Troubleshot issues across 60+ AKS clusters and Kafka systems, resolving 30–40 production incidents per month, while managing configuration for 1000+ resources with Terraform and Ansible and monitoring 200+ metrics dashboards using Prometheus and Grafana.

ML

Azure Administrator

Mindtree Ltd.

Jul 2019 - Dec 2021 (2 years 5 months)

Provisioned and managed Azure infrastructure across 80+ virtual machines and networking components supporting enterprise client applications. Implemented monitoring and alerting with LogicMonitor and New Relic (100+ resources), resolved 20+ incidents per month, and supported incident/problem management using Jira and Slack with root-cause analysis.

Education

Degrees, certifications, and relevant coursework

GV

Global Academy of Technology (VTU)

Bachelor of Engineering, Computer Science and Engineering

2019 -

B.E. in Computer Science and Engineering at Global Academy of Technology (VTU), Bangalore, starting July 2019.

JC

Jain College

PUC, Pre-University Education

2014 -

PUC (Karnataka State Board) at Jain College, Jayanagar, Bangalore, starting July 2014.

CS

Carmel School

SSLC, Secondary Education

2012 -

SSLC (Karnataka State Board) at Carmel School, Padmanabhanagar, Bangalore, completed May 2012.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan