Nikhil Kandi
@nikhilkandi
Site Reliability Engineer ensuring 24/7 clinical system availability, automation, and observability.
What I'm looking for
I am a results-driven Site Reliability Engineer with deep experience maintaining 24/7 clinical systems, cloud infrastructure, and monitoring solutions for healthcare platforms. I focus on uptime, compliance, security, and seamless patient care while collaborating closely with cross-functional and clinical teams.
I have implemented observability stacks (Splunk, Dynatrace, Grafana, Honeycomb), deployed applications on Kubernetes with Docker and Helm, and automated operational workflows using Azure Functions, PowerShell, and Ansible. I have led on-call rotations, conducted root cause analysis during critical outages, and overseen clinical system upgrades while maintaining regulatory compliance and business continuity.
I continuously improve reliability through CI/CD pipelines, infrastructure automation, cost optimization in AWS/Azure, and runbook/process documentation to reduce incident resolution time and upskill teams.
Experience
Work history, roles, and key accomplishments
Site Reliability Engineer
Data Solutions Inc
Dec 2022 - Present (2 years 10 months)
Maintained mission-critical clinical systems (EHR, CDSS, claims) with 24/7 availability. Implemented observability (Splunk, Dynatrace, Grafana) and automated recovery processes to reduce outage impact and improve JVM performance.
AWS DevOps Engineer
Data Solutions Inc
Sep 2020 - Dec 2022 (2 years 3 months)
Managed AWS infrastructure across multiple environments, automated deployments and cost optimizations, and designed high-availability EC2 architectures with secure S3/Glacier backups to improve reliability.
Build & Release Engineer
Data Solutions Inc
Apr 2019 - Sep 2020 (1 year 5 months)
Developed CI/CD pipelines and rollback strategies to ensure consistent, timely releases, and reduced deployment risk through automation and coordinated release processes.
Software Engineer
Data Solutions Inc
Dec 2017 - Apr 2019 (1 year 4 months)
Built backend Java Spring modules, authored unit tests with JUnit, and managed CI jobs and Control-M scheduling to support production stability and feature delivery.
Process Associate
Amazon India
Aug 2015 - Nov 2015 (3 months)
Provided backend operational support and stakeholder service, resolving process inquiries and ensuring SLA adherence through timely escalation and cross-functional coordination.
Process Developer
Genpact
Jul 2014 - Jul 2015 (1 year)
Designed and developed UI features using HTML/CSS/JavaScript within Agile teams, contributing to web solutions and promoting continuous improvement practices.
Education
Degrees, certifications, and relevant coursework
Southern Arkansas University
Master of Science, Computer Science
2016 - 2017
Completed a Master of Science in Computer Science with coursework and projects in advanced computing and systems administration.
Jawaharlal Nehru Technological University, Hyderabad
Bachelor of Science, Computer Science
2010 - 2014
Earned a Bachelor's degree in Computer Science focusing on software development and foundational computing concepts.
