Himalayas
@akhiljanardanan
SRE-focused IT Operations Engineer driving observability, incident response, and reliability improvements.
I am an analytical, proactive IT Operations Engineer with 6+ years’ experience ensuring service reliability across distributed systems. I specialize in observability, incident response, and automation, using tools like Datadog, Splunk, SignalFx, CAL, IRIS, and OpenTelemetry to drive operational excellence.
I have a proven record of reducing MTTR by 30%, cutting false alerts by 40%, and restoring services within SLA during high-severity incidents. I lead major incident handling, RCA, and cross-functional coordination, and I build dashboards, automated workflows, and documentation to improve uptime and transparency.
I bring strong communication and stakeholder management skills, an SRE-oriented mindset, and a continuous-improvement approach to incident prevention and performance engineering. I seek roles where I can scale observability, automate operations, and improve service availability.

Work history, roles, and key accomplishments
Coordinate and lead resolution of high-severity incidents for global payment services, leveraging Datadog, Splunk, and OpenTelemetry to isolate root causes within minutes and reduce business impact. Implemented automated escalation and alerting workflows that lowered MTTR and improved SLA adherence.
Pearl Group Services Limited
Jun 2019 - Aug 2023 (4 years 2 months)
Maintained 24x7 infrastructure availability and integrated CAL, IRIS, and OpenTelemetry into monitoring stacks to improve observability and reduce alert noise by 40%. Designed Splunk dashboards and collaborated with DevOps to align operations with SRE practices.
Degrees, certifications, and relevant coursework
Master of Business Administration, Production & Operations Management; Finance
2021 - 2023
Completed a Master of Business Administration with concentrations in Production & Operations Management and Finance, focusing on operations, process improvement, and managerial finance.
Bachelor of Engineering, Information Technology
2015 - 2019
Earned a Bachelor of Engineering in Information Technology with coursework in software, systems, and IT fundamentals supporting infrastructure and operations roles.
Software and tools used professionally
You can contact Akhil and 90k+ other talented remote workers on Himalayas.
Rajesh Kumar Jena
Associate, JP Morgan Chase
Mayank Shrivastava
Site Reliability Engineer, IndusInd Bank
Durgansh Taneja
Grafana Engineer, Samsung SDS India
sagar sidbatte
Systems Engineer, Thomson Reuters Corporation
Srikanta Sahu
Senior Site Reliability Engineer, Morgan Stanley
Ankit Vishwakarma
Monitoring Engineer, Shift Ahead Technology
Pranjal Rathod
Lead Site Reliability Engineer, Blue Yonder
Shrikant Dandge
Site Reliability Engineer, Granicus Technologies India Pvt Ltd
sharathkumar sortur
NOC Engineer, Vymo
Rajeev Kulkarni
Site Reliability Engineer 2, Collective Health