Jack Akova
@jackakova
Incident and problem management leader driving service resilience and continuous improvement.
What I'm looking for
I am a detail-oriented incident and problem management leader with 18+ years of experience owning major incidents, reducing downtime, and improving service resilience across enterprise environments. I specialize in incident command, postmortems, known error databases, and creating runbooks while collaborating with SRE, observability, and support teams to implement lasting corrective actions.
I have led incident programs at large organizations including Allstate, Airbnb, CarMax, and Penn Entertainment, leveraging tools such as ServiceNow, Splunk, Dynatrace, Datadog, SolarWinds, and Grafana. I focus on continuous improvement, clear executive communications, and building processes that reduce MTTR and prevent repeat outages.
Experience
Work history, roles, and key accomplishments
Incident Commander
The Score / Penn Entertainment
Jun 2024 - Present (1 year 3 months)
Drive cross-functional incident response and runbook development, perform postmortems and root-cause analysis, and manage problem lifecycle to reduce repeat outages and improve service resilience.
Oversaw critical incident lifecycle and on-call response, produced operational metrics, led Severity 0–2 incidents and AARs, and drove process improvements to reduce downtime.
Principal Engineer, Service Management
CarMax
Jul 2021 - Nov 2022 (1 year 4 months)
Led incident and problem process improvements, hosted post-incident reviews, and coordinated monitoring and CAB activities to minimize business impact of critical applications.
Incident / Problem Manager Lead
Arthrex
Mar 2020 - Jul 2021 (1 year 4 months)
Led root-cause investigations for P1/P2 incidents, built incident metrics and runbooks, and coordinated cross-team efforts to improve monitoring and reduce critical outages.
Enterprise Incident / Problem Manager Lead
AutoNation
Apr 2019 - Mar 2020 (11 months)
Owned major incident bridges and executive communications, created dashboards and runbooks to detect and prevent critical incidents, and delivered monthly availability metrics and continuous improvements.
Led command center incident and problem operations across two sites, trained incident managers, drove RCA and SOPs, and improved MTTR and enterprise application availability over a multi-year tenure.
Managed incident detection and resolution processes, escalations, and communications to maintain service availability.
Performed data center operations and monitoring to support enterprise services and ensure operational continuity.
Education
Degrees, certifications, and relevant coursework
ITT Technical Institute
Associate Degree in IT, Computer Network Systems
Completed an Associate Degree in Computer Network Systems focusing on IT and networking fundamentals.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Jack?
You can contact Jack and 90k+ other talented remote workers on Himalayas.
Message JackFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
