Skip to main content
VK
Open to opportunities

Vitalii Kravchenko

@vitaliikravchenko

Site Reliability Engineer automating deployments and improving reliability with observability-driven practices.

Cyprus
Message

What I'm looking for

I’m looking for a team where I can own reliability outcomes (SLA/SLO), strengthen observability and incident response, automate CI/CD and infrastructure, and help run scalable Kubernetes platforms with strong security and compliance.

I’m a Site Reliability / DevOps Engineer with 6+ years of hands-on experience automating software development, testing, and deployment to improve reliability and operational outcomes. I focus on reliability engineering (SLA/SLO/SLI), incident response, and turning operational pain into repeatable automation.

In my current role at Spotware Systems, I ensure reliability, scalability, and high availability through strong incident management—root cause analysis, postmortems, and remediation. I also built a centralized observability platform (Loki, VictoriaMetrics, Vector) with metrics, distributed tracing, and log aggregation, cutting MTTD by 60% and MTTR by 40%.

I manage infrastructure using IaC and modern orchestration: Terraform, Ansible, Kubernetes, and OpenShift. I’ve also led secure secret management and migrations with HashiCorp Vault, alongside performance optimization, security, access management, and compliance.

Previously, at MTS Group and Sberbank, I designed and implemented CI/CD pipelines that reduced deployment time by 80%, deployed clustered messaging infrastructure for production reliability, and streamlined operational workflows like automated SSL certificate requests and service deployments. Earlier, at Komus and during my EPAM internship, I delivered monitoring, containerized apps with Docker, and built cloud infrastructure with Terraform and AWS (EKS/VPC/RDS), reinforcing an end-to-end DevOps mindset.

Experience

Work history, roles, and key accomplishments

SS
Current

Site Reliability Engineer

Spotware Systems

Jan 2025 - Present (1 year 5 months)

Improved service reliability and high availability using SLA/SLO/SLI management and incident response with root-cause analysis and postmortems. Built a centralized observability platform (Loki, VictoriaMetrics, Vector), reducing MTTD by 60% and MTTR by 40%, and automated CI/CD and infrastructure with Terraform, Ansible, Kubernetes, and Vault.

ER

System Engineer (Internship)

EPAM Remote

Sep 2021 - Jun 2022 (9 months)

Developed a 3-tier application using Python, PostgreSQL, and Flask, and built cloud infrastructure with Terraform on AWS (VPC, EKS, RDS). Containerized services with Docker and set up Kubernetes deployment with EKS, including monitoring with Prometheus/Grafana and code quality checks with SonarQube.

Education

Degrees, certifications, and relevant coursework

OO

Orel State University (OrelSTU)

Bachelor of Economics, Economics

Earned a Bachelor's in Economics from Orel State University (OrelSTU).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan