Stefan Bullones
@stefanbullones
Senior Platform Engineer specializing in Kubernetes, Terraform, and cloud-native reliability across GCP and AWS.
What I'm looking for
I’m a Senior Platform Engineer focused on building and operating reliable, cloud-native platforms. In my recent work at InteractiveAI, I’ve driven containerized infrastructure with Kubernetes and Helm, GitOps-style delivery with ArgoCD, and secure service exposure using Gateway API and gVisor. I also own IaC with Terraform and run CI/CD via GitHub Actions, while delivering core GCP capabilities across GKE, Cloud SQL, Cloud Storage, Secret Manager, Cloud IAM, and Artifact Registry.
My career spans senior Site Reliability, DevOps, and technical leadership roles, all centered on turning complex infrastructure into dependable systems. At Vonage and in consulting engagements, I worked extensively with monitoring stacks (Prometheus, Grafana, Mimir, Loki, Alertmanager, Alloy), automation (Terragrunt, Jenkins, Bitbucket Pipelines), and production-grade cloud services on AWS (RDS/Aurora Global Databases, EKS, ECS, Lambda, SageMaker, CloudWatch, API Gateway, Step Functions, WAF, IAM). I also bring strong development depth: Golang and Python for platform tooling, plus hands-on applied AI and vector-search work with Qdrant and pgvector, so I can connect platform engineering directly to real product outcomes.
Experience
Work history, roles, and key accomplishments
Senior Platform Engineer
InteractiveAI
Nov 2025 - Apr 2026 (5 months)
Built the observability stack (Grafana, Loki, Mimir) from scratch and introduced SLI-based monitoring to define reliability targets for incident triage. Developed Go services and CLI tooling to automate PaaS provisioning on GCP/Kubernetes, hardened GKE with gVisor, migrated the vector store to Cloud SQL Postgres with pgvector, and implemented DevSecOps pipelines with automated AI-assisted code review.
Built and operated Kubernetes-based platform infrastructure using Helm, ArgoCD, and Gateway API, with Terraform-managed IaC and GitHub Actions CI/CD. Developed services in Go and Python and supported vector search using Qdrant and pgvector, alongside Prometheus/Grafana monitoring on GCP.
Led SRE work supporting Kubernetes deployments with ArgoCD and Argo Rollouts, backed by Terraform/Terragrunt and automated via GitHub Actions. Improved operational observability with Prometheus and Grafana and managed AWS data services, including RDS and Aurora Global Databases.
Embedded SRE across 3 teams supporting business-critical SMS messaging on Kubernetes and Aurora Global Databases with high availability and low latency across 6 AWS regions. Introduced Argo Rollouts for progressive deployments, led zero-downtime major-version MySQL migrations on Aurora Global across 6 regions, and defined SLI-based monitoring and internal SLAs.
Senior DevOps Consultant
Automat-IT
Jan 2024 - Apr 2024 (3 months)
Delivered DevOps consulting focused on Kubernetes platforms, including EKS, Helm, and containerized deployments. Implemented IaC with Terraform/Terragrunt and automated CI/CD with Bitbucket Pipelines.
Consulted for an ad-tech client on AWS EKS, delivering Terraform + Terragrunt infrastructure and extending Karpenter autoscaling to provision cost-efficient ARM/Graviton nodes. Deployed a Kubernetes-based Bitbucket runner autoscaler to improve CI/CD scaling.
Senior DevOps Engineer
Katch
Feb 2022 - Sep 2023 (1 year 7 months)
Built and maintained AWS-focused container and serverless infrastructure, leveraging ECS, Lambda, and managed workflows via Step Functions. Managed infrastructure with Terraform/Terragrunt, supported CI/CD with GitHub Actions, and developed automation using Python and Node.js.
Owned infrastructure, cloud, and CI/CD as the sole DevOps in a seed-stage startup, translating the product roadmap into infrastructure delivery. Containerized services on AWS ECS with Terraform/Terragrunt, deployed a BERT-based ML service on SageMaker after a pivot, and built a fully serverless, state-machine-driven service using Step Functions, API Gateway, and Lambda.
Consulted on OpenShift, Ansible, and AWS during a long-term banking engagement, automating OpenShift cluster deployments with Terraform and Ansible. Delivered additional customer rotations, including new-cluster validation (hardening, backups, load testing) and best-practice guidance for adopting Ansible and Kubernetes.
Consulted on Kubernetes and container platform deployments, including OpenShift components managed with Terraform Enterprise. Automated delivery with Jenkins and ArgoCD, supported AWS infrastructure (EC2/RDS/S3), and performed systems administration using Ansible and Python.
DevOps Technical Lead
Devo
Jun 2018 - Mar 2020 (1 year 9 months)
Provided DevOps leadership and ran cloud operations on AWS (EC2, RDS, S3), including monitoring and alerting workflows and Jenkins CI/CD. Managed Linux services with nginx and Tomcat, implemented monitoring with Prometheus/Grafana, and supported automation with Ansible and Java.
Led automation and monitoring for new services after a reorg, bridging developers and operations across the company. Built the monitoring stack with Prometheus, Grafana, and Netdata, and automated server provisioning and configuration with Ansible using repeatable, version-controlled workflows.
DevOps Engineer
SICPA
Jun 2017 - Jun 2018 (1 year)
Implemented CI/CD pipelines using Jenkins and supported enterprise systems administration on Linux. Managed application infrastructure including Apache, nginx, Wildfly, and VMware, with automation via Ansible and ongoing platform operations.
Ran CI/CD and Linux systems administration for production deployments, using Jenkins and Ansible to manage Tomcat/JBoss application servers. Supported build, deployment, and ongoing server operations within a Scrum product team.
Systems Administrator
Catrian
Sep 2014 - Jun 2017 (2 years 9 months)
Administered Linux-based infrastructure including Apache, Tomcat, and GitLab, and supported VMware environments. Managed monitoring with Nagios, handled network components such as VPNs and firewalls, and contributed scripting and development in Java and Python, with automation via Ansible.
Maintained and monitored client services on delegated managed-services infrastructure, administering Linux servers, virtualization, and monitoring tooling. Used VMware, Nagios, and Ansible (with Python and Java support) to keep production operations stable and observable.
Education
Degrees, certifications, and relevant coursework
Red Hat
RHCSA, RHCE, Certified Specialist in Containers and Kubernetes, Systems Administration & Containers/Kubernetes
Earned Red Hat RHCSA, RHCE, and Certified Specialist in Containers and Kubernetes (Certificate ID 200-079-630) in 2021.
Universidad Rey Juan Carlos
Master in Computer Vision, Computer Vision
2014 - 2016
Master in Computer Vision at Universidad Rey Juan Carlos (Sep 2014–Jun 2016). Final project focused on pedestrian detection with a CNN using TensorFlow and OpenCV.
UNEXPO Venezuela
Bachelor in Electronics Engineering, Electronics Engineering
2008 - 2014
Bachelor in Electronics Engineering at UNEXPO Venezuela (Sep 2008–Jun 2014). Final project involved implementing a fingerprint image filter for a fingerprint recognition system.
