HimalayasHimalayas logo
AA
Looking for a job

Abdoul Aw

@abdoulaw

Senior Site Reliability Engineer delivering reliable cloud platforms with Kubernetes, SLOs, and incident mastery.

United States
Message

What I'm looking for

I’m looking for a remote-first team where I can own reliability end-to-end—SLOs, incident response, observability, and DevSecOps guardrails—partnering with Security and DX to enable rapid, safe product delivery.

I’m a Senior Site Reliability Engineer with 20+ years designing and operating highly reliable, production-grade infrastructure at scale across AWS, GCP, and Azure. I’m deeply experienced in Kubernetes (EKS, GKE), Infrastructure as Code (Terraform), GitOps (ArgoCD), and full-stack observability (Prometheus, Grafana, Datadog), and I’ve owned incident management, post-incident analysis, and SLO/SLI frameworks that improved platform reliability to 97.97% and reduced MTTD by 43%.

I lead production-readiness reviews, disaster recovery exercises, and cross-functional reliability initiatives in high-growth B2B SaaS environments—partnering with Security, Developer Experience, and product engineering to embed SRE best practices, golden-path tooling, and CI/CD guardrails. I also drive DevSecOps policy-as-code controls (OPA/Gatekeeper, Kyverno), and I back it all up with measurable outcomes like reducing compute spend by 37%, while mentoring teams and thriving in remote-first, high-autonomy settings.

Experience

Work history, roles, and key accomplishments

NE
Current

Senior Site Reliability Engineer

Netstratum

Jan 2025 - Present (1 year 4 months)

Led production-readiness and reliability design reviews, defining SLO targets and rollout/rollback criteria to meet enterprise standards. Owned incident management and post-incident reviews, reducing MTTD by 43%, while building full-stack observability and disaster recovery practices that improved platform reliability to 97.97% and reduced compute spend by 37%.

TH

Senior Infrastructure Engineer

Throtle

Jan 2023 - Jan 2025 (2 years)

Designed and implemented distributed compute platforms on AWS EKS, improving workload latency by 31% while achieving 97.97% SLO reliability across production pipelines. Built Terraform/Python autoscaling and IaC automation that reduced cloud costs by 37%, drove environment drift to 0%, and reduced rollout-related defects by 31%.

ED

Senior SRE / Software Engineer

edX

Jan 2021 - Jan 2023 (2 years)

Built Python-based orchestration to automate Kubernetes provisioning, rollouts, and monitoring, eliminating manual operations by 67%. Scaled distributed microservices through refactors that reduced response latency under load by 29% and implemented self-healing remediation with Prometheus and Python.

AC

Senior DevOps Engineer

ActiveProspect

Jan 2021 - Jan 2021 (0 months)

Built monitoring-as-code (Python + Terraform) for 500+ distributed monitors, reducing alert fatigue by 41% and improving SLO accuracy across microservices. Developed CI/CD tooling and automated validation scripts, reducing rollout-related defects by 31%.

CO

Senior DevOps Engineer

Conno

Jan 2021 - Jan 2021 (0 months)

Implemented AWS and Kubernetes platforms using Terraform, Docker, and Helm, stabilizing production environments by 23%. Built automated diagnostics to reduce configuration drift and improve operational reliability.

ME

Senior Systems / DevOps Engineer

MedAllies

Jan 2018 - Jan 2021 (3 years)

Optimized multi-tier application stacks (JVM, DB, API, Linux) via profiling and load tuning, improving peak production performance by 29% and reducing peak-load latency by 23%. Engineered highly available CI/CD pipelines using Jenkins, Docker, and Ansible, with Python/Bash automation reducing environment failures by 37%.

OX

Senior Linux/Unix Systems Admin

OpenText (Xpedite)

Jan 2004 - Jan 2015 (11 years)

Diagnosed complex kernel-level failures using kdump, crash analysis, and eBPF instrumentation, reducing investigation time by 43%. Tuned CPU, memory (NUMA), network, and I/O subsystems for latency-sensitive distributed applications and reduced manual diagnostics by 53% through Python/Bash tooling.

Education

Degrees, certifications, and relevant coursework

Amazon Web Services (AWS) logoAA

Amazon Web Services (AWS)

AWS Solutions Architect (in progress), Cloud Architecture

AWS Solutions Architect certification is listed as in progress.

Google Cloud logoGC

Google Cloud

Professional Cloud Engineer (in progress), Cloud Architecture

Google Cloud Professional Cloud Engineer certification is listed as in progress.

HashiCorp logoHA

HashiCorp

Terraform Associate, Terraform

Holds the HashiCorp Terraform Associate credential.

Monmouth University logoMU

Monmouth University

Bachelor of Science (BS), Computer Science

Earned a Bachelor of Science in Computer Science from Monmouth University.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan