Abdoul Aw
@abdoulaw
Senior Site Reliability Engineer delivering reliable cloud platforms with Kubernetes, SLOs, and incident mastery.
What I'm looking for
I’m a Senior Site Reliability Engineer with 20+ years designing and operating highly reliable, production-grade infrastructure at scale across AWS, GCP, and Azure. I’m deeply experienced in Kubernetes (EKS, GKE), Infrastructure as Code (Terraform), GitOps (ArgoCD), and full-stack observability (Prometheus, Grafana, Datadog), and I’ve owned incident management, post-incident analysis, and SLO/SLI frameworks that improved platform reliability to 97.97% and reduced MTTD by 43%.
I lead production-readiness reviews, disaster recovery exercises, and cross-functional reliability initiatives in high-growth B2B SaaS environments—partnering with Security, Developer Experience, and product engineering to embed SRE best practices, golden-path tooling, and CI/CD guardrails. I also drive DevSecOps policy-as-code controls (OPA/Gatekeeper, Kyverno), and I back it all up with measurable outcomes like reducing compute spend by 37%, while mentoring teams and thriving in remote-first, high-autonomy settings.
Experience
Work history, roles, and key accomplishments
Senior Site Reliability Engineer
Netstratum
Jan 2025 - Present (1 year 4 months)
Led production-readiness and reliability design reviews, defining SLO targets and rollout/rollback criteria to meet enterprise standards. Owned incident management and post-incident reviews, reducing MTTD by 43%, while building full-stack observability and disaster recovery practices that improved platform reliability to 97.97% and reduced compute spend by 37%.
Senior Infrastructure Engineer
Throtle
Jan 2023 - Jan 2025 (2 years)
Designed and implemented distributed compute platforms on AWS EKS, improving workload latency by 31% while achieving 97.97% SLO reliability across production pipelines. Built Terraform/Python autoscaling and IaC automation that reduced cloud costs by 37%, drove environment drift to 0%, and reduced rollout-related defects by 31%.
Senior SRE / Software Engineer
edX
Jan 2021 - Jan 2023 (2 years)
Built Python-based orchestration to automate Kubernetes provisioning, rollouts, and monitoring, eliminating manual operations by 67%. Scaled distributed microservices through refactors that reduced response latency under load by 29% and implemented self-healing remediation with Prometheus and Python.
Senior DevOps Engineer
ActiveProspect
Jan 2021 - Jan 2021 (0 months)
Built monitoring-as-code (Python + Terraform) for 500+ distributed monitors, reducing alert fatigue by 41% and improving SLO accuracy across microservices. Developed CI/CD tooling and automated validation scripts, reducing rollout-related defects by 31%.
Senior DevOps Engineer
Conno
Jan 2021 - Jan 2021 (0 months)
Implemented AWS and Kubernetes platforms using Terraform, Docker, and Helm, stabilizing production environments by 23%. Built automated diagnostics to reduce configuration drift and improve operational reliability.
Senior Systems / DevOps Engineer
MedAllies
Jan 2018 - Jan 2021 (3 years)
Optimized multi-tier application stacks (JVM, DB, API, Linux) via profiling and load tuning, improving peak production performance by 29% and reducing peak-load latency by 23%. Engineered highly available CI/CD pipelines using Jenkins, Docker, and Ansible, with Python/Bash automation reducing environment failures by 37%.
Senior Linux/Unix Systems Admin
OpenText (Xpedite)
Jan 2004 - Jan 2015 (11 years)
Diagnosed complex kernel-level failures using kdump, crash analysis, and eBPF instrumentation, reducing investigation time by 43%. Tuned CPU, memory (NUMA), network, and I/O subsystems for latency-sensitive distributed applications and reduced manual diagnostics by 53% through Python/Bash tooling.
Education
Degrees, certifications, and relevant coursework
Amazon Web Services (AWS)
AWS Solutions Architect (in progress), Cloud Architecture
AWS Solutions Architect certification is listed as in progress.
Google Cloud
Professional Cloud Engineer (in progress), Cloud Architecture
Google Cloud Professional Cloud Engineer certification is listed as in progress.
HashiCorp
Terraform Associate, Terraform
Holds the HashiCorp Terraform Associate credential.
Monmouth University
Bachelor of Science (BS), Computer Science
Earned a Bachelor of Science in Computer Science from Monmouth University.
Availability
Location
Authorized to work in
Salary expectations
Job categories
Skills
Interested in hiring Abdoul?
You can contact Abdoul and 90k+ other talented remote workers on Himalayas.
Message AbdoulFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
