Skip to main content
HimalayasHimalayas logo
MS
Open to opportunities

Michael Stedding

@michaelstedding

Senior DevOps engineer who builds reliable cloud platforms, SLO-driven observability, and scalable MLOps.

United States
Message

What I'm looking for

I seek a role where I can lead cloud platform reliability, implement IaC/CI-CD, and scale observability and MLOps while collaborating across infra, security, and product.

I am a Senior DevOps engineer who treats reliability as a product feature, turning outages, audits, and migrations into boring non-events. I design cloud platforms with Infrastructure as Code and CI/CD to help teams ship faster and safer.

At Adobe I architected multi-cloud deployments and productized MLOps, reducing region spin-up from two weeks to under one day and halving model lead time. I also improved pipeline visibility with OpenTelemetry and SLO alerting, cutting MTTR about 45% across 200+ services.

Previously, I led DevOps at Social Solutions where I consolidated CI/CD, introduced SLO-driven operations, and containerized critical subsystems to boost throughput and release cadence. I implemented least-privilege access and packaged compliance evidence to accelerate procurement and audit readiness.

I bring practical experience in multi-region DR, zero-downtime secrets rotation, policy-as-code, and scalable observability. I speak infrastructure, security, and product, and I focus on delivering measurable reliability and velocity improvements.

Experience

Work history, roles, and key accomplishments

Adobe logoAD
Current

Senior DevOps Engineer

Mar 2022 - Present (4 years 3 months)

Architected multi-cloud deployment for Adobe Experience Platform, reducing new-region spin-up from 2 weeks to under 1 day and sustaining 99.95% pilot uptime; halved MLOps model lead time and cut MTTR ~45% across 200+ services.

SS

Senior DevOps Engineer

Social Solutions

Oct 2018 - Mar 2022 (3 years 5 months)

Led platform modernization and CI/CD consolidation, increasing release cadence from weekly to daily and reducing p95 latency from ~420 ms to 250 ms across production environments; instituted SLO-driven ops to halve on-call pages.

SS

Cloud Operations Engineer

Social Solutions

Nov 2016 - Oct 2018 (1 year 11 months)

Scaled autoscaling and queue workers to improve quarter-end p95 from ~700 ms to 350 ms and implemented multi-region DR to improve RTO/RPO to 4–6 h / 15 min, avoiding SLA penalties.

Education

Degrees, certifications, and relevant coursework

Towson University logoTU

Towson University

Bachelor of Science, Computer Science

2011 - 2015

Completed a Bachelor of Science in Computer Science, focusing on software engineering and systems fundamentals.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan