Skip to main content
Suraj UserSU
Open to opportunities

Suraj User

@surajkumar01

DevOps engineer who delivers multi-cloud Kubernetes, Terraform, and Airflow platforms with production-grade incident ownership.

India
Message

What I'm looking for

I’m looking for a DevOps role where I can own multi-cloud Kubernetes + Terraform, run reliable Airflow/Snowflake workflows, improve CI/CD and GitOps, and lead production incident root-cause to lasting fixes.

I’m a DevOps engineer with hands-on ownership of cloud infrastructure and large-scale data orchestration across AWS, Azure, and GCP. My day-to-day work spans Kubernetes (EKS/AKS/GKE), Terraform, ArgoCD, CI/CD, Apache Airflow, and Snowflake—built for stability in real production environments.

At Infra360 (cloud & DevOps consulting), I embed with senior data, MLOps, and data platform engineers and deliver end-to-end cloud migrations. I executed an in-place Apache Airflow 2.10 to 3.2 upgrade across AWS EKS and Azure AKS, preserving metadata with backup/rollback runbooks per cutover.

I also root-caused a production Airflow 3.1.7 DagProcessor crash loop, traced to a heartbeat/liveness-probe mismatch under memory pressure, and tuned probes/resources to drive a permanent framework upgrade. Beyond that, I automated Snowflake RBAC as Terraform across four cloud-region repositories, integrated Microsoft Entra ID SSO over OAuth 2.0, and hardened scaled API replicas, worker OOM behavior, and Postgres connection-pool exhaustion.

I replicate production into isolated multi-region GCP tenants for data residency and latency (US and Middle East), provisioning VPC/networking, autoscaling private GKE, GeoDNS failover, CDN-backed GCS, and Cloud KMS-encrypted remote state via Terraform. I build GitOps and observability/security foundations with ArgoCD ApplicationSets, Helm/ Istio, Prometheus/Grafana/Mimir, and scanners like Falco, ELK, Trivy, Snyk, and OWASP ZAP—while staying active in open source (MLflow, CNCF Kubernetes Conformance, Apache Airflow) and writing production-focused DevOps/MLOps content on my blog.

Experience

Work history, roles, and key accomplishments

IN
Current

Assistant DevOps Engineer

Infra360

Jun 2025 - Present (1 year)

Led customer cloud migrations and Kubernetes-based platform work, including an in-place Apache Airflow 2.10→3.2 upgrade across AWS EKS and Azure AKS with validated downstream pipelines. Root-caused a production Airflow scheduler crash loop and hardened multi-cloud Airflow deployments with Terraform-managed Snowflake RBAC, Entra ID SSO, and end-to-end GCP multi-region tenant provisioning.

HE

Software Engineering Fellow

Headstarter

Jul 2024 - Sep 2024 (2 months)

Built and shipped 5 AI-powered web apps in 5 weeks using React/Next.js with Firebase and Vercel, supported by CI/CD and Docker. Delivered a RAG customer-support agent using OpenAI and Pinecone within a 3-person team.

Education

Degrees, certifications, and relevant coursework

GM

Global Institute of Technology and Management

Bachelor of Technology, Computer Science & Engineering

2021 - 2025

B.Tech in Computer Science & Engineering at Global Institute of Technology and Management, Gurugram from 2021 to 2025.

GM

Global Institute of Technology and Management

Bachelor of Technology (B.Tech), Computer Science & Engineering

2021 - 2025

B.Tech in Computer Science & Engineering at Global Institute of Technology and Management, Gurugram from 2021 to 2025.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan