Role Overview
We are building a high-performance, multi-tenant OpenShift cluster on bare metal in our AI-optimized data center. Our goal is to offer OpenShift as a Service to private AI-focused clients who wish to host their compute-intensive workloads in a scalable, secure, and isolated environment.
We’re looking for a hands-on Red Hat OpenShift Engineer who can design, architect, and implement this platform from scratch using industry best practices. Once the cluster is built, this role will also lead the migration of existing on-prem and Azure workloads to the new OpenShift environment.
Key Responsibilities
- Design, architect, and build a multi-tenant OpenShift cluster on bare metal
- Configure and maintain all aspects of the OpenShift platform for high availability, scalability, and security
- Define and implement networking, storage, ingress, monitoring, and logging
- Develop detailed architecture documents, blueprints, and SOPs
- Lead and execute migration of existing workloads from on-premise and Azure environments to OpenShift
- Ensure smooth onboarding for multiple AI-focused client tenants with isolated environments
- Support DevOps team with advanced Linux/OpenShift troubleshooting
- Implement and enforce RBAC, tenant isolation, resource quotas, and compliance controls
- Optimize performance for AI-heavy workloads running on GPU-enabled infrastructure
- Own operational stability, platform upgrades, and monitoring
Required Skills & Experience
- 5+ years of deep hands-on experience with Red Hat OpenShift and Kubernetes
- Proven experience designing, building, and managing bare metal OpenShift clusters
- Solid understanding of Linux internals, container runtimes, networking, and troubleshooting
- Experience with application migration from both on-premise and Azure environments into OpenShift
- Strong experience with multi-tenancy architecture, including workload isolation and security
- Familiarity with storage (CSI), networking (CNI), and service mesh implementations
- Proficiency in monitoring and observability tools (e.g., Prometheus, Grafana, ELK)
- Experience with Infrastructure as Code (Ansible, Terraform) and CI/CD automation
- Strong documentation and communication skills
Certifications (Required)
- Red Hat Certified Specialist in OpenShift Administration
- Red Hat Certified Engineer (RHCE) or equivalent Linux certification
Nice to Have
- Familiarity with AI/ML compute environments (e.g., GPU workloads, NVIDIA operators)
- Experience with hybrid cloud or edge computing models
- Exposure to enterprise-grade security, compliance, and policy enforcement
