This is a remote position.
We are seeking a deeply technical Senior OpenStack Engineer to design, build, automate, scale, and operate large-scale production OpenStack environments powering enterprise private clouds, MSP platforms, and high-performance digital twin lab infrastructures.
This is not a UI-driven admin role. We are looking for engineers who understand OpenStack at the service, database, messaging, hypervisor, and packet-flow layers — individuals who can troubleshoot RabbitMQ queues, debug Neutron agents, tune Ceph latency, and automate full cloud deployments from bare metal upward.
You will work on multi-region architectures, high-availability designs,NVMestorage fabrics, SDN integrations, and hybrid cloud platforms supporting global customers.
Primary Responsibilities
1. OpenStack Architecture & Platform Engineering
- Design production-grade OpenStack environments across controller, compute, and storage nodes.
- Architect HA control planes usingHAProxy,Keepalived, Galera, and RabbitMQ clustering.
- Build scalable cell-based Nova architectures.
- Implement multi-region replication strategies.
- Perform platform capacitymodelingand growth forecasting.
2. Compute Virtualization (Nova)
- Nova scheduler tuning and filters.
- CPU pinning and isolation.
- NUMA topology alignment.
-HugePagesconfiguration.
- Live migrations and evacuations.
- GPU passthrough and SR-IOV provisioning.
Hypervisor stack includes KVM, QEMU,Libvirt, andVirtIO.
3. Networking & SDN (Neutron)
- ML2 plugin architecture.
- OVS, OVN, Linux Bridge deployments.
- VXLAN, Geneve, VLAN overlays.
- DVR and L3 routing.
- Floating IP NAT design.
- SR-IOV and DPDK acceleration.
- Integration with BGP EVPN, MPLS, VRFs, and SD-WAN.
4. Storage Engineering
Ceph (Primary Requirement)
- RBD block storage.
-CephFSand RGW object storage.
- CRUSH map tuning.
- Placement group optimization.
-BlueStoreperformance tuning.
-NVMeand SSD tiering.
Additional exposure toLinstor, DRBD, iSCSI, andNVMe-oFpreferred.
5. Image & Lifecycle Services
- Glance image pipelines.
- QCOW2 optimization.
- Cloud-initautomation.
- Golden image lifecycle management.
6. Identity & Access (Keystone)
- RBACmodeling.
- LDAP/AD integration.
- SAML/SSO federation.
- Token lifecycle management.
- Heat orchestration templates.
- Terraform automation.
- Ansible playbooks.
- CI/CD for infrastructure.
