We're seeking an experienced infrastructure engineer to help design and deploy a high-performance backend that supports cloud-based inference and rendering pipelines. This role is critical to enabling scalable, responsive user experiences for advanced real-time applications that reach users instantly via XR and web. As our Cloud Infrastructure Engineer, you’ll architect and optimize the backbone that makes this possible.
You’ll build the pipelines and GPU infrastructure to host AI and graphics rendering engines, stream output to XR clients with ultra-low latency, and scale to handle global users with minimal downtime.
Responsibilities
Design and deploy the cloud infrastructure supporting 3D rendering, AI inference, and asset storage.
Implement ultra-low-latency streaming systems (e.g., WebRTC, custom protocols) to deliver rendered content to XR clients.
Optimize backend systems for predictive pre-rendering by integrating ML models that anticipate user actions so scenes can be rendered in advance.
Ensure scalability and reliability via modern DevOps practices: containerization (Docker, Kubernetes), auto-scaling (AWS/GCP/Azure), and CI/CD pipelines.
Automate infrastructure provisioning and management using Infrastructure as Code (IaC) tools.
Collaborate with AI and graphics engineers to ensure inference and rendering workloads are optimized for GPU instances and distributed execution.
Monitor and optimize performance, reliability, and cost of cloud services.
Troubleshoot and resolve issues related to cloud infrastructure and services.
Provide technical guidance and support to engineering teams in cloud adoption and migration strategies.
Requirements
Proven experience as a DevOps Engineer or Cloud Architect in Azure, AWS, and GCP environments, with a strong understanding of cloud infrastructure, including compute, storage, and networking.
Hands-on experience with CI/CD tools like Jenkins and GitLab CI, and proficiency in scripting languages (Bash, Python, PowerShell) for automation. Skilled in Infrastructure as Code (IaC) using Terraform or CloudFormation, and in monitoring/logging with Prometheus, Grafana, and the ELK Stack.
Expertise in cloud-native, GPU-accelerated pipelines for media and ML workloads. Proficient in backend development using Python, Node.js, or Go, alongside IaC tools (Terraform, Helm). Knowledgeable about edge computing, CDN, and 5G MEC strategies for minimizing latency.
Bonus: Experience with game streaming platforms and cloud rendering stacks (Unreal/AWS GameLift, NVIDIA CloudXR).
What We Value
Comfortable navigating ambiguity and working independently.
Action-oriented with a practical approach to solving complex problems.
Strong ownership mentality and proven delivery on high-impact projects.
Clear communicator with strong collaboration skills, especially with technical teams.
Experience building from the ground up in fast-moving startup environments.
Genuine enthusiasm for accelerating ML research and deployment in the creative space.
Bonus Skills (Nice-to-Have)
Experience with ML pipelines involving video, image, or 3D data.
Familiarity with distributed compute frameworks (e.g., Ray) or orchestration tools (e.g., Flyte).
Familiarity with game engines (Unreal or Unity).
Knowledge of vector databases and similarity search (e.g., LanceDB).
Prior work in AI/ML research settings or startups.
Contributions to open-source ML/data infrastructure projects.
Experience designing tools directly for researchers or technical users.