Join our AI Platform Team to drive operational excellence, automation, and platform reliability. As a SRE & DevOps Engineer, you will design, implement, and maintain CI/CD pipelines, manage Kubernetes clusters, and automate operational tasks.
Requirements
- Design, implement, and maintain CI/CD pipelines for AI platform applications
- Manage and troubleshoot Kubernetes clusters, Docker containers, and cloud infrastructure
- Ensure high availability (99.999%), system reliability, and security across platforms
- Automate operational tasks, monitoring, and deployment workflows
- Collaborate with AI platform developers to deploy and scale services efficiently
- Analyze and resolve production issues, performance bottlenecks, and functional problems
- Define operational standards, versioning practices, and advise teams on DevOps best practices
- Prepare documentation, training materials, and provide technical support to platform users
- Design, build, and refactor services in React / Node.js
- Integrate backend services with interactive UI components (Jupyter notebooks, developer tools)
- Contribute to developer productivity tools, such as VS Code plugins or ML workflow integrations
- Collaborate with AI platform developers to integrate applications into automated CI/CD workflows
Benefits
- Flexible working format - remote, office-based or flexible
- Competitive salary and good compensation package
- Personalized career growth
- Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
- Active tech communities with regular knowledge sharing
- Education reimbursement
- Memorable anniversary presents
- Corporate events and team buildings
- Other location-specific benefits
