Job Title: DevOps LeadLocation: Remote – Latin America Preferred
Type of Contract: Full-Time | Contractor
Salary Range: Market RatesLanguage Requirements: Professional English required
We are seeking a skilled DevOps Lead with deep expertise in bare metal infrastructure, cloud platforms, and disaster recovery to join our growing team. You will play a key role in defining infrastructure strategy, ensuring platform reliability, and leading high-availability operations across hybrid environments. Your work will directly impact system uptime, scalability, and the organization’s ability to operate resilient, production-grade platforms.
Key Responsibilities
• Own the design, provisioning, and operations of self-hosted bare metal infrastructure, including server configuration, storage, and network topology
• Architect and manage hybrid cloud environments (AWS, GCP, or Azure), ensuring seamless integration with on-premise systems
• Define and implement disaster recovery strategies, including RTO/RPO standards, backups, replication, and failover mechanisms
• Lead CI/CD platform development, optimizing pipelines for Java-based services and broader engineering workflows (Jenkins, GitHub Actions, GitLab CI)
• Drive containerization and orchestration strategies using Docker and Kubernetes in self-hosted environments
• Establish infrastructure-as-code practices (Terraform, Ansible) as the single source of truth across all environments
• Build and lead a DevOps team, including defining on-call rotations, incident response processes, and operational best practices
Must-Have Qualifications
• 8+ years of experience in DevOps or infrastructure engineering, with at least 3 years in a leadership role
• Proven experience managing bare metal or on-premise infrastructure at production scale
• Strong expertise in cloud platforms (AWS, GCP, or Azure) and hybrid infrastructure design
• Hands-on experience with infrastructure-as-code tools such as Terraform and Ansible
• Proficiency with Kubernetes and Docker in production, self-hosted environments
• Solid experience with CI/CD tools and release engineering practices
• Working knowledge of PostgreSQL operations, including replication, backup, and recovery
Preferred Qualifications
• Experience implementing and leading disaster recovery programs with defined RTO/RPO metrics
• Familiarity with monitoring and observability tools such as Prometheus, Grafana, or ELK stack
• Understanding of security best practices including IAM, secrets management, and compliance frameworks (SOC 2, GDPR)
• Strong leadership and communication skills, with the ability to present technical concepts to non-technical stakeholders
• Experience collaborating closely with software architecture teams to align infrastructure with application design
