Join Inframark's award-winning Automation and Intelligence team as a DevOps Engineer. You'll take ownership of infrastructure, stabilize and modernize the infrastructure supporting WaterMinds, and drive infrastructure improvements. Opportunity to transition into MLOps as the platform matures.
Requirements
- Take ownership of production monitoring and alerting using Prometheus, Grafana, and CloudWatch
- Modernize production EKS cluster with GitOps practices (ArgoCD), comprehensive monitoring, and proper deployment workflows
- Streamline staging deployment process; eliminate branch-based workarounds and establish clean GitOps patterns
- Design infrastructure patterns that scale to hundreds of customers and own AWS infrastructure operations including patching, maintenance, cost optimization, and security compliance
- Expand into MLOps—building the infrastructure that enables data scientists to deploy models at scale across multiple utility customers once DevOps operations are automated
- Manage Kubernetes clusters (EKS) including pod migrations, resource optimization, troubleshooting, and security updates—proactively, not reactively
- Maintain infrastructure as code using Terraform and Ansible following best practices—all changes tested in non-production before deployment
- Support engineering teams with infrastructure needs, unblock them quickly, and establish self-service patterns where possible—anticipate needs, don't wait for requests
- Manage message queue infrastructure (Kafka/Redpanda) including retention policies, storage optimization, and performance tuning
- Document infrastructure, create runbooks, and automate operational tasks to move systems into maintenance mode
- Clean up technical debt—proactively identify infrastructure to decommission, resources to consolidate, and costs to optimize
Benefits
- health insurance
- dental insurance
- life insurance
- 401(k) plan
- paid time off
- sick leave
- holidays
- wellness plan
