We are a leading provider of nearshore staff augmentation services headquartered in New York. We deliver top-tier technology solutions to companies of all sizes, from innovative startups to industry leaders, helping them achieve their digital transformation goals.
Requirements
- Designs, implements, and evolves shared AWS CDK and CDK8s constructs used across multiple services and teams.
- Maintains core infrastructure components including VPC, EKS clusters and node groups, RDS, OpenSearch, and MSK.
- Operates and extends Kubernetes cluster addons such as ingress controllers, cert-manager, autoscalers, and monitoring/logging stacks.
- Ensures high reliability through structured alerting systems (Prometheus, CloudWatch), autoscaling strategies, and recovery mechanisms.
- Manages and publishes baseline templates, configuration schemas, and comprehensive documentation for infrastructure usage.
- Owns the CI/CD pipelines for Infrastructure as Code (IaC) codebases and platform component releases.
- Collaborates with engineering teams to troubleshoot infrastructure-related issues and deliver scalable, reliable solutions.
- Applies Site Reliability Engineering (SRE) principles—including SLIs, SLOs, observability, and fault tolerance—to all shared platform services.
- Supports IAM roles, secrets management, and tenant isolation best practices.
Benefits
- 100% Remote Work
- Highly Competitive USD Pay
- Paid Time Off
