MedTrainer is an innovator in the healthcare industry, changing the landscape of technology offerings with its Platform Solution, which comprises its proprietary Learning Management System (LMS), Compliance Training, and a Managed Services offering in Credentialing and Compliance Management. We are looking for a Site Reliability Engineer who can build, scale, maintain, and monitor highly available, secure, and cost-efficient cloud platforms and Kubernetes workloads, with a strong focus on reliability engineering practices.
Requirements
- Bachelor's degree in Computer Science or equivalent
- 3+ years working on distributed systems and cloud operations
- Strong hands-on experience with major cloud providers (Azure, AWS, GCP) and their managed Kubernetes services
- Deep experience architecting and/or operating large Kubernetes clusters
- Container expertise (Docker/OCI), packaging and configuration, and service mesh experience
- Advanced GitHub Actions expertise
- Strong Python skills (required) for Pulumi-based IaC, tooling, and automation
- Familiarity with CI/CD and change management, plus experience with progressive delivery
- Experience with observability stacks and alerting practices tied to SLOs
- Experience configuring cloud-native networking, storage, Linux, security controls, and cost governance
- Experience migrating and scaling infrastructure across clouds
- Relevant certifications (e.g., CKA) are a plus
- Advanced English (optional)
Benefits
- Major Medical Insurance and healthcare coverage
- Home office and ergonomics support (internet, electricity, office chair)
- Professional development opportunities, including English classes
- Wellness benefits such as TotalPass gym discounts
- Savings plan
- Paid time off, including personal days
