MLOps Engineer | Remote
Requirements
- 10+ years of IT experience, including at least 6 years of relevant experience primarily in MLOps, cloud, Dockerization, and containerization.
- Develop best practices and how-to guides for MLOps practices relating to Git, CI/CD, input data unit and statistical testing, experiment tracking, model registry, scheduling of ML pipelines, production drift monitoring and alerting, and optimization of cloud compute resources.
- Adopt best practices for writing processed data (ML features) to appropriate data lakes, warehouses, or feature stores.
- Extensive experience provisioning infrastructure and configuring public and hybrid clouds; GCP experience is mandatory.
- Dockerization of machine learning scripts and deployment on Google Cloud Platform.
- Ability to automatically create and deploy machine learning notebooks, and to establish connections to Big Data infrastructure from those notebooks.
- Experience administering Kubernetes and Google Kubernetes Engine (GKE), and an understanding of manifest management with Helm.
- Experience with CI/CD pipelines and related tools such as Jenkins, CircleCI, and Google Cloud Build.
- Experience with configuration management and deployment tools such as Terraform and Google Deployment Manager.
- Good knowledge of other tools such as Puppet, Ansible, Chef, Consul, and Packer.
- Set up monitoring and alerts for production workloads on Google Cloud Platform.
- Experience automating multiple systems using Bash and languages such as Python and Go.
- Recommend and implement automated solutions that improve the performance and reliability of the system.
- Strong understanding of, and experience operating in, an agile development environment.
- Strong communication skills across the board, with a passion for finding and sharing best practices and driving greater discipline.
- Excellent verbal and written communication skills in English.
Nice to have:
- Experience managing Linux/Unix platforms, including application server administration (Tomcat, JBoss, etc.), DNS, and Linux system configuration and administration (CentOS, Red Hat, etc.).
- Good exposure to monitoring tools geared towards user experience and deep diagnostics (Splunk, New Relic, AppDynamics, Prometheus, Grafana, etc.).
