Your New Role: Senior DevOps Engineer, focusing on Overleaf Infrastructure
What you’ll be doing
This role requires a blend of hands-on infrastructure ownership, automation, and a strong focus on system reliability and cost efficiency:
- GCP Infrastructure Ownership: You will own our infrastructure on Google Cloud Platform and the Terraform codebase, managing critical components including VPCs, Compute Engine, Kubernetes Clusters, Cloud SQL/Redis, Load balancers, Cloud Armor, logging/monitoring pipelines, and IAM.
- Automation & CI/CD: Build and optimize CI/CD pipelines using Jenkins or similar tools, and automate routine operations with shell scripts where appropriate.
- Reliability & Monitoring: Implement and manage monitoring, alerting, and incident response systems using Google Cloud Monitoring and similar tools. You will be part of a rotating on-call schedule for critical infrastructure issues outside normal business hours.
- Database Management: Ensure the performance, reliability, and uptime of PostgreSQL and Mongo databases with proactive monitoring and tuning.
- Cost Management: Oversee resource usage on GCP to ensure we are managing our costs efficiently.
- Collaboration & Knowledge Sharing: Take a collegiate approach to sharing knowledge with engineers, building consensus for change, and writing excellent documentation.
What you’ll bring to the role
- Cloud & Containers: Significant working knowledge of cloud-computing environments such as GCP or AWS. Strong hands-on expertise in Kubernetes and Docker.
- Infrastructure as Code (IaC): Strong hands-on expertise in Terraform.
- Operating Systems & Scripting: Solid Linux/Unix systems knowledge and scripting skills (Bash/Python).
- DevOps Tooling: Experience with CI/CD tools (e.g., Jenkins) and monitoring platforms (e.g., Grafana, Google Cloud Monitoring).
- Database Expertise: Experience working with databases such as Mongo, PostgreSQL, and Redis.
- SRE Practice: Know how to implement best-practice alerting, monitoring, and observability on applications that experience high load.
- Incident Management: An excellent track record of dealing with production incidents and post-incident analysis.
- Agile: Significant experience working in an Agile methodology and implementing best practices in version control and code review.
Mindset
- A security-first mindset at all times, covering confidentiality, integrity, and availability.
- A commitment to staying up-to-date with emerging technologies and implementing innovative cloud solutions.
- Understand error budgets, SLI, and SLOs.
- Understand how to manage cloud computing costs effectively.
- Experience coding in a language such as JavaScript.
Don't worry if you don't meet every qualification—let us be the judge! Studies show that many qualified candidates from under-represented groups hesitate to apply unless they meet every single requirement. We are dedicated to building a diverse and inclusive team and strongly encourage you to submit your application.
Living our Values
At Digital Science, our vision is to see research flow seamlessly – trusted, collaborative, and accessible – fueling breakthroughs that push humanity forward. This ambitious mission is one we achieve together, by enabling open, collaborative, inclusive research.
We firmly believe that to truly innovate and solve the complex challenges faced by our customers, from researchers and universities to funders and publishers, we need diverse perspectives, experiences, and ideas. A rich mix of voices drives quality insights, fosters enhanced collaboration, and ultimately pushes knowledge forward more effectively.
As an equal opportunity employer, we are committed to building and nurturing a workplace where every individual feels valued and belongs. All applicants will be considered for employment without attention to race, colour, religion, age, sex, sexual orientation, gender identity, national origin, veteran, or disability status. Beyond recruitment, we strive to cultivate an environment where inclusivity is woven into the fabric of our culture, enabling everyone to be their best self and do their best work.
