AppspaceAP

Site Reliability Engineer

Appspace
United States only
Apply now

About Appspace:

At Appspace, we’re passionate about creating better work experiences for people everywhere, and we’re looking for people that feel the same way. Our global office locations and flexible work culture help you work wherever and however you’re at your best. Plus, we take the time to help you enjoy your work, build lasting connections, and grow your role. Join the Appspace team and be a part of a culture that’s helping people everywhere love where they work.

Your Role as a Site Reliability Engineer:

Our Cloud Operations team seeks a Site Reliability Engineer who is passionate about problem-solving, automating, and maintaining Appspace’s Cloud Platform to support the needs of our Engineering and Customer Care teams. The ideal candidate will see manual work as an opportunity to exercise automation, will understand SRE best practices, have experience automating infrastructure deployments and developing self-healing solutions to infrastructure issues. You will work closely with a global team of cloud, engineering, product, and service professionals to improve our platform’s resiliency and scalability, which directly improves our customers’ experience with Appspace. With this role, you can grow your capabilities as a Site Reliability Engineer given the large-scale size of our cloud platform combined with our smaller-sized Cloud Operations team, which means you will have opportunities to work on all Cloud Infrastructure, end-to-end. This is a mission-critical role for Appspace, therefore while we offer flex time, it should be scheduled ahead of time, otherwise shift engagement is mandatory outside lunch and break times. On-Call coverage will be required weekly during a limited window of US daytime hours over the weekend. This is your opportunity to be part of an awesome company that is rapidly growing and defining the modern workplace experience market!

A Day in the Life of a Site Reliability Engineer:

For this role, you will play a key role in maintaining our cloud platform, which includes an assortment of Kubernetes, Microservices, MongoDB, RabbitMQ, MySQL, Windows Server VM Infrastructure, Orchestration Engines, CI/CD and Monitoring platforms. Your day will consist of:

  • Automating maintenance tasks for our Cloud Platform, therefore strong experience in Python and shell scripting is a must.
  • Deploying new features and releases of our software into Kubernetes via Helm, so strong experience in Kubernetes and Helm is a must.
  • Troubleshooting performance issues or errors thrown by the cloud platform or application, and either resolving the underlying cause, or forwarding your research to Engineering to address in the product.
  • Actioning Request Tickets from other teams in support of their needs to enable and prepare for upcoming releases.
  • Monitoring the application’s performance, uptime, and cloud infrastructure’s performance, looking for improvement opportunities, and proactively taking action to solve any negative trends before they become issues.
  • Lead, Participate, or Execute within the incident management process when alerts fire, and quickly ascertain root cause, resolve the issue, and find new and creative solutions to prevent recurrence.
  • Configure, Monitor, Research, and Evaluate workload performances both on Google Cloud Platform and Microsoft Azure Clouds.
  • Collaborating with our Development and Quality Assurance teams to address issues in the product and platform.
  • Documenting new or updating existing processes and procedures to share knowledge and improve on standardized approaches to solution.

What You’ll Need:

  • Must be able to learn new technologies quickly and a desire to be a life-long learner
  • Must communicate well and adapt to working well with others across different countries and cultures.
  • Strong background in Containers, Kubernetes, Helm, Linux, Python coding, and some experience with Windows Server OS and MacOS are a must.
  • Experience with Google Cloud Platform, Google Kubernetes Engine, Google Compute Engine, and Google Storage is highly desired, but comparable experience with AWS or Azure will be considered.
  • Solid troubleshooting experience and the ability to reason through a process workflow to identify a fault or odd behavior (i.e., spending time following log trails) is a must.
  • Experience with administering MySQL & MongoDB preferred.
  • Experience with administering message brokering systems like RabbitMQ preferred.
  • Must be flexible on occasionally attending “off-hour” meetings (we’re a global team supporting a global customer base!).
  • Open to quarterly travel up to 5%.

Nice to Haves:

  • Experience with Build pipeline tools and the Atlassian suite (JIRA, Confluence, Bitbucket/Git, Bamboo, Octopus).
  • Experience with monitoring and alerting platforms, especially StackDriver.
  • Experience with HashiCorp Terraform.
  • Experience with IIS.

The Perks of Working for Appspace:

For all our US based team members, we offer a variety of benefits from competitive salaries, medical, dental and vision coverage, disability coverage, employer paid life insurance, mental health resources, 401(k) plan and a fully paid parental leave program.

Additional perks include:

  • Generous PTO
  • Flexible work schedules
  • Remote work opportunities
  • Paid company holidays
  • 1/2 Day Fridays
  • Appspace Quiet Fridays (No non-essential internal meetings scheduled)
  • A casual dress work environment

Disclaimer:

Appspace Inc. is an equal opportunity employer. All aspects of employment, including recruiting, hiring, placement, promotion, termination, compensation, and training are decided based on qualifications, merit, and business needs. We do not discriminate on the basis of race, color, religion or belief, marital status, sex, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy (including childbirth, lactation, or related medical conditions), gender, gender identity and/or expression, genetic information, sexual orientation, family or parental status, veteran status, or any other status protected under federal, state, or local laws.

Appspace is committed to creating a diverse and inclusive workplace and does not tolerate discrimination or harassment of any kind.

Elevate your application

Let our AI craft your perfect cover letter and align your resume to this job's criteria.

By using our AI tools, you consent to sharing your profile with our AI partner for this purpose.

Apply now

Please let Appspace know you found this job on Himalayas. This helps us grow!

Apply now

About the job

Apply before

Jun 23, 2024

Posted on

Apr 24, 2024

Job type

Full Time

Experience level

Mid-level

Location requirements

Hiring timezones

United States +/- 0 hours
Claim this profileAppspace logoAP

Appspace

View company profileVisit appspace.com

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

6 remote jobs at Appspace

Explore the variety of open remote roles at Appspace, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Appspace

Remote companies like Appspace

Find your next opportunity by exploring profiles of companies that are similar to Appspace. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join thousands of other remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan