MagicSchool hiring Senior Site Reliability Engineer (Observability & Resilience) • Remote (Work from Home) | Himalayas
MagicSchoolMA

Senior Site Reliability Engineer (Observability & Resilience)

We’ve designed our tools with the realities of teaching in mind.

MagicSchool

Employee count: 11-50

Salary: 130k-150k USD

United States only

WHO WE ARE: MagicSchool is the premier generative AI platform for teachers. We're just over 2 years old, and more than 5.5 million teachers from all over the world have joined our platform. Join a top team at a fast growing company that is working towards real social impact. Make an account and try us out at our website and connect with our passionate community on our Wall of Love.

Role Description:

As Senior Site Reliability Engineer (Observability & Resilience), you will lead observability across our platform and help design the resilient infrastructure our customers and educators rely on every day. In this hands-on, individual contributor role, you’ll drive instrumentation and telemetry strategy while partnering closely with product and engineering to plan for Resilience, Recovery, and Availability.

Responsibilities:

In this role, you will be responsible for driving to the following outcomes:

  • Observability Leadership: Design and implement observability patterns—including metrics, logging, tracing, and alerting—to ensure we have clear, actionable visibility into platform behavior and performance.

  • Build internal tooling and dashboards: Empower our teams with real-time system insights.

  • Operational Excellence: Define and maintain SLIs and SLOs in partnership with product and engineering teams. Establish best practices for alert tuning and signal-to-noise balancing to reduce incident fatigue and improve response accuracy.

  • Platform Resilience: Architect and support infrastructure that prioritizes high availability, disaster recovery, and graceful degradation. Leverage Terraform and infrastructure-as-code to ensure consistent, reliable deployments across AWS and Google Cloud.

  • Cross-Functional Enablement: Collaborate with engineers across teams to embed resilient design and observability from the ground up. Provide training and pairing support to product engineers, helping them build and maintain telemetry that supports the full software lifecycle.

Experience & Qualifications:

To be successful in this role, you’ll bring the following experience and qualifications:

  • Professional Experience: At least 5 years in an SRE, DevOps, or observability-focused role, with a track record of success in fast-paced, high-growth environments.

  • Observability & Resilience: Experience designing and operating systems for high availability and disaster recovery. Familiarity with incident response, alert fatigue reduction, and signal-to-noise balancing.

  • Tooling Expertise: Deep experience with observability tools such as Grafana, Prometheus, Loki, Datadog, and OpenTelemetry. Proven ability to operationalize these tools for maximum team impact.

  • Infrastructure Skills: Strong proficiency with Terraform and infrastructure-as-code workflows. Experience with multi-cloud deployments and operating resilient systems at scale.

  • Enablement & Collaboration: Passion for enabling product engineers through training and pairing on observability patterns. Ability to drive cross-functional initiatives that improve system health and team effectiveness.

  • Communication Skills: Skilled at explaining complex infrastructure and observability concepts to both technical and non-technical audiences. Calm and decisive under pressure, especially during incident response.

Nice to Have:

  • Experience with Sentinel, Loki, or similar logging/metrics stacks.

  • Exposure to educational or compliance-heavy environments.

  • Strong debugging skills and a calm presence during incidents.

Notice: Priority Deadline and Review Start Date

Please note that applications for this position will be accepted until 7/18/25 — applications received after this date will be reviewed on an intermittent basis. While we encourage early submissions, all applications received by the priority deadline will receive equal consideration. Thank you for your interest, and we look forward to reviewing your application.

Why Join Us?

  • Work on cutting-edge AI technology that directly impacts educators and students.

  • Join a mission-driven team passionate about making education more efficient and equitable.

  • Flexibility of working from home, while fostering a unique culture built on relationships, trust, communication, and collaboration with our team - no matter where they live.

  • Unlimited time off to empower our employees to manage their work-life balance. We work hard for our teachers and users, and encourage our employees to rest and take the time they need.

  • Choice of employer-paid health insurance plans so that you can take care of yourself and your family. Dental and vision are also offered at very low premiums.

  • Every employee is offered generous stock options, vested over 4 years.

  • Plus a 401k match & monthly wellness stipend

Our Values:

  • Educators are Magic: Educators are the most important ingredient in the educational process - they are the magic, not the AI. Trust them, empower them, and put them at the center of leading change in service of students and families.

  • Joy and Magic: Bring joy and magic into every learning experience - push the boundaries of what’s possible with AI.

  • Community: Foster community that supports one another during a time of rapid technological change. Listen to them and serve their needs.

  • Innovation: The education system is outdated and in need of innovation and change - AI is an opportunity to bring equity, access, and serve the individual needs of students better than we ever have before.

  • Responsibility: Put responsibility and safety at the forefront of the technological change that AI is bringing to education.

  • Diversity: Diversity of thought, perspectives, and backgrounds helps us serve the wide audience of educators and students around the world.

  • Excellence: Educators and students deserve the best - and we strive for the highest quality in everything we do.

Compensation Range: $130K - $150K

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Salary

Salary: 130k-150k USD

Location requirements

Hiring timezones

United States +/- 0 hours

About MagicSchool

Learn more about MagicSchool and their company culture.

View company profile

We’ve designed our tools with the realities of teaching in mind.

Teachers don’t have time to pore over complicated new systems — which is why our tools are so simple, you can start using them immediately.

We know your time is valuable. You can start using our tools, and saving time, as soon as you sign up. Customizing our tools takes minutes. AI is a vast and complex field, but we make it so that you can take advantage of this new technology and immediately apply it to all kinds of tasks on your plate.

Every teacher has unique context — which is why our tools are fully customizable.

The edtech field is saturated with generic (and therefore not very useful) teacher tools. We recognize that every teacher has special knowledge about their students, classroom, and school.
That’s the knowledge that makes teachers so invaluable. For maximum impact, you can always adjust our tools to your particular context, whether that’s a student’s reading level, aligned to a specific objective, teaching philosophy, or an taking into account what students have already learned.
We also know that many teachers have lessons provided - and it's their task to customize those lessons to fit their students' needs which is why many of our tools allow unique transformations of existing material.

There are some things teachers don’t need technology to help with — which is why we’ve focused on creating tools to streamline tedious tasks only teachers might recognize.

There are plenty of edtech tools meant to help teachers—and yet, most don’t serve the real needs of teachers.
Instead of only creating tools that address commonly understood aspects of teaching, we’ve drawn from research-based best practices, feedback from educators, as well as our own experiences in the classroom, to create tools that will save time on repetitive, tedious behind-the-scenes tasks that only teachers understand require tons of effort… things like generating relevant content, writing IEPs, differentiation, creating assessments, and supporting school discipline. That way, teachers can spend their time on the uniquely human, creative aspects of their work.

Tech stack

Learn about the tools and technologies that MagicSchool uses to build, market, and sell its products.

View tech stack

MagicSchool employees can create an account to update this tech stack.

Employee benefits

Learn about the employee benefits and perks provided at MagicSchool.

View benefits

Unlimited PTO

Unlimited PTO is available.

Retirement benefits

Generous 401(k) with matching.

Healthcare benefits

Medical, dental, and vision insurance for employees and dependents.

View MagicSchool's employee benefits
Claim this profileMagicSchool logoMA

MagicSchool

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

30 remote jobs at MagicSchool

Explore the variety of open remote roles at MagicSchool, offering flexible work options across multiple disciplines and skill levels.

View all jobs at MagicSchool

Remote companies like MagicSchool

Find your next opportunity by exploring profiles of companies that are similar to MagicSchool. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan