Staff Site Reliability Engineer

The enterprise data science management platform trusted by over 20% of the Fortune 100.

Domino Data Lab

Employee count: 51-200

Argentina only

Who we are

At Domino, we build software that helps the largest, AI-driven organizations build and operate advanced data science and AI solutions at scale. Our platform integrates a streamlined model development environment, MLOps capabilities, and novel features for collaboration, reuse, and reproducibility — all of which make data science teams more productive, reduce time to value, and ensure compliance. Our customers — like Johnson & Johnson, GSK, Bristol Myers, UBS, FINRA and the US Navy — are using our software to solve some of the most important challenges in the world, such as developing new medicines, securing our financial markets, or protecting our country. Backed by Sequoia Capital, Coatue Management, NVIDIA, Snowflake, NetApp and other leading investors, we have been in business for a decade but are still a small team operating with the spirit of a startup. Especially in the world of AI today, we believe that the future is still being invented — and we want to be the ones building it. For more information, visit www.domino.ai

What we are building

As our infrastructure and customer footprint grow, we're investing in a new kind of SRE practice where the people who respond to incidents also build the systems that make future incidents shorter, rarer, and less painful. We're developing AI-assisted tooling that helps our support and engineering teams diagnose problems faster, learn from outages more deeply, and automate away the toil that slows everyone down. This role sits at the center of that: equal parts hands-on operator, software engineer, and technical leader. If you believe that operational experience and engineering craft make each other stronger, you'll feel right at home here.

What your impact will be

  • Lead the development of Domino's internal AI-assisted reliability tooling, including systems that analyze tickets, logs, traces, and documentation to help teams resolve outages faster with less recurring toil
  • Improve the observability coverage and signal quality for our most critical customer-facing systems, so engineers have more to work with throughout the development and support lifecycle
  • Own incident response end-to-end, from detection to remediation, and leave each problem space better documented, better understood, and less likely to recur
  • Guide the development of customer and user-facing observability tools within our products
  • Define and mature SLO/SLI frameworks for priority services, turning abstract reliability goals into measurable, actionable standards
  • Scale cloud operations practices for Domino’s single-tenant SaaS offering, and work with engineering teams to improve the reliability and repeatability of customer deployments and upgrades
  • Mentor other engineers and shape how SRE is practiced at Domino, including incident response workflows, operational readiness expectations, and post-incident learning culture

What we look for in this role

  • Deep experience in Site Reliability Engineering, platform engineering, or a software engineering role with genuine, hands-on operational ownership
  • Fluency with Kubernetes, Linux, cloud platforms, and observability tooling, and the ability to use them to investigate complex, real-world production problems
  • A strong ability to perceive and close reliability gaps in technical products, tools and processes
  • Strong software engineering skills in Python or Go, with a track record of building internal tools or services that people actually rely on
  • Comfort leading technically ambiguous work and influencing direction across teams without needing direct authority to get things done
  • A history of improving reliability through engineering and automation, not just putting out fires manually
  • Strong communication skills and real experience mentoring engineers or shaping technical decision-making on your team
  • Sound judgment about AI/LLM tooling: you know where it genuinely helps in operational workflows and where it adds noise instead of signal
  • Bonus: Experience with LLM-based systems, retrieval workflows, SaaS platform operations, or building tooling for support or developer teams

What we value

  • We strongly believe in the value of growing a diverse team and encourage people of all backgrounds, genders, ethnicities, abilities, and sexual orientations to apply.
  • We value a growth mindset: high-performing, creative individuals who dig into problems and see the opportunities for success.
  • We believe in individuals who seek and speak the truth and can be their whole selves at work.
  • We value people who believe improvement is always possible. At Domino, everything is a work in progress, and we can always do better.
  • We emphasize an environment of teaching and learning to equip employees with the tools needed to be successful in their function and at the company.

About the job

Job type: Full Time

Hiring timezones: Argentina +/- 0 hours

About Domino Data Lab

The enterprise data science management platform trusted by over 20% of the Fortune 100. Our products enable thousands of data scientists to develop better medicines, grow more productive crops, adapt risk models to major economic shifts, build better cars, improve customer support, or simply recommend the best purchase to make at the right time.

Data scientists are called upon to solve ever more complex problems across every facet of business and civic life. Domino empowers data science teams to develop and deploy ideas faster with collaborative, reusable, reproducible analysis in a secure platform built with the needs of compliance-intensive industries in mind.

Domino is backed by leading venture capital firms: Sequoia Capital, Bloomberg Beta, Coatue Management, Dell Technologies Capital, Highland Capital Partners, In-Q-Tel, and Zetta Venture Partners.

Employee benefits

Retirement benefits

401(k) or pension plan to help you invest in your future.

Employee assistance program (EAP)

We offer an employee assistance program focused on mental health.

Company equity

Domino offers stock options that vest over a four-year period with a one-year cliff.

Company events

Domino hosts company outings quarterly. We also sponsor family-oriented events annually.
