Datadog logo

Software Engineer - Site Reliability

Datadog

Job description

We're on a mission to build the best platform in the world for engineers to understand and scale their systems, applications, and teams.  We operate at high scale—trillions of data points per day—providing always-on alerting, metrics visualization, logs, and application tracing for tens of thousands of companies. Our engineering culture values pragmatism, honesty, and simplicity to solve hard problems the right way

The team:


The Site Reliability teams at Datadog are responsible for ensuring that our high-volume, low-latency environments continue to perform around the clock. These teams collaborate closely with our product engineers to ensure that Datadog can monitor millions of servers and containers, ensuring our customers always have dependable and actionable data at their fingertips. You’ll be responsible for shaping the infrastructure of our data-intensive, real-time services as we continue to grow at petabyte scale.

You will:


  • Keep our service reliable, available and fast
  • Respond to, investigate and fix service issues, whether they be deep in the OS kernel or in the application code.
  • Design, build and maintain the infrastructure we need to support orders of magnitude more customers.

Requirements:


  • You have a track record working with large-scale distributed systems, preferably in the cloud OR you have a BS/MS/PhD in a scientific field or equivalent experience
  • You value correctness and efficiency; you leave no stone unturned when diagnosing production issues
  • You handle infrastructure with code because automation lets you focus on the more difficult and rewarding problems
  • You have production experience with distributed compute/storage tools, e.g. zookeeper, cassandra, postgres, kafka, elasticsearch, redis

Bonus points:


  • You have submitted bug fixes to the aforementioned projects
  • You are fully fluent in python, ruby and go

Is this you? Tell us why, and apply now. Include links to your github, stackoverflow or other online projects.

Apply now

Apply Apply

Please let Datadog know you found this job on Himalayas. This will help us grow!

About this role

Apply before

August 18th, 2021

Job posted on

October 16th, 2020

Job type

Full Time

Hiring timezone

Worldwide
Primary industry
Company size

1,001-5,000

Founded in

2010

Social media
Visit datadoghq.com Visit datadoghq.com

About the company

Modern monitoring & analytics. See inside any stack, any app, at any scale, anywhere Datadog is a monitoring and analytics platform for large-scale application infrastructure and applications. C...
View company profile View company profile

We'll keep you updated when the best new remote jobs pop up.

mail
Subscribe

We care about the protection of your data. Read our Privacy Policy.

Featured remote companies

View all companies View all companies
  • Media.net logo

    Media. net is a technology company comprising of 1250+ employees focused on developing innovative monetization products for digital publishers and advertisers. Media.net's vast product suite

    Employees

    1,001-5,000

  • Sticker Mule logo

    Sticker Mule is the fastest and easiest way to buy custom printed products in a matter of minutes we will turn your logo, artwork, photos designs, and illustrations into custom products.

    Employees

    51-200

  • Mixcloud logo

    Mixcloud is an audio streaming platform that supports creators to craft a deeper listening experience and build their own fan communities.

    Employees

    11-50

  • Swayable logo

    Swayable is a global research platform that measures how effectively media content changes opinions.

    Employees

    11-50

  • FullStack logo

    FullStack is a small team of designers and developers who are passionate about creating exceptional web and mobile applications.

    Employees

    11-50

  • Jolly Good Code logo

    As modern craftsmen specialising in Agile practices and Ruby on Rails, we have the passion and confidence to train you to be a software engineer, and expertise and experience to help you build you

    Employees

    1-10