Scribd logo

Senior Data Engineer

Scribd

Job description

At Scribd (pronounced “scribbed”), we believe reading is more important than ever. Join our cast of characters as we build the world’s largest and most fascinating digital library: giving subscribers access to a growing collection of ebooks, audiobooks, magazines, documents, Scribd Originals and more.

In addition to works from major publishers and top authors, we also create our own original content exclusively for Scribd users.

Our community includes over 1.4 M subscribers in nearly every country worldwide.

What you'll do


Data quality and integrity are two areas of focus for your work in our existing, organically-grown data infrastructure. You will be in charge of building tools and technology to ensure that downstream customers can have faith in the data they're consuming. Based on the project, this might involve cross-functional work with the Data Science and Content Engineering teams to repartition or optimize business-critical Hive tables, or working with Core Platform to implement better processing jobs for scaling our consumption of streaming data sets. Almost everything you would be working on would be to increase the "customer satisfaction" for internal customers of Scribd data.

Required Skills


  • Strong written and verbal communication skills (we're remote!)
  • You have 5+ years experience in data engineering
  • You have engineered scalable software using big data technologies (e.g. Hadoop, Spark, Hive, Flink, Samza, Storm, Elasticsearch, Druid, Cassandra, etc)
  • You have experience building data pipelines (real-time or batch) on large complex datasets
  • Fluency with at least one dialect of SQL (MySQL and Hive preferred)
  • Expertise in Scala, Java, or Python

Desired Skills


  • You have worked on and have knowledge of Streaming platforms, typically based around Kafka.
  • Strong grasp of AWS data platform services and their strengths/weaknesses.
  • Strong experience using  Jira, Slack, JetBrains IDEs, Git, GitLab, GitHub, Docker, Jenkins, Terraform. 
  • Experience using DataBricks

At Scribd, we value people above everything else. We're building a diverse workplace and an inclusive culture to give more people the chance to change the way the world reads. Scribd is proud to be an equal opportunity employer and considers all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender, gender identity, national origin, disability, veteran status or any other legally protected characteristic. We encourage people of all backgrounds to apply because we believe that a diverse set of perspectives and experiences create a foundation for the best ideas. Come join us in building something meaningful.

Apply now

Apply Apply

Please let Scribd know you found this job on Himalayas. This will help us grow!

About this role

Apply before

May 13th, 2021

Job posted on

October 21st, 2020

Job type

Full Time

Hiring timezones

Scribd is hiring for this role in the following timezones:

Badge UTC -10.0
Badge UTC -9.5
Badge UTC -9.0
Badge UTC -8.0
Badge UTC -7.0
Badge UTC -6.0
Badge UTC -5.0
Badge UTC -4.0
Badge UTC -3.5
Badge UTC -3.0
Badge UTC -2.0
Badge UTC +14.0

Categories

Primary industry
Company size

201-500

Founded in

2007

Social media
Visit scribd.com Visit scribd.com

About the company

We believe reading is more important than ever. Join our cast of unique characters as we build the world’s largest and most fascinating digital library: giving subscribers access to a growing collectio...
View company profile View company profile

We'll keep you updated when the best new remote jobs pop up.

mail
Subscribe

We care about the protection of your data. Read our Privacy Policy.

Featured remote companies

View all companies View all companies
  • Catylist logo

    Catylist began in 2001 when Ronald D. Marten, CCIM partnered with a couple of young software developers to build a commercial real estate search engine for the CCIM Institute.

    Employees

    11-50

  • Erply logo

    Erply was founded in 2009 to give businesses the easiest and most powerful platform to manage their inventory and shops across a series of locations and devices.

    Employees

    51-200

  • Apollo logo

    Apollo is the foundation of your entire go-to-market strategy. Apollo is the unified engagement acceleration platform that gives reps the ability to dramatically increase their number of quality c

    Employees

    1,001-5,000

  • GigSalad logo

    GigSalad offers an easy way to book local entertainment and services for any type of event.

    Employees

    11-50

  • Medium logo

    Welcome to Medium, where words matter. Medium taps into the brains of the world’s most insightful writers, thinkers, and storytellers to bring you the smartest takes on topics that matter.

    Employees

    201-500

  • Lemonpie logo

    Lemonpie is a podcast PR and production agency that helps brands like Freshbooks, HubSpot, and Four Sigmatic grow through podcasting.

    Employees

    11-50