HimalayasHimalayas logo
InnodataIN

Senior Data Engineer – Real-Time & Distributed Systems (GCP)

At Innodata, we're passionate about bridging the gap between data and innovative technology.

Innodata

Employee count: 5000+

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Who we are:

Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are the AI technology solutions provider-of-choice to 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine.

By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of clean and optimized digital data to all industries. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms.

Our global workforce includes over 3,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.

Key Responsibilities

  • Design, build, and optimize scalable data pipelines for batch and real-time processing

  • Develop and maintain event-driven architectures for high-throughput systems

  • Ensure data reliability, performance, and low-latency processing across distributed environments

  • Collaborate with data scientists and application teams to enable analytics and AI use cases

  • Implement best practices in performance tuning, monitoring, and cost optimization

Requirements

  • Advanced proficiency in Python for backend and large-scale data processing

  • Strong experience building and managing big data pipelines in production environments

  • Hands-on expertise with workflow orchestration tools such as Airflow or Google Cloud Composer

  • Proven experience in batch and streaming data processing using:

    • Apache Spark

    • Apache Beam (Dataflow)

  • Experience designing and operating event-driven systems using Pub/Sub

  • Strong understanding of distributed systems architecture and scalability patterns

  • Experience managing globally distributed, low-latency datasets

  • Hands-on experience with NoSQL databases and/or Google Cloud Spanner

  • Strong knowledge of system reliability, fault tolerance, and performance optimization

Preferred Skills

  • Proficiency in Go, Java, or Scala

  • Experience with Kafka or Flume for streaming ingestion

  • Deep familiarity with the Google Cloud Platform ecosystem

  • Experience with production monitoring, logging, and observability frameworks

  • Exposure to high-availability, multi-region deployments

Please be aware of recruitment scams involving individuals or organizations falsely claiming to represent employers. Innodata will never ask for payment, banking details, or sensitive personal information during the application process. To learn more on how to recognize job scams, please visit the Federal Trade Commission’s guide at https://consumer.ftc.gov/articles/job-scams.

If you believe you’ve been targeted by a recruitment scam, please report it to Innodata at verifyjoboffer@innodata.com and consider reporting it to the FTC at ReportFraud.ftc.gov.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Location requirements

Hiring timezones

United States +/- 0 hours

About Innodata

Learn more about Innodata and their company culture.

View company profile

At Innodata, we're passionate about bridging the gap between data and innovative technology. With over 35 years of experience in the industry, we assist some of the world’s leading tech companies and enterprises in driving advancements in Generative AI and traditional AI. Our journey began in 1988, and over the years, we've established ourselves as a trusted partner, providing cutting-edge solutions that encompass data engineering, advanced data solutions, and insights for market leaders across a multitude of sectors, including healthcare, finance, and media.

Our commitment extends beyond merely providing services; we genuinely aim to foster innovation and transformation within organizations. Our comprehensive offerings range from sophisticated data annotation to enterprise AI solutions and workflow automation. We understand that every project is unique, which is why we also provide customizable platforms tailored to specific business needs. With a global presence and a dedicated team of over 5,000 experts, we deliver high-quality results around the clock, helping our clients to efficiently navigate their digital transformations and optimize their AI initiatives.

Claim this profileInnodata logoIN

Innodata

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

19 remote jobs at Innodata

Explore the variety of open remote roles at Innodata, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Innodata

Remote companies like Innodata

Find your next opportunity by exploring profiles of companies that are similar to Innodata. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan