HimalayasHimalayas logo
InfraCloudIN

Spark Developer

InfraCloud Technologies provides cutting-edge cloud-native solutions, specializing in AI cloud infrastructure and GPU enablement.

InfraCloud

Employee count: 201-500

India only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

About the Role

We are looking for a highly skilled Senior Software Engineer with deep expertise in Apache Spark and distributed data processing systems. In this role, you will work directly on Apache Spark internals, contribute upstream improvements to the Spark open-source community, and adapt Spark capabilities to support DataPelago’s product requirements.

This position demands strong technical ownership, independent execution, and the ability to drive high-impact engineering initiatives in a fast-paced environment.

Key Responsibilities

  • Contribute code, fixes, and enhancements directly to the Apache Spark open-source project.
  • Upgrade and maintain compatibility with newer Apache Spark community releases.
  • Analyze, modify, and optimize Spark internals to support DataPelago’s platform requirements.
  • Design and implement scalable distributed data processing solutions.
  • Debug and resolve complex performance, stability, and scalability issues within Spark-based systems.
  • Collaborate with product and platform teams to align Spark capabilities with business and technical objectives.
  • Drive architecture discussions and provide technical leadership across distributed systems initiatives.
  • Ensure high engineering standards through code reviews, testing, documentation, and best practices.
  • Work independently with minimal supervision while delivering high-quality outcomes.

Required Qualifications

  • Strong experience with Apache Spark internals and distributed computing systems.
  • Proven experience contributing to open-source projects, preferably Apache Spark or related Apache ecosystem technologies.
  • Expertise in Java and/or Scala programming.
  • Strong understanding of query execution, distributed processing, memory management, and performance optimization.
  • Experience upgrading and maintaining large-scale Spark deployments.
  • Deep knowledge of big data technologies and distributed systems architecture.
  • Strong debugging, problem-solving, and performance tuning skills.
  • Ability to work autonomously and lead technically challenging initiatives.

Preferred Qualifications

  • Experience with query engines, vectorized execution, or data processing frameworks.
  • Familiarity with Kubernetes, cloud-native environments, and large-scale infrastructure.
  • Knowledge of JVM performance tuning and low-level system optimization.
  • Prior experience working closely with open-source communities.

What We Expect

  • Senior-level ownership and accountability.
  • High independence with the ability to make impactful technical decisions.
  • Strong communication and collaboration skills.
  • Passion for open-source software and distributed data technologies.

Nice to Have

  • Active GitHub or Apache contributor profile.
  • Experience with large-scale analytics or database systems.
  • Publications, talks, or community involvement in big data technologies.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Location requirements

Hiring timezones

India +/- 0 hours

About InfraCloud

Learn more about InfraCloud and their company culture.

View company profile

InfraCloud Technologies is an innovative company specializing in cloud-native technologies, with a strong emphasis on building AI cloud infrastructures. Our expertise extends to GPU cloud solutions, bare metal provisioning, and private cloud setups using technologies such as OpenStack, Kubernetes, and Prometheus. We enable organizations to transform their operations and enhance their cloud capabilities, ensuring they are truly cloud-native.

Our team is dedicated to modernizing cloud infrastructures, helping businesses adopt best practices in source code management and continuous integration/delivery. This is crucial for developing effective AI platforms. We take pride in orchestrating data workflows and streamlining processes to facilitate quick execution at scale. By establishing robust model registries and feature stores, we ensure organizations can efficiently serve AI applications globally. Additionally, our services include Kubernetes consulting, security implementations, and observability solutions to monitor and manage cloud infrastructures effectively.

Employee benefits

Learn about the employee benefits and perks provided at InfraCloud.

View benefits

Company Retreats

InfraCloud organizes company offsites like Converge.

Wealth Creation (ESOPs)

Attractive ESOPs policy to help you grow and build wealth.

Open Source Contributions

Contribute to your favorite open source projects & build a personal brand.

Flexible working hours

Flexible working hours and work-life balance are not perks but a must at InfraCloud.

View InfraCloud's employee benefits
Claim this profileInfraCloud logoIN

InfraCloud

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

5 remote jobs at InfraCloud

Explore the variety of open remote roles at InfraCloud, offering flexible work options across multiple disciplines and skill levels.

View all jobs at InfraCloud

Remote companies like InfraCloud

Find your next opportunity by exploring profiles of companies that are similar to InfraCloud. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan