HimalayasHimalayas logo
ClouderaCL

Sr. Data Engineer

Cloudera, Inc. is a leading American data lake software company providing a hybrid data platform that manages and analyzes data across any cloud environment.

Cloudera

Employee count: 1001-5000

Costa Rica only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Business Area:

IT

Seniority Level:

Mid-Senior level

Job Description:

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

Cloudera Data Engineers power our Analytics and AI/ML initiatives by building scalable, high-performance data pipelines. In this role, you will lead our transition from traditional manual coding to AI-orchestrated development (Vibe Coding), architecting next-gen data pipelines and GenAI applications at unprecedented speed.

You will focus on modernizing development workflows and building GenAI-powered self-service tools that empower the business to resolve data needs independently. By designing robust, AI-first data management processes on Cloudera’s native platform, you will ensure data integrity while creating a blueprint for both internal efficiency and external customer success.

As a Senior Data Engineer you will:

  • Collaborate with Data Architects, Operational Architects, and Data Analysts to understand the data and operational requirements across different business units.

  • Partner with data owners to ensure seamless, reliable data ingestion for both traditional analytics and GenAI-powered applications.

  • Master "Vibe Coding" and AI-orchestrated development to accelerate the delivery of new data pipelines and GenAI applications, reducing the end-to-end development lifecycle from days to hours.

  • Develop and implement data transformations to enrich and provision data, following established specifications and standards while utilizing AI-first workflows.

  • Design and implement robust system architectures for real-time, near real-time, and batch processing data flows to meet the operational demands of complex business systems.

  • Design and deploy GenAI-powered "Self-Service" tools, including automated documentation generators and natural language interfaces, to empower business users and reduce routine engineering requests.

  • Implement monitoring and CI/CD automation processes to track data quality and ensure the reliability of AI-supported data services.

  • Standardize AI-first engineering workflows across the team to ensure high-quality, auto-validated, and well-documented code delivery.

We are excited if you have (Required Experience):

  • 5+ years of experience as a Data Engineer.

  • Proven experience with AI-first approaches and "Vibe Coding," with a demonstrated ability to deliver production-ready data pipelines using AI orchestration rather than purely manual coding.

  • Deep proficiency with AI-assisted coding tools, including Cursor, GitHub Copilot, or Gemini, to modernize and accelerate engineering workflows.

  • Solid skills in System Design for diverse data architectures, including expert-level knowledge of batch processing and real-time/streaming processing.

  • Proficient in coding with Python (primary) and SQL, with experience in ETL and data processing.

  • Hands-on experience with Distributed Systems and Big Data technologies, including Spark and the Hadoop ecosystem (Hive, Impala, Kafka).

  • Proven proficiency in Data Modeling using industry best practices (e.g., Kimball, Inmon) to ensure data integrity.

  • Ability to monitor critical data pipelines for quality and resolve any issues effectively.

  • Education: Bachelor’s degree in Computer Science, Engineering, Information Systems, or a related field.

  • Strong communication skills, both written and verbal.

You may also have:

  • Experience supporting GenAI applications from the Data Engineering side, including managing vector databases, RAG (Retrieval-Augmented Generation) pipelines, or LLM data orchestration.

  • Experience with Apache Airflow or Apache NiFi.

  • Expertise in optimizing data storage using HDFS/Parquet/Avro, Kudu, or HBase.

What you can expect from us:

  • Generous PTO Policy

  • Support work life balance with Unplugged Days

  • Flexible WFH Policy

  • Mental & Physical Wellness programs

  • Phone and Internet Reimbursement program

  • Access to Continued Career Development

  • Comprehensive Benefits and Competitive Packages

  • Paid Volunteer Time

  • Employee Resource Groups

EEO/VEVRAA

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Education

Bachelor degree

Experience

5 years minimum

Location requirements

Hiring timezones

Costa Rica +/- 0 hours

About Cloudera

Learn more about Cloudera and their company culture.

View company profile

At Cloudera, we empower people to transform complex data into clear and actionable insights. Our mission is to deliver an enterprise data cloud for any data, anywhere, while harnessing the innovation of the open source community. We provide the industry's only true hybrid data platform with secure data management and portable cloud-native analytics, allowing organizations to unlock the full potential of their data and accelerate their digital transformation.

With a focus on data democratization, Cloudera enables organizations to securely manage and analyze data from a variety of sources—from sensors and edge devices to applications and databases. This capability is particularly crucial in today's data-driven environment, where the ability to extract actionable insights can significantly impact business outcomes. Our solutions are employed by numerous Fortune 500 companies and top-performing organizations across industries including financial services, telecommunications, healthcare, and government, demonstrating our commitment to delivering scalable and reliable data solutions.

Claim this profileCloudera logoCL

Cloudera

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

43 remote jobs at Cloudera

Explore the variety of open remote roles at Cloudera, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Cloudera

Remote companies like Cloudera

Find your next opportunity by exploring profiles of companies that are similar to Cloudera. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan