HimalayasHimalayas logo
OracleOR

Principal Big Data Site Reliability Developer (US Citizenship Required) US REMO

Oracle Corporation is an American multinational computer technology company that specializes in database software and technology, cloud engineered systems, and enterprise software products. It is one of the largest software companies in the world.

Oracle

Employee count: 5000+

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

This role requires U.S. Citizenship and eligibility for a Federal Security Clearance

Our Team

Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Data, Analytics Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.

Oracle Health Data, Analytics Platform has a rare opportunity to play a critical role in how Oracle Health products impact and disrupt the healthcare industry by transforming how healthcare and technology intersect.

You will have the opportunity to:

  • Reach billions of people with our products & services
  • Create technology in which truly impacts the world
  • Ability to have immediate impact on developing technology
  • Unlimited growth potential with inspiring work
  • Work with the best minds in the industry
  • Enjoy working in an open, diverse, and productive environment

About The Job

This role provides technical leadership for the core data platforms behind Oracle Health’s Data & Analytics Platform. As a Principal Site Reliability Engineer (SRE), you will own shared, mission-critical systems used by multiple products and teams.

You will lead the design and operation of large-scale, stateful distributed platforms, including Hadoop ecosystem components (HDFS, YARN, HBase) deployed on Oracle Big Data Service (BDS), Kafka, and Storm. These multi-tenant platforms are deployed and operated through Ansible- and Terraform-based automation and require strong architectural ownership to manage scale, change, and broad blast radius.

What You'll Do

Platform Ownership & Technical Leadership

  • Own the end-to-end reliability, scalability, and operability of shared data platforms
  • Define platform standards, architectural direction, and operational guardrails
  • Influence cross-team technical decisions and long-term platform strategy
  • Drive long-term platform evolution and influence reliability strategy across the data ecosystem

Architecture & Design

  • Lead platform architecture and design reviews
  • Clearly articulate system behavior, dependencies, and failure modes
  • Make principled trade-offs between reliability, performance, cost, and complexity
  • Provide guidance and guardrails that enable downstream teams to use platforms safely and effectively

Operations Engineering

  • Establish capacity models, scaling strategies, and operational best practices
  • Design platforms that behave predictably under load, failure, and change
  • Own platform lifecycle events: upgrades, expansions, decommissioning, and recovery

Distributed Systems Expertise

  • Operate and evolve stateful distributed systems where data placement, replication, and recovery are critical
  • Reason about failure modes such as backpressure, rebalancing, region movement, replication lag, and rolling upgrades

Security

  • Operate and maintain Kerberized platforms, including authentication, authorization, and secure service-to-service communication
  • Treat security as a first-class architectural concern

Automation

  • Design and evolve an Ansible- and Terraform-driven automation framework
  • Treat automation as production software: versioned, reviewed, tested, and improved
  • Eliminate operational toil by encoding reliability and safety into the platform

Incident Leadership & Prevention

  • Serve as the ultimate escalation point for complex or ambiguous incidents
  • Focus on eliminating entire classes of failure, not just resolving individual issues

Representation

  • Represent SRE and platform engineering in high-visibility and sensitive forums
  • Communicate clearly with engineering leadership and partner teams

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Location requirements

Hiring timezones

United States +/- 0 hours

About Oracle

Learn more about Oracle and their company culture.

View company profile

At Oracle, we are at the forefront of technological innovation, dedicated to helping people see data in new ways, discover insights, and unlock endless possibilities. Founded in 1977 by Larry Ellison, Bob Miner, and Ed Oates as Software Development Laboratories, our journey began with a vision inspired by Edgar F. Codd's research paper on relational database management systems (RDBMS). This led to the 1979 release of Oracle, the first commercial relational database program to utilize Structured Query Language (SQL), a milestone that quickly gained popularity and set the stage for our future growth. By 1982, the company was renamed Oracle Systems Corporation to align with our flagship product, Oracle Database, and in 1987, we became the largest database management company globally. Our commitment to innovation continued as we developed products compatible with emerging web technologies, further solidifying our leadership in database technology.

Throughout our history, Oracle has consistently expanded its offerings through both organic growth and strategic acquisitions. Key acquisitions like PeopleSoft in 2005, Siebel in 2006, BEA Systems in 2008, and Sun Microsystems in 2010 have been instrumental in broadening our portfolio to include enterprise resource planning (ERP), customer relationship management (CRM), enterprise infrastructure software, and critical technologies like Java and Solaris. These strategic moves have enabled us to provide a comprehensive suite of enterprise software products, including human capital management (HCM), enterprise performance management (EPM), and supply chain management (SCM) software. Today, Oracle is a global leader in database software, cloud engineered systems, and enterprise software products, with a significant focus on cloud computing and artificial intelligence. We are continuously investing in research and development, with over $80 billion invested since fiscal year 2012, and have spent over $110 billion on more than 150 acquisitions to enhance our capabilities and drive industry advancement. Our mission is to empower businesses worldwide with groundbreaking technology, enabling them to transform and modernize their operations for sustained success in an ever-evolving digital landscape.

Employee benefits

Learn about the employee benefits and perks provided at Oracle.

View benefits

Pet Insurance

Pet insurance is offered.

Military Leave

Oracle offers military leave.

Caregiver Leave

Oracle offers caregiver leave.

Jury Duty Leave

Oracle offers jury duty leave.

View Oracle's employee benefits
Oracle logoOR

Oracle

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

23 remote jobs at Oracle

Explore the variety of open remote roles at Oracle, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Oracle

Remote companies like Oracle

Find your next opportunity by exploring profiles of companies that are similar to Oracle. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan