HimalayasHimalayas logo
ExtendEX

Principal Data Engineer

Extend is an AI-native document processing platform that enables technical teams to extract structured data from complex documents with over 95% accuracy.

Extend

Employee count: 11-50

Salary: 220k-250k USD

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

About Extend:

Extend is revolutionizing the post-purchase experience for retailers and their customers by providing merchants with AI-driven solutions that enhance customer satisfaction and drive revenue growth. Our comprehensive platform offers automated customer service handling, seamless returns/exchange management, end-to-end automated fulfillment, and product protection and shipping protection alongside Extend's best-in-class fraud detection. By integrating leading-edge technology with exceptional customer service, Extend empowers businesses to build trust and loyalty among consumers while reducing costs and increasing profits.

Today, Extend works with more than 1,000 leading merchant partners across industries, including fashion/apparel, cosmetics, furniture, jewelry, consumer electronics, auto parts, sports and fitness, and much more. Extend is backed by some of the most prominent technology investors in the industry, and our headquarters is in downtown San Francisco.

About the Role:

We're looking for a Principal Data Engineer to help own the analytics data architecture at Extend. This architecture powers reporting, financial processes, and business decisions for teams across the company, and feeds the data our merchants and downstream systems rely on.

This is a cross-organizational role. You’ll partner with product engineering and architecture on the data flowing upstream into Snowflake, own the design and evolution of the warehouse and reporting layer in the middle, and bridge to analytics engineering and stakeholders on the consumption side. It’s a hands-on technical leadership role anchored in Snowflake and SQL, with ownership of a portfolio of Python data jobs running on AWS — work you’ll set direction on and drive end-to-end.

Key Responsibilities:

Database Architecture. You own our data warehouse and the reporting layer on top of it, setting patterns for how data is modeled, evolved, and exposed.

Analytics Engineering. You write SQL and dbt models, refactor transformations, and build the tables and views downstream teams rely on.

Cross-Functional Partnership. You proactively engage with teams across the company to understand how data is created and used, identify gaps, and guide solutions. You’re the connective tissue between product engineering, architecture, analytics, and the business stakeholders who depend on our data.

Platform Architecture. You partner with our DevX and architecture teams on the boundary between product engineering services and Snowflake, including leading efforts to automate schema propagation so changes upstream flow cleanly into the warehouse without manual intervention.

Data Quality. You build models, tests, and processes that anticipate malformed data and upstream changes, making our pipelines boring to operate.

Observability & Reliability. You instrument what you own, define meaningful SLOs and data quality checks, and participate in our rotating on-call schedule (light volume, mostly responding to issues as they come in).

Ingestion & Integration Jobs. You own and extend our Python jobs running on Glue, Lambda, and Step Functions — primarily ingesting data from third-party APIs, with a smaller set that pushes data out to downstream systems. The infrastructure for these jobs is managed in AWS CDK.

Mentorship & Technical Leadership. You pair with more junior engineers on real work, raise the bar on PR and architecture reviews, and define the patterns and standards the team writes against. You bring a systems-thinking lens and clear communication to every conversation, connecting what’s happening upstream in product engineering to what stakeholders need downstream.

Qualifications:

  • 10+ years in Data Engineering, Analytics Engineering, or related fields, operating at a Principal or equivalent level.
  • Deep relational database architecture and data modeling expertise.
  • Expert-level Snowflake and SQL, with experience owning a warehouse at scale.
  • Strong analytics engineering experience, ideally with dbt.
  • Solid hands-on Python, with experience building data jobs on AWS Glue, Lambda, and Step Functions, and managing that infrastructure in AWS CDK.
  • Experience integrating with third-party APIs in both directions, including rate limits, retries, authentication, and idempotency.
  • Track record of building observable, reliable data systems.
  • Demonstrated technical leadership and mentorship with strong communication, systems thinking, and a track record of engaging stakeholders across an organization to drive cross-functional outcomes.

Expected Pay Range: $220,000 - $250,000 per year salaried*

* The target base salary range for this position is listed above. Individual salaries are determined based on a number of factors including, but not limited to, job-related knowledge, skills and experience.

Life at Extend:

  • Working with a great team from diverse backgrounds in a collaborative and supportive environment.
  • Competitive salary based on experience, with full medical and dental & vision benefits.
  • Stock in an early-stage startup growing quickly.
  • Generous, flexible paid time off policy.
  • 401(k) with Financial Guidance from Morgan Stanley.

Extend CCPA HR Notice

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Salary

Salary: 220k-250k USD

Education

Bachelor degree

Experience

10 years minimum

Location requirements

Hiring timezones

United States +/- 0 hours

About Extend

Learn more about Extend and their company culture.

View company profile

Extend was founded in 2023 by Eli Badgio and Kushal Byatnal, former engineers at Brex and Stir, who recognized that document processing was a persistent bottleneck for modern enterprises. While traditional OCR solutions plateaued at around 80% accuracy and required months of complex integration, the founders envisioned a platform that could leverage the power of Large Language Models (LLMs) to achieve near-perfect results. What started as a mission to 'transform how the world works with unstructured data' has evolved into the 'Modern Document Processing Cloud,' a comprehensive platform that unifies parsing, extraction, and orchestration.

Today, Extend empowers technical teams to build mission-critical document pipelines with greater than 95% accuracy out of the box. By combining state-of-the-art vision models with a developer-first API, the company has become the 'Stripe for documents,' trusted by industry leaders like Brex, Chime, and Flatiron Health to handle their most complex workflows. With $17 million in recent funding from Innovation Endeavors and Y Combinator, Extend is rapidly scaling its engineering team in New York City to further revolutionize how businesses unlock the value of their data.

Claim this profileExtend logoEX

Extend

Company size

11-50 employees

Founded in

2023

Chief executive officer

Kushal Byatnal, Eli Badgio

Employees live in

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

12 remote jobs at Extend

Explore the variety of open remote roles at Extend, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Extend

Remote companies like Extend

Find your next opportunity by exploring profiles of companies that are similar to Extend. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan