HimalayasHimalayas logo
CartesiaCA

Global Data ML Engineer for Multilingual Speech & AI

Cartesia is an AI research company building the next generation of real-time, multimodal foundation models using their pioneering State Space Model (SSM) architecture.

Cartesia

Employee count: 51-200

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

A leading technology company in San Francisco is seeking a Machine Learning Engineer to ensure the quality and coverage of data across diverse languages. You will design large-scale datasets, evaluate models, and implement quality control systems. The ideal candidate has expertise in multilingual datasets and a strong background in applied ML. This full-time role offers competitive benefits, including fully covered insurance and in-office perks, in a supportive team environment.
#J-18808-Ljbffr

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Location requirements

Hiring timezones

United States +/- 0 hours

About Cartesia

Learn more about Cartesia and their company culture.

View company profile

We are Cartesia, a company born out of the Stanford AI Lab with a mission to build the next generation of artificial intelligence. Our focus is on creating ubiquitous, interactive intelligence that can run anywhere you are, on any device. We believe that today's foundation models, while powerful, fall short of human intelligence. They are often slow, computationally expensive, and their development is restricted to only the largest companies. We're here to change that. We are pioneering a new path forward with State Space Models (SSMs), a groundbreaking architecture our founding team invented. This new approach allows for the creation of AI models that are not only higher quality but also significantly more efficient.

Our journey began with a deep-seated belief that a phase shift was needed in how we approach model architectures and machine learning. SSMs are the result of that belief. Unlike traditional transformer models, SSMs can process information in real-time, handle long sequences of data, and operate with much lower latency and cost. This efficiency makes it possible to run powerful AI directly on devices, ensuring privacy and responsiveness. Our first major product, Sonic, is a testament to the power of SSMs. It's the world's fastest and most ultra-realistic text-to-speech model, capable of generating lifelike audio with incredibly low latency. We're not stopping at audio, though. Our long-term vision is to develop multimodal AI models that can seamlessly process and understand text, audio, video, and images, transforming industries from healthcare and robotics to gaming and beyond. We're driven by the challenge of building AI that is as natural and intuitive as human interaction, making it accessible and useful for everyone.

Claim this profileCartesia logoCA

Cartesia

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

Remote companies like Cartesia

Find your next opportunity by exploring profiles of companies that are similar to Cartesia. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan