Himalayas logo
SpeechifySP

AI Infrastructure Engineer

Speechify is an AI-powered text-to-speech application that converts text from various formats into natural-sounding audio, helping users read faster and comprehend more. Founded by Cliff Weitzman to overcome his dyslexia, the platform aims to make reading accessible to everyone.

Speechify

Employee count: 51-200

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Mission

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We are looking for AI Infrastructure engineer to help build and scale the infrastructure that powers our machine learning initiatives. In this role, you will design, develop, and optimize the core platforms and services that enable data scientists and ML engineers to train, deploy, and monitor models efficiently. You’ll partner closely with Data Science, Data Engineering, and Product teams to create a robust, self-service ML ecosystem that accelerates innovation.

What You’ll Do

Build Scale AI Infrastructure: Design, implement, and maintain high-performance ML training and inference platforms. Develop MLOps Tools: Ship tools that allow any ML engineer to deploy a model in minutes, not days. Optimize Performance: Improve scalability, reliability, and cost efficiency of model training and serving systems. Collaborate Across Teams: Partner with researchers to turn experimental voice models into production-ready systems. Ensure Best Practices: Establish standards for model versioning, testing, monitoring, and governance. Drive Automation: Automate data/model pipelines to reduce manual intervention and speed up experimentation.

  • Experience: 3+ years in Software Engineering or ML Platform/Infrastructure roles, with a focus on distributed systems, cloud services, or MLOps.
  • Technical Expertise: Proficiency in Python (or similar), containerization (Docker, Kubernetes), CI/CD pipelines, Kubernetes, Cloud proficiency
  • Strong knowledge of cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code tools (Terraform, CloudFormation).
  • Experience with ML frameworks (TensorFlow, PyTorch, or similar) and orchestration tools (Kubeflow, Airflow, MLflow).
  • Deep understanding of data pipelines, model deployment, real-time inference systems, and reliability of AI systems
  • Strong communication skills and the ability to work across engineering and data science teams.
  • Hands-on with CI/CD and Model Serving

Nice to Have

  • Experience with feature stores, vector databases, or large-scale model training.
  • Familiarity with streaming data technologies (Kafka, Spark Streaming, Flink).
  • Knowledge of monitoring/observability tools (Prometheus, Grafana, Datadog).
  • Contributions to open-source ML/MLOps projects.
  • GPU optimization (TensorRT, ONNX, vLLM, Triton)
  • Experience with low-latency audio/streaming systems
  • Familiarity with vector DBs or feature stores

What we offer

  • A dynamic environment where your contributions shape the company and its products
  • A team that values innovation, intuition, and drive
  • Autonomy, fostering focus and creativity
  • The opportunity to have a significant impact in a revolutionary industry
  • Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture
  • The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more
  • An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain

Think you’re a good fit for this job?

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit?

Refer them!

Speechify is committed to a diverse and inclusive workplace.

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Mid-level

Location requirements

Open to candidates from all countries.

Hiring timezones

Worldwide

About Speechify

Learn more about Speechify and their company culture.

View company profile

At Speechify, we are at the forefront of revolutionizing how individuals consume written content through groundbreaking text-to-speech technology. Our mission is to ensure that reading is never a barrier to learning, and we achieve this by transforming digital and physical text from PDFs, books, documents, articles, and websites into high-quality, natural-sounding audio. This innovation allows users to read faster, retain more information, and access content on the go, effectively turning any reading material into an audiobook. Speechify was born out of a personal need; our founder, Cliff Weitzman, developed the initial technology to overcome his own challenges with dyslexia. This origin story is deeply embedded in our company ethos, driving our commitment to accessibility and empowering users with diverse learning needs, including those with dyslexia, ADHD, and visual impairments.

Our technological advancements extend beyond basic text-to-speech. Speechify leverages sophisticated AI to offer a suite of features including over 200+ human-like voices in more than 60 languages, customizable reading speeds up to 4.5 times faster than visual reading, and optical character recognition (OCR) for scanning and listening to physical documents and images. We provide a seamless cross-platform experience with apps for iOS and Android, browser extensions for Chrome and Safari, and a web app, ensuring that users can sync their libraries and listen anywhere, anytime. Speechify is not just a tool for individuals; we also offer an AI-powered voice-over, voice cloning, and dubbing studio for businesses and creators, alongside a text-to-speech API for developers, positioning us as a leading provider of Speech AI globally. Our continuous innovation is reflected in features like active text highlighting, inline players, AI-driven summarization, and integrations with platforms like Gmail and Canvas, constantly enhancing the user experience and pushing the boundaries of audio-based learning and productivity.

Claim this profileSpeechify logoSP

Speechify

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

36 remote jobs at Speechify

Explore the variety of open remote roles at Speechify, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Speechify

Remote companies like Speechify

Find your next opportunity by exploring profiles of companies that are similar to Speechify. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Speechify hiring AI Infrastructure Engineer • Remote (Work from Home) | Himalayas