AI Infrastructure Engineer

Speechify is an AI-powered text-to-speech application that converts text from various formats into natural-sounding audio, helping users read faster and comprehend more. Founded by Cliff Weitzman to overcome his dyslexia, the platform aims to make reading accessible to everyone.

Speechify

Employee count: 51-200

Mission

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading (PDFs, books, Google Docs, news articles, websites) into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android app, Mac app, Chrome extension, and web app. Google recently named Speechify the Chrome Extension of the Year, and Apple named Speechify its 2025 Design Award winner for Inclusivity.

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting; Speechify has no office. They include frontend and backend engineers, AI research scientists, and others who come from Amazon, Microsoft, and Google, from leading PhD programs like Stanford, and from high-growth startups like Stripe, Vercel, and Bolt. Many have founded their own companies.

Overview

We are looking for an AI Infrastructure Engineer to help build and scale the infrastructure that powers our machine learning initiatives. In this role, you will design, develop, and optimize the core platforms and services that enable data scientists and ML engineers to train, deploy, and monitor models efficiently. You’ll partner closely with Data Science, Data Engineering, and Product teams to create a robust, self-service ML ecosystem that accelerates innovation.

What You’ll Do

Build and Scale AI Infrastructure: Design, implement, and maintain high-performance ML training and inference platforms.
Develop MLOps Tools: Ship tools that allow any ML engineer to deploy a model in minutes, not days.
Optimize Performance: Improve scalability, reliability, and cost efficiency of model training and serving systems.
Collaborate Across Teams: Partner with researchers to turn experimental voice models into production-ready systems.
Ensure Best Practices: Establish standards for model versioning, testing, monitoring, and governance.
Drive Automation: Automate data/model pipelines to reduce manual intervention and speed up experimentation.

Experience: 3+ years in Software Engineering or ML Platform/Infrastructure roles, with a focus on distributed systems, cloud services, or MLOps.
Technical Expertise: Proficiency in Python (or similar), containerization and orchestration (Docker, Kubernetes), and CI/CD pipelines.
Strong knowledge of cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code tools (Terraform, CloudFormation).
Experience with ML frameworks (TensorFlow, PyTorch, or similar) and orchestration tools (Kubeflow, Airflow, MLflow).
Deep understanding of data pipelines, model deployment, real-time inference systems, and the reliability of AI systems.
Strong communication skills and the ability to work across engineering and data science teams.
Hands-on experience with CI/CD and model serving.

Nice to Have

Experience with feature stores, vector databases, or large-scale model training.
Familiarity with streaming data technologies (Kafka, Spark Streaming, Flink).
Knowledge of monitoring/observability tools (Prometheus, Grafana, Datadog).
Contributions to open-source ML/MLOps projects.
Experience with GPU optimization (TensorRT, ONNX, vLLM, Triton).
Experience with low-latency audio/streaming systems.

What we offer

A dynamic environment where your contributions shape the company and its products
A team that values innovation, intuition, and drive
Autonomy, fostering focus and creativity
The opportunity to have a significant impact in a revolutionary industry
Competitive compensation, a welcoming atmosphere, and a commitment to an exceptional asynchronous work culture
The privilege of working on a product that changes lives, particularly for those with learning differences like dyslexia, ADD, and more
An active role at the intersection of artificial intelligence and audio – a rapidly evolving tech domain

Think you’re a good fit for this job?

Tell us more about yourself and why you're interested in the role when you apply.
And don’t forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit?

Refer them!

Speechify is committed to a diverse and inclusive workplace.

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

About the job

Apply before: Feb 23, 2026
Posted on: Dec 25, 2025
Job type: Full Time
Experience level: Mid-level
Location requirements: Open to candidates from all countries.
Hiring timezones: Worldwide

About Speechify

At Speechify, we are at the forefront of revolutionizing how individuals consume written content through groundbreaking text-to-speech technology. Our mission is to ensure that reading is never a barrier to learning, and we achieve this by transforming digital and physical text from PDFs, books, documents, articles, and websites into high-quality, natural-sounding audio. This innovation allows users to read faster, retain more information, and access content on the go, effectively turning any reading material into an audiobook. Speechify was born out of a personal need; our founder, Cliff Weitzman, developed the initial technology to overcome his own challenges with dyslexia. This origin story is deeply embedded in our company ethos, driving our commitment to accessibility and empowering users with diverse learning needs, including those with dyslexia, ADHD, and visual impairments.

Our technological advancements extend beyond basic text-to-speech. Speechify leverages sophisticated AI to offer a suite of features including more than 200 human-like voices in more than 60 languages, customizable reading speeds up to 4.5 times faster than visual reading, and optical character recognition (OCR) for scanning and listening to physical documents and images. We provide a seamless cross-platform experience with apps for iOS and Android, browser extensions for Chrome and Safari, and a web app, ensuring that users can sync their libraries and listen anywhere, anytime. Speechify is not just a tool for individuals; we also offer an AI-powered voice-over, voice cloning, and dubbing studio for businesses and creators, alongside a text-to-speech API for developers, positioning us as a leading provider of Speech AI globally. Our continuous innovation is reflected in features like active text highlighting, inline players, AI-driven summarization, and integrations with platforms like Gmail and Canvas, which keep enhancing the user experience and pushing the boundaries of audio-based learning and productivity.

Tech stack

Python

Kotlin

Swift

TypeScript

React
