About the job
Speechify’s mission is to remove barriers to learning using technology. With more than 50 million users, our text-to-speech tools convert everything from PDFs and books to Google Docs and news articles into audio, helping people read faster, remember more, and interact with content in new ways. Our products are available on iOS, Android, Mac, and the web, and as a Chrome extension. We’ve been named Chrome Extension of the Year by Google and received Apple’s 2025 Design Award for Inclusivity.
Our team is fully remote and international, with nearly 200 people. Colleagues include frontend and backend engineers, AI research scientists, and specialists from companies like Amazon, Microsoft, and Google. Many team members hold advanced degrees from universities such as Stanford, have worked at high-growth startups like Stripe and Vercel, or have founded companies of their own.
Role overview
Speechify is hiring a Software Engineer to strengthen our AI team, focusing on data infrastructure and acquisition. The work centers on building and maintaining large-scale, high-quality datasets to support model training. This position blends engineering, infrastructure, and research to support petabyte-scale data operations.
What you will do
- Find and connect new audio data sources to our ingestion pipeline.
- Oversee and improve our cloud infrastructure on Google Cloud Platform, using Terraform.
- Work with data scientists to balance data cost, throughput, and quality, helping to improve our models.
- Collaborate with AI team members and leadership to plan the dataset roadmap for future consumer and enterprise products.
Location
This role is open to candidates based in Brno, Czech Republic. The team works fully remotely, and there is no physical office.

