About the job
Speechify builds text-to-speech tools that help over 50 million users turn reading materials, such as PDFs, books, Google Docs, news articles, and websites, into audio. Our products span iOS, Android, Mac, Chrome, and web platforms. Recent honors include Google's Chrome Extension of the Year and Apple's 2025 Design Award for Inclusivity.
The company operates fully remotely, bringing together nearly 200 professionals worldwide. The team includes engineers, AI researchers, and specialists with backgrounds at Amazon, Microsoft, Google, top universities, and successful startups.
Role Overview
Speechify is hiring a Software Engineer focused on data infrastructure and acquisition for the AI team. This position centers on building and maintaining the systems that collect the large-scale datasets needed to train our models. The team has developed infrastructure capable of handling petabyte-scale data efficiently and cost-effectively.
What You Will Do
- Identify and source new audio data for the ingestion pipeline.
- Manage and improve cloud infrastructure on Google Cloud Platform (GCP) using Terraform for configuration management.
- Collaborate with scientists to optimize cost, throughput, and data quality, supporting the development of richer datasets for next-generation models.
- Work with the AI team and company leadership to shape the dataset roadmap for both consumer and enterprise products.
Location
This role is based in Haifa, Israel. The team works fully distributed with no physical office.

