About the job
Speechify helps over 50 million people transform how they read and learn. Our text-to-speech tools turn PDFs, books, Google Docs, news articles, and websites into audio, making information more accessible and easier to retain. The product lineup spans iOS, Android, Mac, a Chrome extension, and a web app. Our work has been recognized by Google as Chrome Extension of the Year and by Apple with its 2025 Design Award for Inclusivity.
The Speechify team includes nearly 200 professionals working remotely from around the world. Team members bring experience from Amazon, Microsoft, Google, and top universities such as Stanford. Our group includes frontend and backend engineers, AI researchers, and specialists in a range of fields.
Role Overview: Software Engineer - Data Infrastructure & Acquisition
This role sits within the AI team and centers on building and managing the data infrastructure that powers model training. The engineer in this position will focus on acquiring large-scale audio datasets, working across engineering and research to support Speechify’s future models.
What You Will Do
- Identify and secure new audio data sources to strengthen the data ingestion pipeline.
- Manage and expand cloud infrastructure on Google Cloud Platform (GCP) using Terraform.
- Collaborate with scientists to improve data cost, throughput, and quality for next-generation models.
- Support the AI team’s dataset roadmap, helping to advance both consumer and enterprise products.
Location
This is a remote role based in Calgary, Canada.

