About the job
Speechify builds text-to-speech tools that help over 50 million people convert written content, PDFs, books, Google Docs, news, and websites, into audio. Our products span iOS, Android, Mac, Chrome, and web. Recent recognition includes Chrome Extension of the Year from Google and Apple’s 2025 Design Award for Inclusivity.
The company operates fully remote, with nearly 200 team members worldwide. The team includes frontend and backend engineers, AI researchers, and alumni from Stanford, Amazon, Microsoft, Google, Stripe, Vercel, and Bolt.
Role overview
This Software Engineer position sits within the AI team and focuses on data infrastructure and acquisition. The work centers on optimizing data collection and building large-scale, high-quality datasets to support model training. The team combines infrastructure, engineering, and research to deliver petabyte-scale data pipelines.
What you will do
- Identify and integrate new audio data sources into the ingestion pipeline
- Manage and extend cloud infrastructure on Google Cloud Platform (GCP) using Terraform
- Collaborate with scientists to improve data quality, throughput, and cost-effectiveness for next-generation models
- Work with the AI team and company leadership to shape the strategic roadmap for datasets powering Speechify’s consumer and enterprise products
Location
This role is based in Dublin, Ireland. The team works fully remotely.

