About the job
Speechify builds tools that remove reading barriers for millions of users. Our text-to-speech products convert content like PDFs, books, Google Docs, news articles, and websites into audio, helping people read faster and understand more. With over 50 million users and recognition from Google and Apple, including Chrome Extension of the Year and a 2025 Design Award for Inclusivity, we focus on making information more accessible.
The company operates as a fully distributed team of nearly 200 people. Team members come from leading tech companies such as Amazon, Microsoft, and Google, as well as top academic institutions like Stanford. Together, we advance AI and engineering to support our mission.
Role overview
Speechify is hiring a Software Engineer for the AI team in Galway, Ireland, with a focus on data infrastructure and acquisition. This role centers on collecting and managing large-scale datasets that support model training. The work involves building and integrating infrastructure to handle data efficiently and cost-effectively.
What you will do
- Identify and source new audio data to strengthen the data ingestion pipeline.
- Manage and expand cloud infrastructure for data ingestion, currently using GCP and Terraform.
- Work with data scientists to optimize cost, throughput, and data quality, ensuring strong datasets for new models.
- Partner with the AI team and company leadership to shape a strategic dataset roadmap for future consumer and enterprise products.

