Qualifications
Key ResponsibilitiesExplore and identify new audio data sources to integrate into our ingestion pipeline. Manage and extend our cloud infrastructure on GCP, utilizing Terraform for deployment. Work collaboratively with our data scientists to optimize cost, throughput, and quality, ensuring we deliver rich datasets at scale. Partner with the AI Team and Speechify Leadership to develop a strategic dataset roadmap that supports our next-generation products. Ideal Candidate ProfileAdvanced degree (BS/MS/PhD) in Computer Science or a closely related field. Minimum of 5 years of professional experience in software development. Strong programming skills in bash/Python, particularly within Linux environments. Proficient in Docker and Infrastructure-as-Code methodologies, with hands-on experience with a major cloud provider, preferably GCP. Experience with web crawlers and large-scale data processing workflows is advantageous. Exceptional multitasking abilities and adaptability to shifting priorities. Excellent verbal and written communication skills.
About the job
Speechify’s mission centers on removing barriers to learning by changing how people interact with written content. Over 50 million users depend on our text-to-speech tools across iOS, Android, Mac, Chrome, and web platforms. Google named us Chrome Extension of the Year, and Apple recognized our work with the 2025 Design Award for Inclusivity.
Our distributed team includes nearly 200 professionals from a range of backgrounds, including leading technology companies and academic institutions. We value diverse perspectives and support remote work, believing strong ideas can come from anywhere.
Role Overview
The Software Engineer - Data Infrastructure & Acquisition will join a team focused on building and maintaining large-scale data systems. This role plays a key part in gathering and processing data to power advanced model training. The team’s infrastructure supports petabyte-scale dataset construction and integrates engineering, infrastructure, and research to keep costs low while maintaining quality.
About Speechify
Speechify is dedicated to removing barriers to learning through innovative text-to-speech technology. With millions of users worldwide, our products enhance the reading experience across various formats and platforms. Our commitment to inclusivity and accessibility has earned us significant accolades, and our entirely remote workforce is a testament to our belief in the power of diverse perspectives.