Qualifications
Key Responsibilities:Proactively source new audio data and integrate it into our ingestion pipeline. Manage and expand our cloud infrastructure for the ingestion pipeline, currently hosted on GCP with Terraform. Work closely with our Scientists to enhance the cost, throughput, and quality of our data, facilitating the development of our next-generation models. Collaborate with the AI Team and Speechify Leadership to develop a dataset roadmap that supports our consumer and enterprise products. Qualifications:BS/MS/PhD in Computer Science or a related field.5+ years of software development experience. Strong proficiency in bash/Python scripting within Linux environments. Experience with Docker and Infrastructure-as-Code principles, as well as professional experience with a major Cloud Provider (GCP preferred). Familiarity with web crawlers and large-scale data processing workflows is a plus. Adept at juggling multiple tasks and adjusting to changing priorities. Excellent written and verbal communication skills.
About the job
Speechify builds tools that turn written content, PDFs, books, Google Docs, news articles, and websites, into audio. Over 50 million people use our text-to-speech products to read faster, retain more, and make learning accessible. Our product suite spans iOS, Android, Mac, Chrome, and the web. Recent recognition includes Chrome Extension of the Year from Google and a 2025 Apple Design Award for Inclusivity.
Nearly 200 people work at Speechify, collaborating remotely from around the world. Our team includes engineers and researchers from Amazon, Microsoft, Google, Stripe, Vercel, and Stanford, among others.
Role Overview
Speechify is hiring a Software Engineer for the AI team, with a focus on data infrastructure and acquisition. This engineer will manage the systems and processes that collect and organize data for training our models. The work combines infrastructure, engineering, and research to create large-scale, high-quality datasets efficiently, at petabyte scale.
Location
Boston, MA, USA
About Speechify
Speechify is dedicated to breaking down barriers in reading through innovative text-to-speech technology. By converting various reading materials into audio, we empower millions to improve their reading speed, comprehension, and retention. Our commitment to inclusivity and accessibility in technology has earned us prestigious awards and recognition within the industry.