Software Engineer - Data Infrastructure & Acquisition

Speechify · Phoenix, AZ, USA
Remote Full-time $140K/yr - $200K/yr

Experience Level

Mid to Senior

Qualifications

Ideal Candidate Profile

  • BS, MS, or PhD in Computer Science or a related discipline
  • At least 5 years of professional experience in software development
  • Proficiency in bash/Python scripting in Linux environments
  • Experience with Docker and Infrastructure-as-Code, plus professional expertise in at least one major cloud platform (we use GCP)
  • Familiarity with web crawlers and large-scale data processing workflows is a plus
  • Ability to manage multiple priorities and adapt to changing demands
  • Strong written and verbal communication skills

About the job

Speechify builds tools that turn reading into an accessible audio experience. Over 50 million people use our text-to-speech products to listen to PDFs, books, Google Docs, news articles, and websites. Our goal: help people read faster, retain more, and remove barriers to learning.

Our products span iOS, Android, Mac, Chrome Extension, and Web App. We’ve been named Chrome Extension of the Year by Google and received Apple’s Design Award for Inclusivity in 2025.

The Speechify team is fully remote, with nearly 200 professionals worldwide. Our group includes frontend and backend engineers, AI research scientists, and experts from Amazon, Microsoft, Google, top PhD programs, and high-growth startups.

Role overview

We’re hiring a Software Engineer for the Data team within our AI department. This position focuses on all aspects of data collection that drive model training. The work blends infrastructure, engineering, and research to build large-scale, high-quality datasets efficiently and cost-effectively.

What you will do

  • Find and acquire new audio data sources to expand our ingestion pipeline
  • Manage and improve cloud infrastructure for data ingestion, currently on GCP and managed with Terraform
  • Work closely with Scientists to improve cost, throughput, and quality metrics, delivering large-scale datasets for next-generation models
  • Support the AI Team’s roadmap for datasets powering future Speechify consumer and enterprise products

Location

Phoenix, AZ, USA (fully distributed team)

About Speechify

Speechify is dedicated to making reading accessible for everyone, helping over 50 million individuals enhance their reading experience through innovative text-to-speech solutions. Our diverse and talented team works remotely from various locations, focusing on creating high-quality audio experiences for users worldwide.
