companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifyBoulder, CO, USA
Remote Full-time $140K/yr - $200K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Ideal Candidate Profile BS, MS, or PhD in Computer Science or a related discipline. 5+ years of professional experience in software development. Proficient in bash and Python scripting within Linux environments. Experience with Docker and Infrastructure-as-Code, particularly with GCP. Familiarity with web crawlers and large-scale data processing workflows is a plus. Able to manage multiple tasks and adapt to shifting priorities. Excellent written and verbal communication skills.

About the job

Speechify aims to remove reading barriers for learners worldwide. With more than 50 million users, our text-to-speech products turn everything from PDFs to news articles into audio, helping people read faster and remember more. Our iOS and Android apps, Mac app, Chrome extension, and web platform have received awards from Google and Apple for design and accessibility.

Our team includes nearly 200 people working fully remotely, drawing on experience from leading tech companies and top universities. Engineers, AI researchers, and product leaders collaborate closely to advance audio reading technology.

Role Overview

The Software Engineer - Data Infrastructure & Acquisition will join the AI team's data group. This role focuses on building and managing large-scale data collection systems that support model training. The work centers on developing high-quality datasets at petabyte scale using advanced infrastructure.

What You Will Do

  • Find and connect new audio data sources to the ingestion pipeline.
  • Maintain and grow cloud infrastructure on Google Cloud Platform (GCP) with Terraform.
  • Partner with data scientists to improve dataset cost, throughput, and quality for next-generation models.
  • Work with the AI team and company leaders to plan the dataset roadmap for both consumer and enterprise products.

About Speechify

Speechify is dedicated to transforming the way people interact with text. With a commitment to inclusivity and innovative technology, we empower users worldwide to overcome reading challenges through our extensive range of audio products. Our award-winning solutions cater to diverse needs, making learning accessible for everyone.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.