companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifyHaifa, Israel
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Ideal Candidate ProfileA degree in Computer Science or a related field (BS/MS/PhD).5+ years of professional experience in software development. Proficiency in bash/Python scripting within Linux environments. Experience with Docker and Infrastructure-as-Code, with a strong background in at least one major cloud provider (GCP preferred). Familiarity with web crawlers and large-scale data processing workflows is advantageous. Ability to manage multiple tasks and adapt to evolving priorities. Excellent communication skills, both written and verbal.

About the job

Speechify builds text-to-speech tools that help over 50 million users turn reading materials, such as PDFs, books, Google Docs, news articles, and websites, into audio. Our products span iOS, Android, Mac, Chrome, and web platforms. Recent honors include Google's Chrome Extension of the Year and Apple's 2025 Design Award for Inclusivity.

The company operates fully remotely, bringing together nearly 200 professionals worldwide. The team includes engineers, AI researchers, and specialists with backgrounds at Amazon, Microsoft, Google, top universities, and successful startups.

Role Overview

Speechify is hiring a Software Engineer focused on data infrastructure and acquisition for the AI team. This position centers on building and maintaining the systems that collect the large-scale datasets needed to train our models. The team has developed infrastructure capable of handling petabyte-scale data efficiently and cost-effectively.

What You Will Do

  • Identify and source new audio data for the ingestion pipeline.
  • Manage and improve cloud infrastructure on Google Cloud Platform (GCP) using Terraform for configuration management.
  • Collaborate with scientists to optimize cost, throughput, and data quality, supporting the development of richer datasets for next-generation models.
  • Work with the AI team and company leadership to shape the dataset roadmap for both consumer and enterprise products.

Location

This role is based in Haifa, Israel. The team works fully distributed with no physical office.

About Speechify

Speechify is dedicated to transforming the way people interact with written content through advanced text-to-speech technology. Our users can convert any written material into audio, making learning more accessible and efficient. Join us to be part of an innovative team that’s making a significant impact in the field of education and accessibility.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.