companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifyDublin, Ireland
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

QualificationsBS, MS, or PhD in Computer Science or related field.5+ years of experience in software development. Strong proficiency in bash/Python scripting within Linux environments. Expertise in Docker and Infrastructure-as-Code practices; experience with at least one major Cloud Provider (we utilize GCP). Experience with web crawlers and large-scale data processing workflows is a plus. Ability to manage multiple tasks and adapt to shifting priorities effectively. Excellent communication skills, both written and verbal.

About the job

Speechify builds text-to-speech tools that help over 50 million people convert written content, PDFs, books, Google Docs, news, and websites, into audio. Our products span iOS, Android, Mac, Chrome, and web. Recent recognition includes Chrome Extension of the Year from Google and Apple’s 2025 Design Award for Inclusivity.

The company operates fully remote, with nearly 200 team members worldwide. The team includes frontend and backend engineers, AI researchers, and alumni from Stanford, Amazon, Microsoft, Google, Stripe, Vercel, and Bolt.

Role overview

This Software Engineer position sits within the AI team and focuses on data infrastructure and acquisition. The work centers on optimizing data collection and building large-scale, high-quality datasets to support model training. The team combines infrastructure, engineering, and research to deliver petabyte-scale data pipelines.

What you will do

  • Identify and integrate new audio data sources into the ingestion pipeline
  • Manage and extend cloud infrastructure on Google Cloud Platform (GCP) using Terraform
  • Collaborate with scientists to improve data quality, throughput, and cost-effectiveness for next-generation models
  • Work with the AI team and company leadership to shape the strategic roadmap for datasets powering Speechify’s consumer and enterprise products

Location

This role is based in Dublin, Ireland. The team works fully remotely.

About Speechify

Speechify is dedicated to transforming the way individuals read and learn through advanced technology. Our award-winning products are designed to ensure that reading is never a barrier for anyone. Join our dynamic and diverse team, where innovation and inclusivity drive our mission.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.