companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifyCalgary, Canada
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Ideal Candidate QualificationsBachelor’s, Master’s, or PhD in Computer Science or a related discipline.5+ years of proven experience in software development. Strong proficiency in bash and Python scripting within Linux environments. Expertise in Docker and Infrastructure-as-Code, with professional experience with at least one major cloud provider (preferably GCP). Experience in web crawling and large-scale data processing workflows is advantageous. Ability to manage multiple priorities and adapt to dynamic work environments. Excellent communication skills, both written and verbal.

About the job

Speechify helps over 50 million people transform how they read and learn. Our text-to-speech tools turn PDFs, books, Google Docs, news articles, and websites into audio, making information more accessible and easier to retain. The product lineup spans iOS, Android, Mac, a Chrome extension, and a web app. Our work has earned recognition from Google as Chrome Extension of the Year and Apple’s 2025 Design Award for Inclusivity.

The Speechify team includes nearly 200 professionals working remotely from around the world. Team members bring experience from Amazon, Microsoft, Google, and top universities such as Stanford. Our group includes frontend and backend engineers, AI researchers, and specialists in a range of fields.

Role Overview: Software Engineer - Data Infrastructure & Acquisition

This role sits within the AI team and centers on building and managing the data infrastructure that powers model training. The engineer in this position will focus on collecting and acquiring large-scale audio datasets, integrating engineering and research to support Speechify’s future models.

What You Will Do

  • Identify and secure new audio data sources to strengthen the data ingestion pipeline.
  • Manage and expand cloud infrastructure on Google Cloud Platform (GCP), using Terraform.
  • Collaborate with scientists to improve data cost, throughput, and quality for next-generation models.
  • Support the AI team’s dataset roadmap, helping to advance both consumer and enterprise products.

Location

This is a remote role based in Calgary, Canada.

About Speechify

Speechify is dedicated to transforming the way individuals engage with text, fostering inclusivity and accessibility in learning. Our innovative products have made significant impacts across various demographics, and our diverse, global team is committed to utilizing technology to enhance reading experiences.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.