companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifyHyderabad, India
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Ideal Candidate QualificationsBS/MS/PhD in Computer Science or a related discipline. A minimum of 5 years of professional experience in software development. Strong proficiency in bash/Python scripting within Linux environments. Expertise in Docker and Infrastructure-as-Code, with professional experience using a major Cloud Provider (GCP preferred). Experience with web crawlers and large-scale data processing workflows is a plus. Ability to manage multiple tasks in a dynamic environment while adapting to changing priorities. Excellent verbal and written communication skills.

About the job

Speechify builds text-to-speech tools that help over 50 million people turn written content into audio. From PDFs and books to news articles and websites, our products make reading more accessible and efficient. Our apps span iOS, Android, Mac, Chrome, and web, earning recognition such as Chrome Extension of the Year from Google and the 2025 Apple Design Award for Inclusivity.

Our team of nearly 200 works remotely across the globe. We bring together frontend and backend engineers, AI researchers, and specialists from companies like Amazon, Microsoft, and Google, as well as alumni of top PhD programs and startups.

Role Overview

Speechify is hiring a Software Engineer for the AI data division in Hyderabad, India. This role focuses on all aspects of data collection that power our model training. The work centers on building and maintaining large-scale, high-quality datasets, integrating engineering, infrastructure, and research to do so efficiently.

What You Will Do

  • Identify and bring in new audio data sources to expand our ingestion pipeline.
  • Manage and improve the cloud infrastructure supporting the ingestion pipeline (currently on GCP, configured with Terraform).
  • Partner with scientists to improve dataset cost, throughput, and quality for advanced model development.
  • Work with AI team members and company leadership to shape the strategic roadmap for datasets used in future consumer and enterprise products.

About Speechify

Speechify is dedicated to ensuring that reading is never a barrier to learning. With millions of users worldwide, our text-to-speech products allow individuals to read efficiently and effectively. Our remote team of nearly 200 professionals, including experts from top tech companies and academic institutions, collaborates to innovate and enhance our offerings.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.