About the job
About Us:
At Wynd Labs, we specialize in creating cutting-edge infrastructure that facilitates the delivery of vast amounts of web data to organizations that are training the world’s most advanced AI models.
Our innovative team is at the forefront of powering Grass, a bandwidth-sharing network that enables the operation of a large-scale distributed web crawler. This unique capability grants us unparalleled access to high-quality public web data on a global scale. Additionally, we have developed sophisticated pipelines for ingesting, segmenting, and annotating billions of videos, transcripts, and audio files, which are essential for dataset creation in pioneering laboratories.
We pride ourselves on being a nimble, technically adept team that prioritizes rapid decision-making and execution; we are builders dedicated to expanding the possibilities of open web data and artificial intelligence.
The Opportunity:
We are on the lookout for a talented Data Engineer who has a strong background in constructing and maintaining robust data pipelines, as well as integrating scalable infrastructure. Joining our small, skilled team means you will be instrumental in designing and optimizing our data systems, ensuring efficient data flow and accessibility. Your efforts will directly contribute to our mission of establishing Grass as a pivotal player in the future of data-driven innovation on the internet.

