companySieve logo

Distributed Systems Engineer

SieveSan Francisco
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

QualificationsMinimum of 3 years of experience in building foundational data infrastructure. Proficient in working with various cloud architectures. Experience in designing and maintaining data pipelines that process petabytes of data. Expertise in developing robust CI/CD pipelines specifically for machine learning teams. Strong programming skills in Go and Python; familiarity with Rust is a plus. Act as an individual contributor who leads by example. Experience in working with large-scale video data systems. Willingness to work in-person at our San Francisco headquarters.

About the job

About Us

Sieve is a pioneering AI research lab dedicated solely to video data. We harness exabyte-scale video infrastructure and innovative video understanding techniques, along with a multitude of data sources, to create datasets that advance the field of video modeling. Given that video constitutes 80% of internet traffic, it serves as a vital medium that fuels creativity, communication, gaming, AR/VR, and robotics. Our mission is to tackle the most significant challenge in the development of these applications: acquiring high-quality training data.

With a small yet highly skilled team of just 15 members, we have formed strategic partnerships with leading AI labs and achieved $XXM in revenue last quarter alone. Our Series A funding round last year was backed by prestigious firms, including Matrix Partners, Swift Ventures, Y Combinator, and AI Grant.

About the Role

As a Distributed Systems Engineer at Sieve, you will be responsible for designing and implementing systems that efficiently manage the compute, scheduling, and orchestration of complex machine learning and ETL pipelines. Your work will ensure these systems operate quickly, reliably, and cost-effectively while processing large volumes of video data.

You will thrive in this role if you are passionate about optimizing system uptime, have experience with cloud technologies, and enjoy working with high-performance distributed systems involving thousands of GPUs. Additionally, you will play a key role in developing excellent internal tools and CI/CD pipelines to facilitate rapid iteration.

About Sieve

Sieve is an innovative AI research lab focused on video data, dedicated to overcoming the challenges in acquiring high-quality training data to enhance various digital applications. Our team leverages cutting-edge technology and strategic partnerships to drive advancements in video understanding.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.