Senior Runtime Engineer

Cerebras Systems
Sunnyvale, CA or Toronto, Canada
On-site, Full-time

Experience Level

Senior

Qualifications

Proven experience designing and implementing high-performance distributed systems. Strong understanding of concurrency, throughput, and scalability in software development. Proficiency in working with heterogeneous clusters and data pipelines. Experience with machine learning frameworks and models. Excellent problem-solving skills and the ability to work collaboratively.

About the job

Cerebras Systems is at the forefront of AI technology, developing the largest AI chip in the world, which is 56 times larger than traditional GPUs. Our innovative wafer-scale architecture delivers the computational power of dozens of GPUs on a single chip, while ensuring programming is as simple as working with a single device. This revolutionary approach enables Cerebras to provide unmatched training and inference speeds, facilitating seamless execution of large-scale machine learning applications without the complexities of managing multiple GPUs or TPUs.

Cerebras proudly serves a diverse clientele, including leading model labs, global corporations, and pioneering AI startups. Recently, OpenAI announced a multi-year collaboration with Cerebras, aiming to harness 750 megawatts of power for transformative workloads through ultra-high-speed inference.

Our groundbreaking wafer-scale architecture allows Cerebras Inference to offer the fastest Generative AI inference solution in the world, running more than ten times faster than GPU-based hyperscale cloud services. This increase in speed is reshaping the user experience for AI applications, enabling real-time iteration and amplifying intelligence through advanced agentic computation.

About The Role

Join us in constructing the next generation of large-scale AI systems designed to handle training and inference workloads with unparalleled efficiency and scale. As a Senior Runtime Engineer, you will be responsible for architecting and developing high-performance distributed software that orchestrates extensive compute and data pipelines across diverse clusters. Your contributions will push the boundaries of concurrency, throughput, and scalability, facilitating the effective execution of models on a massive scale. This position sits at the confluence of systems engineering and machine learning performance, requiring both deep architectural insight and practical low-level implementation capabilities. You will play a crucial role in optimizing how models are executed and fine-tuned from data ingestion through to distributed execution across cutting-edge hardware platforms. We are actively recruiting for runtime roles in both Training and Inference.

About Cerebras Systems

Cerebras Systems is a pioneering company specializing in AI hardware, known for creating the world's largest AI chip. Our advanced technology enables rapid and efficient execution of machine learning tasks, making us a key player in the AI landscape.
