Nebius logoNebius logo

HPC System Engineer

NebiusAmsterdam, Netherlands
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Experience

Qualifications

Strong proficiency in Unix/Linux, as well as experience with Python

About the job

Why Choose Nebius?
Nebius is at the forefront of a transformative era in cloud computing, dedicated to empowering the global AI economy. We provide our clients with innovative tools and resources to tackle real-world challenges and revolutionize their industries, all while minimizing infrastructure costs and eliminating the need for extensive in-house AI/ML teams. Our team operates at the cutting edge of AI cloud infrastructure, collaborating with some of the most seasoned and imaginative leaders and engineers in the industry.

Our Work Environment
Based in Amsterdam and publicly listed on Nasdaq, Nebius boasts a global presence with R&D centers across Europe, North America, and Israel. Our workforce of over 1400 employees includes more than 400 highly skilled engineers possessing deep expertise in both hardware and software engineering, complemented by an in-house AI R&D team.

The Opportunity

We are in search of a talented HPC System Engineer to join our dynamic team, focusing on the benchmarking of GPU platforms for machine learning and AI workloads. In this pivotal role, you will assess the performance of GPU-based hardware across various deep learning and AI frameworks, facilitating data-driven decisions for optimizing platforms and guiding the development of next-generation hardware.

 

Your Responsibilities Include:

  • Collaborating closely with hardware and development teams to profile and analyze GPU performance at both the system and kernel levels.
  • Evaluating and benchmarking GPU performance across diverse platforms, architectures, and software stacks (such as CUDA and ROCm).
  • Conducting acceptance testing for new GPU clusters, ensuring hardware and software meet the necessary performance, stability, and compatibility standards for AI workloads.
  • Executing experiments with various GPU system configurations to evaluate the effects of different interconnect strategies and system-level optimizations on performance and scalability.

 

We Are Looking For Candidates Who Have:

  • Strong proficiency in Unix/Linux, as well as experience with Python...

About Nebius

Nebius is revolutionizing cloud computing for the AI economy, providing essential tools and resources for businesses to innovate without incurring high infrastructure costs. Our global presence and talented team drive advancements in AI cloud infrastructure.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.