Nebius logoNebius logo

Senior Software Engineer - Token Factory

NebiusAmsterdam, Netherlands; Germany; Israel; Prague, Czech Republic; Remote - Europe; Remote - United States; United Kingdom
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

We Anticipate You Will Have:5+ years of professional software development experience. Strong software engineering skills, particularly in distributed systems. Experience with AI/ML technologies and frameworks.

About the job

Why choose Nebius?
Nebius is at the forefront of revolutionizing cloud computing, catering specifically to the burgeoning global AI economy. We empower our clients with state-of-the-art tools and resources designed to tackle real-world problems and drive industry transformation, all while minimizing infrastructure expenses and eliminating the requirement for extensive in-house AI/ML teams. Our workforce is engaged in pioneering advancements in AI cloud infrastructure, collaborating with some of the most talented and innovative leaders and engineers in the industry.

Our Work Environment
With our headquarters located in Amsterdam and a listing on Nasdaq, Nebius boasts a worldwide presence with R&D centers across Europe, North America, and Israel. Our diverse team of over 1,400 employees includes more than 400 highly skilled engineers who possess deep expertise in both hardware and software engineering, complemented by a dedicated in-house AI R&D unit.

The Position

This opportunity is within Nebius AI R&D, a team dedicated to applied research and the creation of AI-intensive products. Our recent applied research publications include:

  • Exploring how test-time guided search can enhance agent capabilities.
  • Significantly expanding task data collection to drive reinforcement learning for software engineering agents.
  • Optimizing efficiency in LLM training using agentic trajectories.

One of our key AI products is the Nebius Token Factory, an inference and fine-tuning platform for AI models.

This role demands proficiency in distributed systems for constructing a large-scale LLM training platform.

Your Responsibilities Will Include:

  • Design and develop the LLM training platform.
  • Oversee and maintain our ML infrastructure to ensure peak performance, scalability, and reliability.
  • Enhance job scheduling strategies to reduce resource fragmentation.

About Nebius

Nebius is pioneering the next generation of cloud computing tailored for the AI economy, providing businesses with essential tools and resources to innovate without incurring enormous costs.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes…

We'll move completed jobs to Ready to Apply automatically.