companyAndromeda Cluster logo

Software Engineer - AI Infrastructure

Andromeda ClusterNorth America Remote / San Francisco, CA
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

What We SeekWe are looking for innovative thinkers who are passionate about technology and eager to tackle challenging problems. Ideal candidates will possess:Strong proficiency in software engineering principles and practices. Experience with cloud computing and AI infrastructure. Familiarity with container orchestration tools such as Kubernetes. Excellent problem-solving skills and the ability to work effectively in teams. A commitment to maintaining high standards of code quality and documentation.

About the job

Join Our Team as a Software Engineer - AI Infrastructure

Location: North America Remote / San Francisco · Full-Time

At Andromeda Cluster, we are dedicated to democratizing access to advanced AI infrastructure that was once only available to hyperscalers. Founded by industry leaders Nat Friedman and Daniel Gross, we have evolved from a singular managed cluster to a global platform that connects top AI labs, data centers, and cloud providers around the world. Our orchestration layer efficiently manages training and inference tasks globally, enhancing flexibility and efficiency in this rapidly expanding sector. We aim to create a global marketplace for AI computing, empowering AGI with the same fluidity as global financial markets.

As we continue to grow, we are on the lookout for talented individuals in the fields of AI infrastructure, research, and engineering.

Your Role

In the position of Infrastructure Product Engineer, you will be integral in constructing the foundational framework of Andromeda’s platform. Your challenge will be to simplify complex, real-world infrastructure issues into scalable product solutions that our customers will benefit from.

Key Responsibilities

  • Architect and develop essential platform components, focusing on infrastructure orchestration, provisioning, and lifecycle management solutions.
  • Create robust APIs, services, and control planes that abstract diverse infrastructure types, including VMs, Kubernetes, bare metal, and schedulers.
  • Convert customer usage patterns into actionable product requirements, delivering impactful features and enhancements.
  • Design automation and internal tools to mitigate manual and ad-hoc operational tasks.
  • Improve platform reliability, performance, and observability, focusing on sustainable enhancements rather than quick fixes.
  • Collaborate with other teams to establish clear ownership boundaries between platform features and customer-specific solutions.
  • Write clean, maintainable, and well-documented code with a focus on long-term sustainability.
  • Engage in technical design discussions and contribute to the architectural advancements of our platform.

About Andromeda Cluster

Andromeda Cluster is at the forefront of making sophisticated AI infrastructure accessible to all. We innovate to connect AI laboratories, data centers, and cloud providers, establishing a powerful network that supports the development of artificial general intelligence (AGI) in an efficient and scalable manner.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.