About the Role
This role owns infrastructure across our entire tech stack: if it exists in the cloud, it falls under your purview. In robotics, data is essential, and we need robust, scalable infrastructure to manage, store, and process vast amounts of it. The APIs, services, and monitoring systems you will manage are critical to our operations.
Your Responsibilities Include:
- Managing compute resources (both CPU and GPU) to efficiently process petabytes of data at high throughput.
- Overseeing the infrastructure required for data processing and storage.
- Ensuring the security and integrity of our infrastructure and data.
You Will Excel in This Role If You Have:
- At least 5 years of experience managing large-scale cloud infrastructure using tools such as Kubernetes and Terraform, with a primary focus on Python services.
- A deep understanding of AWS services (or their equivalents) and their permission models.
- Strong perspectives on the effective use of coding agents within an infrastructure context.

