About the job
Join the AMAX Solution Engineering team as a Cluster Test Engineer, where you will be instrumental in developing, validating, deploying, and supporting cutting-edge computing solutions. This position is highly hands-on and requires both hardware and software expertise to tackle technical challenges and ensure successful solution launches.
Key Responsibilities:
- Install, monitor, and optimize GPU clusters and job schedulers.
- Design and execute AI and HPC tests for data center solutions.
- Research and implement new testing technologies and methodologies to enhance solution reliability.
- Debug intricate hardware and software issues during the deployment phase.
- Support the development of diagnostic tools and validation frameworks.
- Assist customers with building compute infrastructure and validating workloads.

