Qualifications
Key Responsibilities
Lead and mentor a team of infrastructure and platform engineers, ensuring clear priorities and effective obstacle removal.
Take ownership of our build systems and CI/CD pipelines, including self-hosted runners, and cloud infrastructure.
Manage and enhance our cloud security posture, including IAM policies, access controls, secrets management, and compliance.
Oversee the endpoint fleet, ensuring efficient provisioning, configuration management, OS policies, and security practices across employee devices.
Maintain high developer velocity through fast builds, reliable testing, and seamless tooling across a diverse codebase.
Manage our compute fleet, focusing on CI bot scaling, cost optimization, container orchestration, and capacity planning.
Implement observability and alerting solutions for build infrastructure, CI fleet health, and cloud resources.
Monitor cloud expenditures, identify inefficiencies, and implement cost-saving strategies without compromising developer experience.
Design onboarding automation to facilitate new engineers' transition from zero to productive with minimal friction.
Establish and enforce disaster recovery and business continuity practices for critical infrastructure.
Embrace simplicity in problem-solving by choosing straightforward solutions over unnecessary abstraction.
About the job
Join MatX in Shaping the Future of AI Infrastructure
At MatX, we are pioneering the development of vertically integrated full-stack solutions that span from silicon to sophisticated systems for artificial general intelligence compute platforms. Our unique blend of hardware and software is complemented by a robust infrastructure team that manages essential systems — including build systems, CI/CD pipelines, cloud environments, security measures, and developer tooling — enabling our nimble team to deliver results at the speed of a much larger organization.
We are seeking a proactive and experienced Infrastructure and IT Manager who will take ownership of our infrastructure and lead our talented team. In this role, you will engage in writing infrastructure-as-code, debugging build systems, triaging CI failures, and pushing code, while also focusing on hiring, mentoring, and defining the technical direction for our expanding platform team.