About the job
About the Team
The Stargate team at OpenAI is dedicated to constructing the physical infrastructure that drives our most advanced AI systems. We are at the forefront of designing, deploying, and managing cutting-edge data center infrastructure, expanding rapidly to meet the growing needs of AI technology. Our efforts synergize hardware, networking, facilities, supply chain, and deployment execution, ensuring seamless integration and functionality.
Our mission is to convert compute requirements into reliable, scalable, and deployable systems that can manage the complexities of frontier AI workloads.
About the Role
We are looking for a passionate Hardware Operations Technical Program Manager to lead the execution of AI infrastructure hardware programs throughout their lifecycle.
In this pivotal role, you will take ownership of cross-functional program execution, which includes hardware readiness, supplier coordination, deployment planning, rack-level integration, manufacturing operations, logistics, field deployment, and operational handoff. You will collaborate closely with teams in hardware engineering, data center engineering, networking, supply chain, manufacturing, deployment, and operations to ensure that critical infrastructure programs transition smoothly from design to production readiness.
This position is ideally suited for an individual who can navigate both technical and programmatic aspects, comprehending hardware systems, identifying operational hurdles, fostering accountability across teams, and establishing scalable processes for high-volume infrastructure deployment.
Key Responsibilities
Lead end-to-end Hardware Operations readiness initiatives for AI infrastructure systems, encompassing servers, racks, networking hardware, power and cooling interfaces, and related data center infrastructure.
Create and implement scalable hardware operations processes, workflows, and support models covering deployment, repair operations, diagnostics, break/fix, escalation management, and ongoing operations.
Oversee cross-functional execution of Hardware Operations readiness initiatives, ensuring that operational capabilities, tooling, documentation, staffing models, and workflows are established ahead of production deployment and operational handoff.
Collaborate with Hardware Engineering, Manufacturing, Supply Chain, Data Center Operations, Network Operations, Deployment, Reliability Engineering, and external suppliers to ensure alignment on operational requirements, supportability, and readiness milestones.
Develop operational scorecards, reporting frameworks, and metric algorithms to track progress and success.

