ai& logoai& logo

Data Center Facility Operations Lead

ai&Tokyo
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Mid to Senior

Qualifications

The ideal candidate will have:Extensive hands-on experience with high-density data center systems. Specific expertise in liquid cooling systems. Proven experience in NOC operations. Bilingual fluency in Japanese and English. A strong ability to work collaboratively in a global team environment.

About the job

About ai&

ai& is an innovative global AI technology firm focused on addressing the escalating demand for artificial intelligence solutions. Our mission is twofold: to be a leading AI laboratory specializing in localization and to function as a global provider of infrastructure and computing resources. We are creating a cohesive, optimized global platform that combines cutting-edge data centers, diverse computing capabilities, and advanced model services. We believe that the most effective way to scale AI is to control the entire technological stack.

At ai&, we empower teams with the autonomy necessary to tackle significant challenges. Our methodology involves breaking down complex problems into manageable components and collaboratively addressing intricate issues. We are in search of highly motivated, mission-driven individuals who showcase strong personal initiative. We highly value curiosity as the cornerstone of talent, and we are eager to welcome individuals who are excited to grow alongside our transformative technology and expanding business.

We are actively recruiting globally, with offices in Tokyo, San Francisco, Austin, and Toronto. We are committed to connecting with exceptional talent wherever they may be.

Role Overview

As the lead for Data Center Facility Operations, you will be responsible for ensuring the physical reliability and performance of ai&'s compute infrastructure in Japan. This role is heavily execution-oriented. You will manage the mechanical, electrical, and cooling systems that maintain our high-density GPU clusters, oversee the Network Operations Center (NOC) for our Japan operations, and ensure that any issues are identified and addressed before they affect our compute fleet.

You won't just be overseeing a facility; you will be operating it. You will define operational processes, develop monitoring systems, adhere to stringent service level agreements (SLAs), and be accountable for the uptime of some of the world’s most demanding computing environments. The ideal candidate will possess extensive hands-on experience with mission-critical data center systems, expertise in high-density and liquid cooling environments, experience in NOC operations, and the capability to communicate effectively in both Japanese and English while coordinating with a global team.

Key Responsibilities

  • NOC Ownership & Operations Take ownership of the NOC for ai&'s Japan data center. Continuously monitor infrastructure health, manage alerts, coordinate incident responses, and ensure that no issues affect the compute fleet without being addressed first. Establish processes, tools, escalation pathways, and shift handover procedures from the ground up.

  • Build Observability & Logging Develop comprehensive observability and logging systems to enhance operational efficiency.

About ai&

ai& is at the forefront of AI technology, committed to developing innovative solutions that meet the growing global demand. Our focus on localization and infrastructure services positions us as a leading player in the AI landscape, enabling us to serve a diverse array of clients across the globe.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.