About the job
Waymo is at the forefront of autonomous driving technology, striving to become the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, Waymo has been dedicated to developing the Waymo Driver—The World’s Most Experienced Driver™—with the goal of enhancing mobility access while aiming to reduce the tragic loss of lives due to traffic accidents. The Waymo Driver not only fuels our fully autonomous ride-hailing service but is also adaptable across various vehicle platforms and applications. With over ten million trips solely for riders and extensive experience from autonomously navigating over 100 million miles on public roads, alongside tens of billions of miles in simulation across more than 15 U. S. states, we are leading the charge in this transformative technology.
The Large Model Evaluation team plays a pivotal role in advancing Waymo's AI vision. As we integrate cutting-edge Large Language Models (LLMs) and Vision-Language Models (VLMs), our aim is to construct sophisticated AI systems capable of addressing the multifaceted challenges of real-world driving. Central to our achievements is the ability to accurately measure our progress. In a landscape where robust evaluation serves as a critical barrier to deploying large models, the intricacies of this task at Waymo are particularly complex and safety-sensitive. We seek quantitatively driven engineers to innovate and propose novel methodologies for evaluating the ML models utilized in the Waymo Driver.

