companyWaymo LLC logo

Senior Software Engineer - Post-Training & RL Frameworks

Waymo LLCMountain View, California
On-site Full-time $204K/yr - $259K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

You Will:Report to the Head of ML Frameworks & Efficiency. Develop the foundational training system for adapting RL techniques to unprecedented scales and diverse environments (i.e., CPU/GPU/TPU). Work alongside teams to integrate cutting-edge rollout strategies, policies, and RL algorithms (i.e., REINFORCE, DPO, PPO) into the training system. Optimize the end-to-end RL training pipeline for efficient and scalable learners/actors, and establish low-latency distributed reply buffers to retain data generated by rollouts. Create evaluations, analyze experimental results, and iterate swiftly to boost model performance and training workflows. Stay updated with the latest research in RL, Vision-Language-Action (VLA) models, and World models to inspire and inform new initiatives. You Have:A Bachelor’s degree in Computer Science, Mathematics, or 8+ years of equivalent real-world experience. Proficiency in distributed systems design with a strong understanding of machine learning efficiency. Experience working with ML frameworks and tools. Strong problem-solving abilities and a passion for innovation in autonomous driving technology.

About the job

Waymo is a pioneering autonomous driving technology company dedicated to becoming the world's most trusted driver. Originating from the Google Self-Driving Car Project in 2009, our focus has been on developing the Waymo Driver—The World’s Most Experienced Driver™. Our mission is to enhance mobility access while saving lives that are currently lost to traffic accidents. The Waymo Driver facilitates our fully autonomous ride-hail service and is adaptable to a variety of vehicle platforms and product use cases. To date, we have completed over ten million rider-only trips, driven over 100 million miles on public roads, and performed tens of billions of miles in simulation across more than 15 U. S. states.

The Waymo ML Frameworks & Efficiency team collaborates with both Research and Production teams to create core models in Perception and Planning essential to our autonomous driving software. We empower our partners by providing optimal frameworks throughout the model development lifecycle, encompassing both pre-training and post-training phases. Our frameworks are designed to efficiently scale models while addressing the unique challenges of machine learning in autonomous driving.

We invite skilled engineers with expertise in machine learning systems to join us in refining and enhancing pre-trained models for deployment within the Waymo Driver and future products. You will collaborate with researchers and modeling engineers across the organization to tackle the complexities of large-scale reinforcement learning (RL), developing systems capable of scaling across various computational, data, and environmental contexts to enhance model intelligence and interpret human driving behaviors.

About Waymo LLC

Waymo is at the forefront of autonomous driving technology, committed to making roads safer and more accessible through our advanced driving systems. With a rich history as part of Google, we leverage cutting-edge research and technology to redefine transportation for everyone.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.