About the job
Who We Are
Nuro is a pioneering self-driving technology company dedicated to making autonomy accessible to everyone. Established in 2016, Nuro is revolutionizing transportation with the most scalable autonomous driver, integrating advanced AI with automotive-grade hardware. Our flagship technology, the Nuro Driver™, is licensed to cater to various applications, including robotaxis, commercial fleets, and personal vehicles. With years of proven self-driving deployments, Nuro provides automakers and mobility platforms a clear pathway to commercial-scale autonomous vehicles, fostering a safer, more connected future.
About the Role
We are in search of a Senior/Staff Software Engineer to take on a pivotal technical leadership role in Nuro’s Machine Learning (ML) Data engine. You will be positioned at the crucial intersection of Autonomy, Machine Learning, and Infrastructure, serving as an architect for the systems that power our autonomy AI models.
In this role, you’ll collaborate with the Autonomy team to implement the technical strategy to convert vast amounts of autonomy data into high-value training signals for decision-making. Your responsibilities will include designing and constructing data products for autonomy researchers, developing queries for rare 'needle-in-a-haystack' scenarios, and automating labeling and data ingestion workflows. You will work closely with Autonomy ML researchers to grasp their data needs, cooperate with infrastructure teams to establish the appropriate data interfaces and APIs, and develop robust tools for data selection, simulation, and introspection, all capable of handling data at scale. If you thrive on solving complex problems with practical solutions that impact the real world, we invite you to join us.
About the Work
- Data Pipeline Architecture: Design and implement scalable data ingestion and processing pipelines that transform data streams into targeted training datasets. Lead efforts to enhance data quality, identify anomalies, and manage out-of-distribution examples to ensure effective model training and deployment.
- Cross-Functional Leadership: Collaborate with autonomy teams and data infrastructure teams to create effective ML data pipelines and products for ML engineers.
- ML Tooling & Introspection: Build infrastructure and visualization tools that enable ML researchers to easily introspect data, identify model failure modes, request new data samples, and comprehend shifts in data distribution.
- Labeling Operations Integration: Work closely with the data operations team to establish quality standards for labeling operations.

