About the job
Join us at Foxglove, where we are revolutionizing the robotics industry by building robust data infrastructure for real-world applications.
As robotics transitions from research environments to practical implementations in factories, warehouses, vehicles, and field operations, data becomes essential for engineers to troubleshoot failures, understand unexpected behaviors, and enhance robotic systems.
At Foxglove, we provide the observability, visualization, and data infrastructure that enable robotics and autonomous systems teams to efficiently ingest, store, query, replay, and analyze extensive volumes of multimodal sensor data from live systems and production fleets.
About the Role
We are seeking a talented Applied Machine Learning Engineer with strong infrastructure insights to design, deploy, and scale the machine learning systems that power our data platform. In this impactful role, you will be responsible for optimizing production ML infrastructure—from enhancing inference pipeline throughput to establishing training and evaluation workflows. You will focus on high-priority challenges, such as developing retrieval applications for petabyte-scale multimodal robotics data, utilizing cutting-edge models to create high-performance search and data mining products, and fostering an internal ML flywheel for rapid iteration. This is a hands-on, application-driven position rather than a research-focused role.
Key Responsibilities
- Deploy and manage inference infrastructure for production ML workloads, focusing on model serving, scalability, and cost efficiency.
- Build and oversee vector database integrations and embedding applications to facilitate semantic search across various multimodal robotics data types (image, video, point cloud, and time series).
- Design and implement evaluation and training infrastructure to enhance model performance rapidly.
- Lead cloud architecture decisions and tools to optimize inference latency, throughput, cost, and reliability at scale.
- Collaborate closely with product engineers to deliver application-driven ML features that empower developers at the forefront of robotics and physical AI, steering clear of prototype experiments.
- Identify appropriate off-the-shelf solutions for production and determine when to build versus buy.

