About the job
Baseten supports companies like Cursor, Notion, and Writer in running AI inference at scale. The team blends AI research, adaptive infrastructure, and developer tools to help organizations deploy advanced AI models efficiently. Backed by investors such as BOND, IVP, and Greylock, Baseten recently raised a $300M Series E. The company aims to be the trusted platform for engineers launching AI products.
Role overview
The Software Engineer - Realtime Systems (Voice AI) role focuses on building and deploying production-ready Voice AI systems. Baseten’s Voice AI team works with open-source models to power applications in productivity, customer support, clinical conversations, creative tools, and education. Engineers in this group influence how people use voice to interact with technology, shaping products that impact multiple industries.
This position involves leading Voice AI projects, setting both product direction and technical strategy. Collaboration is a key part of the work: expect to partner with Forward Deployed Engineers, Model Performance Engineers, and other teams to advance Baseten’s Voice AI capabilities.
Sample projects
- The world's fastest Whisper, with streaming and diarization
- Orpheus TTS inference partnership with Canopy Labs
- Collaborate with the Core Product team to build a multi-model voice agent using Baseten’s orchestration framework
- Work alongside the Training Platform team to support ongoing training of voice models
- Design APIs and SDKs that make Baseten Voice AI products accessible for developers
Location
This role is based in San Francisco.

