About the job
About Us:
At Ambience Healthcare, we are not just another scribe; we are pioneering an AI intelligence platform that reinvigorates the human touch in healthcare while delivering significant ROI for health systems nationwide.
Our innovative technology enables healthcare providers to concentrate on delivering exceptional care by alleviating the administrative burdens that detract from patient interactions and their most impactful work. Ambience provides real-time, coding-aware documentation and clinical workflow support in ambulatory, emergency, and inpatient settings across leading health systems in North America.
Our team is driven by a relentless pursuit of excellence and extreme ownership, dedicated to crafting the best solutions for our health system partners. We champion transparency, positivity, and thoughtful engagement, holding each other accountable because we understand the significance of the challenges we tackle.
Ambience has earned accolades such as being ranked #1 for Improving the Clinician Experience in the KLAS Research Emerging Solutions Top 20 Report, being recognized by Fast Company as one of the Next Big Things in Tech, and being named one of the best AI companies in healthcare by Inc. We were also selected as a LinkedIn Top Startup in 2024 and 2025. Our esteemed investors include Oak HC/FT, Andreessen Horowitz (a16z), OpenAI Startup Fund, and Kleiner Perkins — and our journey is just beginning.
The Role:
As a Staff Machine Learning Engineer, you will play a crucial role in advancing clinical AI that impacts millions of patient encounters across the largest health systems in the nation. Your contributions will directly influence the speed at which we enhance our AI capabilities through the platform you will oversee.
You will design and implement evaluation and release processes that empower teams to deliver with confidence, create observability tools to identify quality issues pro-actively, and develop debugging tools that facilitate rapid issue reproduction. Additionally, you’ll work on the chart context retrieval layer that transforms patient history into model-ready inputs.
Our goal is to enable teams to iterate on quality within days, not weeks, ensuring that every enhancement you implement adds value across all product teams each quarter.
Please note that our engineering roles operate in a hybrid model from our San Francisco office (3 days per week).
What You’ll Own:
Evaluation & Release Infrastructure — Developing automated grading systems and release gates that function seamlessly across product teams, creating a unified evaluation dataset with version control to replace fragmented workflows. Implementing production-quality monitoring that includes end-to-end tracing, shared metrics, and automated alerts.
Debugging Tools — Building encounter replay features that reconstruct precise inference inputs (including retrieved chart context, packed prompts, and model versions) to allow teams to troubleshoot issues without sifting through logs. Creating differential views to compare known good states with regressions.

