About the job
About Us
At Heidi, we believe healthcare deserves a more harmonious rhythm, one that fosters continuous and profoundly human care. We are developing an AI Care Partner that collaborates with clinicians to facilitate this vision.
Our diverse team comprises doctors, engineers, designers, researchers, and creatives dedicated to building tools that enable clinicians to concentrate on their primary focus: their patients.
In just over 18 months, Heidi has reclaimed more than 18 million hours for healthcare professionals, supporting 73 million patient visits across 116 countries. Currently, our platform empowers over two million patient visits each week globally.
With nearly $100 million in funding, we are expanding our reach in the US, UK, Canada, and Europe, collaborating with leading health systems such as the NHS, Beth Israel Lahey Health, and Monash Health.
Your Role
As a Senior DevOps Engineer, you will play a crucial role in building and scaling the cloud infrastructure that powers our healthcare AI platform. You will be part of the Engineering Platform team, focusing on reliability, automation, and developer productivity. Your responsibilities will include working across AWS and Azure environments, managing Kubernetes workloads, enhancing CI/CD pipelines, and ensuring security and observability are integrated into our deliverables.
Key Responsibilities
Lead infrastructure automation efforts by developing and maintaining Terraform modules, Helm charts, and reusable CI/CD workflows using GitHub Actions.
Scale our Kubernetes platform by managing and improving EKS clusters across multiple regions, adding new services, enhancing security, and optimizing deployment processes.
Enhance observability by implementing metrics, logs, and tracing solutions utilizing Datadog, OpenTelemetry, and synthetic monitoring techniques.
Increase release reliability by establishing safe deployment patterns, including canary releases, blue/green deployments, and managing environment promotion flows.
Support developer self-service initiatives by expanding tooling resources for application teams, including environment scaffolds, secrets management, and automated provisioning.
Ensure security and compliance by implementing least-privilege IAM practices, secret rotation, image signing, and supply-chain hardening measures.
Work collaboratively across functions with product teams to embed reliability, incident response, and SLO ownership into our processes.
