Senior MLOps Engineer - Data Ingestion

Doctolib, Paris, France
On-site, Full-time

Experience Level

Senior

Qualifications

To excel in this position, you should possess:

  • Extensive experience in MLOps, including the design and implementation of ML pipelines.
  • Strong knowledge of data security and privacy practices, particularly in healthcare settings.
  • Proficiency in programming languages such as Python, Java, or similar.
  • Experience with data architecture, including pseudo-anonymization and secure data export.
  • Familiarity with ML orchestration platforms like MLflow or similar tools.
  • Excellent problem-solving skills and a proactive approach to troubleshooting.
  • Experience mentoring and guiding junior team members.

About the job

Your Impact

Join the Panda Team within our Data & AI Platform as a Senior MLOps Engineer. You will play a pivotal role in building and maintaining secure ML pipelines that transform how we manage healthcare data at scale. Working with a dedicated feature team, you will develop essential data infrastructure that supports data-driven decision-making while safeguarding the privacy of millions of patients.

Being part of the tech team at Doctolib means creating innovative products and features that enhance the daily experiences of healthcare teams and patients alike.

What You’ll Build

  • Design and implement comprehensive ML model pipelines in production (including LLM and custom models) with robust deployment, evaluation, and monitoring frameworks.
  • Oversee the data pseudo-anonymization architecture within ingestion services, transforming Tier 0 (personal identifiers) into Tier 1 (anonymized data) while ensuring data quality and model performance.
  • Develop and maintain secure data export services with ML-based threat detection to mitigate attack vectors (SQL injection, etc.) through adaptive models instead of manual rules.
  • Manage golden datasets and establish production model evaluation frameworks to ensure both anonymization quality and system reliability.
  • Construct and maintain data pipelines that efficiently extract, transform, and load data from a variety of sources, accommodating multiple data formats (text, images, audio, video).
  • Implement automation and orchestration tools utilizing ML orchestration platforms (MLflow, Braintrust, or similar) to streamline infrastructure provisioning and minimize manual efforts.
  • Continuously monitor data and ML platforms for performance, reliability, and security; proactively identify and resolve issues.
  • Mentor team members on MLOps best practices to reduce knowledge silos and enhance organizational capability.

Life at Doctolib Tech

  • Our solutions are built on a fully cloud-native platform that supports web and mobile app interfaces and multiple languages, and is tailored to specific country and healthcare specialty requirements.
  • Our tech stack features Rails, TypeScript, Java, Python, Kotlin, Swift, and React Native, and we leverage AI ethically across our products to empower patients and healthcare professionals.

About Doctolib

Doctolib is a leading digital health company, committed to improving access to healthcare through innovative technology. We provide a platform that connects patients and healthcare professionals, ensuring a streamlined and efficient experience. Our team is passionate about leveraging technology to enhance the healthcare landscape, empowering both patients and providers.
