companyRecursion Pharmaceuticals logo

Senior Scientific Data Engineer - Data Platform

Recursion PharmaceuticalsLondon, England; Oxford, EnglandNew
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

The ideal candidate will possess a robust foundation in software engineering principles and a deep understanding of the drug discovery landscape. Experience in managing large datasets and an ability to develop scalable data architectures are crucial. Familiarity with both public and proprietary data sources in the pharmaceutical domain is highly desirable.

About the job

Recursion Pharmaceuticals is reimagining drug discovery by combining biology and data in new ways. The Senior Scientific Data Engineer - Data Platform will help shape and maintain the data infrastructure that drives this mission. This position is based in either London or Oxford.

Role overview

This role focuses on designing, developing, and maintaining scientific data systems that underpin Recursion’s research and product development. The emphasis is on data architecture rather than machine learning or model building. The work involves:

  • Managing the ingestion, standardization, and distribution of public and proprietary datasets essential for drug discovery.
  • Enabling competitor intelligence, chemical tractability analysis, and compound design workflows through well-structured data products.
  • Owning and evolving the data infrastructure that supports predictive modeling and machine learning across the company.

This position suits engineers who enjoy building complex scientific data systems. It is not a modeling or data science role.

Key data systems and products

  • Flagship SAR Data Mart: Integrates commercial and public bioactivity databases (such as ChEMBL) with internal assay results.
  • Commercial Vendor Data Mart: Maintains a catalog of purchasable compounds for internal design tools and tractability assessments.
  • Biomedical Knowledge Graph: Offers semantic graph infrastructure connecting targets, diseases, and compounds to support AI-driven research.
  • Chemical Synthesis Data: Stores reaction datasets for training retrosynthesis models and predicting chemical tractability.
  • Patent Intelligence System: Converts patent feeds and competitor data into actionable insights for research and strategy.
  • Compound Standardization Registry: Maintains a large-scale chemical structure repository to ensure consistency across billions of compounds, similar to UniChem.

Important note

This is a specialized Data Engineering position focused on data infrastructure and stewardship. The work does not include training or building predictive models, but it directly supports those efforts through reliable systems and curated datasets.

About Recursion Pharmaceuticals

Recursion Pharmaceuticals is at the forefront of transforming drug discovery through innovative biology decoding. Our mission is to industrialize the drug discovery process, harnessing cutting-edge technology to deliver impactful solutions in healthcare.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.