companyTurnitin, LLC logo

Senior AI Data Engineer - Remote (UK)

Turnitin, LLCManchester
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Required Qualifications:Minimum of 4 years of experience in data engineering, especially in AI/ML data infrastructure or accelerating AI R&D. Expertise in Python, SQL, and Infrastructure as Code (Terraform, CloudFormation), along with experience in modern orchestration frameworks (Airflow, Prefect, or dbt). Proficient in cloud-native data platforms (AWS, Azure, GCP) and vector databases (Pinecone, Weaviate, Qdrant, or Chroma). Familiarity with MLOps tools and platforms (HuggingFace, SageMaker Bedrock, Vertex AI), and experiment tracking (MLflow, Weights & Biases). Experience with Large Language Models (LLMs), embedding generation, retrieval-augmented generation (RAG) systems, and frameworks for orchestrating LLM interactions (LiteLLM, LangFuse, LangChain, LlamaIndex). Strong problem-solving, analytical, and communication skills, with a proven ability to work collaboratively.

About the job

At Turnitin, we recognize that AI and data science are fundamental to our achievements and ambitious product strategy. As a Senior AI Data Engineer, you will join a dynamic global team of proactive and independent professionals dedicated to crafting sophisticated, well-structured AI and data systems. You will be at the forefront of developing our next-generation data and AI pipelines, significantly scaling our team's impact. You'll collaborate across various teams within Turnitin to integrate AI and data science into a diverse range of products aimed at enhancing learning, teaching, and academic integrity.

Key Responsibilities:

  • AI Data Infrastructure & Pipeline Management: Design, build, and operate scalable real-time data pipelines that facilitate ongoing Applied AI model training. Implement and maintain robust data infrastructure utilizing AI techniques and engineering best practices to ensure continuous model improvement.
  • Data Collection: Lead efforts to collect, normalize, and store data from various sources, including external LLM providers.
  • Collaboration: Work closely with AI R&D, Applied AI, and Data Platform teams to ensure smooth data flow and adherence to quality standards. Collaborate with stakeholders to curate and catalog high-quality datasets that support Applied AI retraining workflows and business goals.
  • Support for AI R&D: Contribute to AI Research & Development initiatives by leveraging advanced data warehousing and engineering technologies. Engage in exploratory data projects to extract insights from Turnitin's extensive datasets.
  • Communication: Foster clear communication across teams, aligning with the company vision while sharing insights on data infrastructure requirements and potential innovations.
  • Technology Evolution: Stay updated with emerging tools and methodologies in AI data engineering, providing recommendations to enhance our AI data infrastructure and capabilities.

About Turnitin, LLC

Turnitin is a leading innovator in the global education sector, dedicated to promoting academic integrity and supporting educational institutions for over 25 years. With a user base of over 21,000 academic institutions, publishers, and corporations, our services include Feedback Studio, Originality, Gradescope, ExamSoft, Similarity, and iThenticate. We offer a remote-centric culture that empowers you to work with purpose and accountability, supported by a comprehensive benefits package focused on your overall well-being. Our diverse team, spread across more than 35 countries, is united by a shared mission to make a transformative impact in education.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.