Senior AI Data Engineer - Remote (UK)
Remote Full-time
Experience Level
Mid to Senior
Qualifications
Required Qualifications:
- A minimum of 4 years of experience in data engineering, particularly in AI/ML data infrastructure or supporting AI R&D efforts.
- Proficiency in Python, SQL, and Infrastructure as Code (Terraform, CloudFormation), along with familiarity with modern orchestration frameworks (Airflow, Prefect, or dbt).
- Experience with cloud-native data platforms (AWS, Azure, GCP) and vector databases (Pinecone, Weaviate, Qdrant, or Chroma).
- Familiarity with MLOps tools and platforms (HuggingFace, SageMaker, Bedrock, Vertex AI), experiment tracking tools (MLflow, Weights & Biases), and model deployment pipelines.
- Knowledge of Large Language Models (LLMs), embedding generation, retrieval-augmented generation (RAG) systems, and frameworks for orchestrating LLM interactions (LiteLLM, LangFuse, LangChain, LlamaIndex).
- Excellent problem-solving, analytical, and communication skills, with the capability to...
About the job
At Turnitin, we recognize that the foundation of exceptional AI lies in superior data management. As a Senior AI Data Engineer, you will become an essential member of our innovative global team, dedicated to developing cutting-edge AI and data systems. Your contributions will play a pivotal role in creating the next generation of data and AI pipelines, amplifying the impact of our initiatives. Collaborating with diverse teams across Turnitin, you will facilitate the integration of AI and data science into a wide range of products aimed at enhancing educational experiences and promoting academic integrity.
Key Responsibilities
- AI Data Infrastructure & Pipeline Management: Design, construct, and manage scalable, real-time data pipelines that enable ongoing training of Applied AI models. Implement and uphold robust data infrastructure using AI methodologies and best engineering practices to facilitate continuous model enhancements.
- Data Collection: Lead efforts to gather, normalize, and store data from various sources, including external LLM providers.
- Collaboration: Work closely with AI R&D, Applied AI, and Data Platform teams to ensure seamless data flow and adherence to quality standards. Collaborate with stakeholders to gather, curate, and document high-quality datasets that directly support Applied AI workflows and business goals.
- AI R&D Support: Provide additional support to AI Research & Development by leveraging advanced data warehousing and engineering technologies. Engage in exploratory data projects that extract insights from Turnitin's vast data repositories.
- Communication: Foster clear communication across teams, ensuring alignment with the company’s vision while conveying insights regarding data infrastructure needs and innovative possibilities.
- Technology Evolution: Stay abreast of emerging tools and methodologies in AI data engineering, providing recommendations to enhance our data infrastructure and capabilities.
About Turnitin, LLC
Join Turnitin, a pioneering force in the global education sector. For over 25 years, we have partnered with educational institutions to foster honesty, fairness, and consistency across all areas of learning and assessment. Our services, including Feedback Studio, Originality, Gradescope, ExamSoft, Similarity, and iThenticate, are used by over 21,000 academic institutions, publishers, and corporations. Experience a remote-centric culture that empowers you to work purposefully and autonomously, supported by a comprehensive benefits package focused on your well-being. Our diverse workforce is united by a common goal: to make a meaningful impact in education. Turnitin has a global presence, with team members across more than 35 countries, including the United States, Mexico, the United Kingdom, Australia, Japan, India, and the Philippines.