company

Senior MLOps Engineer - LLMOps

TRM LabsSan Francisco, CA
On-site Full-time $200K/yr - $220K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

The ideal candidate will have:A strong background in machine learning operations with a focus on large language models. Experience in building and maintaining CI/CD pipelines. Proficiency in programming languages such as Python, as well as familiarity with relevant AI/ML frameworks. Knowledge of cloud services and infrastructure management. Demonstrated ability to work collaboratively in a team-oriented environment. Excellent problem-solving skills and a strong commitment to quality and best practices.

About the job

Contribute to a Safer World.

At TRM Labs, we leverage blockchain analytics and artificial intelligence to empower law enforcement, national security agencies, financial institutions, and cryptocurrency enterprises in the fight against crypto-related fraud and financial crime. Our advanced blockchain intelligence and AI platforms are designed to trace transactions, identify illicit activities, build investigative cases, and establish a comprehensive view of potential threats. Trusted by leading organizations worldwide, TRM is committed to fostering a safer, more secure environment for everyone.

The AI Engineering Team is dedicated to driving the development of next-generation AI applications, specifically focusing on Large Language Models (LLMs) and agentic systems. Our mission is to create resilient pipelines, high-performance infrastructure, and operational tools that facilitate the swift, safe, and scalable deployment of AI systems.

We manage extensive petabyte-scale data pipelines, deliver model outputs with millisecond-level latency, and ensure observability and governance to make AI production-ready. Our team actively evaluates and integrates state-of-the-art tools in the LLM and agent domain, such as open-source stacks, vector databases, evaluation frameworks, and orchestration tools, which enhance TRM's ability to innovate more rapidly than the competition.

In the role of Senior MLOps Engineer specializing in LLMOps, you will play a pivotal role in constructing and scaling the technical infrastructure required for AI and ML systems. Responsibilities include:

  • Develop reusable CI/CD workflows for model training, evaluation, and deployment, incorporating tools like Langfuse, GitHub Actions, and experiment tracking.

  • Automate model versioning, approval processes, and compliance checks across various environments.

  • Construct a modular and scalable AI infrastructure stack, integrating vector databases, feature stores, model registries, and observability tools.

  • Collaborate with engineering and data science teams to integrate AI models and agents into real-time applications and workflows.

  • Regularly assess and incorporate cutting-edge AI tools (e.g., LangChain, LlamaIndex, vLLM, MLflow, BentoML, etc.).

  • Enhance AI reliability and governance, promoting experimentation while ensuring compliance, security, and system uptime.

  • Optimize AI/ML model performance by ensuring data accuracy, consistency, and reliability to improve training and inference processes.

  • Deploy infrastructure that supports both offline and online LLM evaluations.

About TRM Labs

TRM Labs is at the forefront of blockchain analytics and AI solutions, dedicated to assisting law enforcement, national security, and financial institutions in combating financial crime. With a commitment to innovation and a robust technological foundation, TRM empowers organizations worldwide to navigate the complexities of cryptocurrency and enhance security measures effectively.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.