About the job
We are seeking a resourceful, innovative, and meticulous AI Model Testing Engineer to establish, oversee, and enhance our testing infrastructure for evaluating AI models from top providers such as OpenAI, Google, and Anthropic. This position will specifically focus on testing and optimizing large language models (LLMs) and transcription models. You will work in close collaboration with our veterinary medical team to develop targeted test cases that enable veterinarians to identify and choose the most accurate and effective AI-generated outputs.
Key Responsibilities:
- Design, develop, and maintain testing pipelines and logic primarily centered on transcription models, while also addressing other AI models like LLMs and real-time inference models.
- Work alongside the veterinary medical team to translate clinical scenarios into structured transcription and AI model test cases.
- Implement frameworks for comparative analysis, enabling veterinarians to systematically assess and select optimal AI outputs.
- Strive to balance and optimize accuracy, quality, speed, and cost-effectiveness in testing processes.

