About the job
Who are we?
At Cohere, we are committed to harnessing intelligence to benefit humanity. Our mission focuses on training and deploying advanced models for developers and enterprises, empowering the creation of innovative AI systems that drive exceptional experiences in areas like content generation, semantic search, retrieval-augmented generation (RAG), and intelligent agents. We believe our contributions are vital for the widespread integration of AI technologies.
We take immense pride in our creations. Each team member is empowered to enhance the capabilities of our models and the value they deliver to our clients. We thrive in a fast-paced environment and prioritize actions that best serve our customers.
Cohere comprises a talented team of researchers, engineers, designers, and other specialists who are dedicated to their craft. Our diverse perspectives are crucial for developing exceptional products.
Join us in shaping the future of AI!
Why is this role important?
In this position, you will push the boundaries of model post-training, deploying state-of-the-art models into production while bridging the divide between research and real-world application. With one of the highest compute-to-engineer ratios globally, we foster collaboration between engineering and research, allowing everyone to contribute to production code and support research initiatives based on their interests and organizational needs. You will have access to the resources you need to excel.
Please note: We have offices in London, Paris, Toronto, San Francisco, and New York, but we also support a remote-friendly work culture!
As a Technical Staff Member, your responsibilities will include:
Developing and implementing high-performance, scalable software for model training.
Consistently performing post-training of models to achieve state-of-the-art performance.
Collaborating with specialized teams (Agentic, Code, etc.) to create models with comprehensive performance metrics.
Designing and executing strategies to enhance performance and outcomes during training cycles, including supervised fine-tuning (SFT) and reinforcement learning (RL).
Conducting research and experimentation using our advanced supercomputing and data infrastructures.
Learning from and collaborating with leading researchers in the AI field.
Ideal candidates will possess:
Exceptional software engineering capabilities.
A strong understanding of machine learning frameworks and tools.
Experience in collaborative development and version control systems.
A passion for continuous learning and applying innovative solutions.

