About the job
Zyphra is a pioneering artificial intelligence firm located in the vibrant city of San Francisco, California.
About the Role:
We are seeking a passionate Research Scientist to join our dynamic Agency and Reasoning Team at Zyphra. In this role, you will conduct cutting-edge research in reinforcement learning, post-training methodologies, and human preference learning. Your innovative ideas will be instrumental in shaping our next-generation language models, enabling their application on a large scale.
What We Desire:
A strong sense of research intuition and taste
Capability to navigate a research project from initial concept to execution and documentation
Proficiency in implementation and prototyping
A quick thinker who can rapidly transform ideas into experimental frameworks
Ability to collaborate effectively in a fast-paced research environment
An insatiable curiosity and enthusiasm for the study of intelligence.
Qualifications:
Proven experience and skill in reinforcement learning, particularly in the context of language model reasoning or classical RL tasks
Familiarity with language-model-supervised fine-tuning and preference-learning techniques, such as DPO and simPO.
Experience with methods for context-length extension
Strong intuitive understanding of model behaviors, with the ability to refine them through iterative fine-tuning
Interest in engaging deeply with data and dedicating time to data engineering and synthetic data generation
A postgraduate degree in a scientific discipline (Computer Science, Electrical Engineering, Mathematics, Physics)
Published research in reputable machine learning venues
Expertise in PyTorch and Python
Eagerness and aptitude for rapidly acquiring new knowledge and implementing innovative concepts
Exceptional communication and teamwork abilities, capable of contributing to both research and large-scale engineering efforts
Why Join Zyphra?
We champion creative and unconventional ideas and are prepared to invest significantly in innovative concepts.
Our culture fosters collaboration, curiosity, and intellectual growth.

