Senior Research Scientist, Reward Models
Anthropic | Remote-Friendly (Travel Required) | San Francisco, CA
Remote | Full-time
Qualifications
We are seeking an individual with a strong background in AI research, particularly in reinforcement learning and reward modeling. You should have a deep understanding of large language models and experience developing training methodologies. Strong analytical skills and the ability to design experiments focused on model generalization and robustness are essential. Familiarity with Python is required, as all interviews are conducted in Python. The ideal candidate will have a proven track record of leading ambitious research projects, contributing to publications, and mentoring junior researchers to build organizational knowledge.
Join Anthropic as a Senior Research Scientist on our Reward Models team, where you will lead research aimed at improving our understanding of human preferences at scale. Your contributions will directly influence how our AI models, including Claude, align with human values and optimize for user needs. You will work at the forefront of reward modeling for large language models, designing novel architectures and training methodologies for Reinforcement Learning from Human Feedback (RLHF). Your research will explore advanced evaluation techniques, including rubric-based grading, and tackle challenges such as reward hacking. You will collaborate with teams in Finetuning, Alignment Science, and our broader research organization to ensure your findings translate into tangible advances in AI capabilities and safety. This role is an opportunity to address critical AI alignment challenges, using cutting-edge models and substantial computational resources to further the science of safe and capable AI systems.
About Anthropic
At Anthropic, our mission is to engineer AI systems that are not only reliable and interpretable but also capable of being steered towards beneficial outcomes for users and society. We are rapidly expanding our team of dedicated researchers, engineers, policy experts, and business leaders, all working together to create AI solutions that prioritize safety and utility. Join us in our journey to redefine the possibilities of AI for the greater good.
