Researcher Interpretability jobs in San Francisco – Browse 508 openings on RoboApply Jobs

Researcher, Interpretability

OpenAI

Full-time|On-site|San Francisco

About the job

About Our Team

Join the Interpretability team at OpenAI, where we delve into the inner workings of deep learning models. Our mission is to leverage internal representations to gain insights into model behavior and to design models that offer clearer interpretations. We prioritize applying our findings to enhance the safety of advanced AI systems. Our collaborative and inquisitive work culture fosters innovation and exploration.

About the Position

OpenAI is on the lookout for a dedicated researcher with a passion for deep learning and a solid engineering background. In this role, you will develop and execute a research agenda focused on mechanistic interpretability, working closely with a team of driven individuals. Your contributions will be vital in ensuring that future AI models remain safe as their capabilities expand, significantly advancing our commitment to creating safe AGI.

Key Responsibilities:

Conduct and publish research on methods for interpreting the representations of deep networks.
Develop infrastructure to analyze model internals on a large scale.
Collaborate across various teams to undertake projects uniquely suited to OpenAI’s capabilities.
Direct research initiatives towards tangible usefulness and long-term scalability.

Ideal Candidate Profile:

Passionate about OpenAI’s mission to ensure that AGI benefits all of humanity, and aligned with OpenAI’s charter.
Enthusiastic about long-term AI safety and knowledgeable about the technical pathways to achieve safe AGI.
Experienced in AI safety, mechanistic interpretability, or closely related fields.
Holds a Ph.D. or has a substantial research background in computer science, machine learning, or a related discipline.
Excited to engage with large-scale AI systems and utilize OpenAI’s exceptional resources in this domain.
Has 2+ years of experience in research engineering and is proficient in Python or similar programming languages.
Exhibits deep curiosity and a willingness to explore new ideas.

Similar jobs

1 - 20 of 508 Jobs

Researcher, Interpretability

OpenAI

Full-time|On-site|San Francisco

About Our Team: Join the Interpretability team at OpenAI, where we delve into the inner workings of deep learning models. Our mission is to leverage internal representations to gain insights into model behavior and to design models that offer clearer interpretations. We prioritize applying our findings to enhance the safety of advanced AI systems. Our collabor…

Jun 15, 2025

Research Scientist, Interpretability

Anthropic

On-site|San Francisco, CA

Join Anthropic as a Research Scientist specializing in Interpretability, where you will play a pivotal role in demystifying modern language models. Our dedicated Interpretability team is committed to reverse-engineering how these advanced systems operate, ensuring their safety and reliability for society. We focus on mechanistic interpretability—understanding how neural network parameters correlate with meaningful algorithms. In this role, you will apply innovative methodologies akin to biological research, utilizing our custom-built 'microscopes' to explore the inner workings of neural networks. If you're passionate about advancing AI safety through scientific inquiry, we invite you to contribute to our transformative research.

Jan 29, 2026

Research Engineer in Economic Research

Anthropic

Full-time|On-site|San Francisco, CA

Join Anthropic as a Research Engineer focusing on Economic Research. In this role, you will leverage your analytical skills to conduct in-depth economic analysis and contribute to innovative projects aimed at enhancing our understanding of economic models and their implications.

Mar 12, 2026

Research Engineer / Research Scientist, Post-Training

OpenAI

Full-time|Hybrid|San Francisco

About the Team

Join the innovative Post-Training team at OpenAI, where we focus on refining and elevating pre-trained models for deployment in ChatGPT, our API, and future products. Collaborating closely with various research and product teams, we conduct crucial research that prepares our models for real-world deployment to millions of users, ensuring they are safe, efficient, and reliable.

About the Role

As a Research Engineer / Scientist, you will spearhead the research and development of enhancements to our models. Our work intersects reinforcement learning and product development, aiming to create cutting-edge solutions.

We seek passionate individuals with robust machine learning engineering skills and research experience, particularly with innovative and powerful models. The ideal candidate will be driven by a commitment to product-oriented research.

This position is located in San Francisco, CA, and follows a hybrid work model requiring three days in the office each week. Relocation assistance is available for new employees.

In this role, you will:

Lead and execute a research agenda aimed at enhancing model capabilities and performance.
Work collaboratively with research and product teams to empower customers to optimize their models.
Develop robust evaluation frameworks to monitor and assess modeling advancements.
Design, implement, test, and debug code across our research stack.

You may excel in this role if you:

Possess a deep understanding of machine learning and its applications.
Have experience with relevant models and methodologies for evaluating model improvements.
Are adept at navigating large ML codebases for debugging purposes.
Thrive in a fast-paced and technically intricate environment.

About OpenAI

OpenAI is a pioneering AI research and deployment organization dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We are committed to pushing the boundaries of AI capabilities while prioritizing safety and human-centric values in our products. Our mission is to embrace diverse perspectives, voices, and experiences that represent the full spectrum of humanity, as we strive for a future where AI is a powerful ally for everyone.

Dec 1, 2025

Research Engineer/Research Scientist, RL/Reasoning

OpenAI

Full-time|Hybrid|San Francisco

About Our Team

Join the forefront of AI innovation with the RL and Reasoning team at OpenAI. Our team is dedicated to advancing reinforcement learning research and has pioneered transformative projects, including o1 and o3. We are committed to pushing the limits of generative models while ensuring their scalable deployment.

About the Role

As a Research Engineer/Research Scientist at OpenAI, you will play a pivotal role in enhancing AI alignment and capabilities through state-of-the-art reinforcement learning techniques. Your contributions will be essential in training intelligent, aligned, and versatile agents that power various AI models.

We seek individuals with a solid foundation in reinforcement learning research, agile coding skills, and a passion for rapid iteration.

This position is located in San Francisco, CA, and follows a hybrid work model of three days in the office per week. We also provide relocation assistance for new hires.

You may excel in this role if:

You are enthusiastic about being at the cutting edge of RL and language model research.
You take initiative, owning ideas and driving them to fruition.
You value principled methodologies, conducting simple experiments in controlled environments to draw trustworthy conclusions.
You thrive in a fast-paced, complex technical environment where rapid iteration is essential.
You are adept at navigating extensive ML codebases to troubleshoot and enhance them.
You possess a profound understanding of machine learning and its applications.

About OpenAI

OpenAI is a pioneering AI research and deployment organization committed to ensuring that general-purpose artificial intelligence serves the greater good for humanity. We strive to push the boundaries of AI system capabilities while prioritizing safe deployment through our innovative products. We recognize AI as a powerful tool that must be developed with safety and human-centric principles, embracing diverse perspectives to reflect the full spectrum of humanity.

We are proud to be an equal opportunity employer, welcoming applicants from all backgrounds without discrimination based on race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or any other legally protected characteristic.

May 14, 2025

Developer Relations Lead

Pluralis Research

Full-time|On-site|San Francisco

Pluralis Research is at the forefront of Protocol Learning, an innovative decentralized approach to training and deploying AI models that democratizes access to this technology for individuals, rather than just large corporations. By aggregating computing resources from numerous contributors, incentivizing participation, and ensuring no single entity can dominate the model's complete weights, we are forging a truly open and collaborative pathway to cutting-edge AI.

Role Overview

We are seeking a passionate Developer Relations Lead to serve as the crucial technical liaison between Pluralis's research initiatives and the broader machine learning and systems communities. In this role, you will transform complex, groundbreaking research (including distributed training, communication-efficient model parallelism, and fault-tolerant optimization) into clear, engaging, and accessible content for researchers, engineers, and innovators.

This position is not merely a traditional marketing role. We are looking for an individual who can digest our research papers, grasp the underlying architecture, and convey these insights effectively through blog posts, conference presentations, or social media updates. You will shape our technical narrative and become the face of Pluralis's contributions within the community.

Mar 25, 2026

Research Lead

abundant

Full-time|Remote|San Francisco

abundant seeks a Research Lead based in San Francisco. This position steers research activities that help shape the company’s direction. The Research Lead partners with colleagues to analyze data, draw meaningful insights, and support projects where research has a clear business impact.

Key responsibilities

Plan, manage, and execute research initiatives from start to finish
Work with team members to analyze data and spot important trends
Turn research results into practical recommendations for the business
Support projects that guide company strategy

Collaboration and impact

This role involves close teamwork and communication across departments. Research findings directly inform business decisions and contribute to the company’s ongoing growth.

Apr 24, 2026

Research Scientist

Intology

Full-time|On-site|San Francisco

Overview

Become an integral part of our dynamic R&D team dedicated to developing fully automated research systems that push the boundaries of AI. Zochi has achieved a milestone by publishing the first entirely AI-generated A* conference paper. Locus has set a new industry standard as the first AI system to surpass human experts in AI R&D.

Key Responsibilities

Conceptualize and develop innovative architectures for automated research.
Work collaboratively within a specialized team of researchers addressing cutting-edge challenges in long-horizon agentic capabilities, post-training for open-ended objectives, and environment crafting.
Document and publish key internal findings alongside success stories from external collaborations.

Qualifications

PhD or equivalent research experience in Computer Science, Machine Learning, Artificial Intelligence, or a related discipline. Outstanding candidates with significant research contributions are encouraged to apply, regardless of formal qualifications.
Demonstrated history of impactful AI/ML research contributions in academic or corporate environments.
Expertise in developing long-horizon, multi-agent systems and/or model post-training, especially in scientific domains or for open-ended discovery objectives.
A strong passion for advancing problem-solving processes and scientific discovery, thriving in high-autonomy roles and environments.

Our Culture

Competitive compensation and equity options.
Unlimited Paid Time Off (PTO), emphasizing team collaboration and a community-focused workplace.
Opportunities for conference participation and engagement in community initiatives.
Empowered roles with high levels of responsibility.

#1: We are a small, passionate team of leading investors, researchers, and industry experts committed to the mission of accelerating discovery. Join us.

Sep 14, 2025

Senior Researcher in Misalignment Research

OpenAI

Full-time|On-site|San Francisco

About Our Team

At OpenAI, our Safety Systems team is dedicated to advancing the mission of developing and deploying safe artificial general intelligence (AGI). We are establishing a specialized research team focused on identifying and addressing critical misalignment issues that may arise as AGI technology evolves. Our goal is to proactively quantify and mitigate potential misalignment risks to ensure they do not threaten societal wellbeing.

Our research efforts are structured around four key areas:

Worst-Case Demonstrations: Create compelling demonstrations that illustrate how AI systems can fail, particularly in scenarios where misaligned AGI could undermine human interests.
Adversarial & Frontier Safety Evaluations: Develop rigorous evaluations based on these demonstrations to measure dangerous capabilities and remaining risks, focusing on issues like deceptive behavior and power-seeking tendencies.
System-Level Stress Testing: Construct automated infrastructure to stress-test entire product stacks, evaluating their robustness under extreme conditions and evolving the tests as systems improve.
Alignment Stress-Testing Research: Analyze failures in mitigations and publish insights to inform strategy and develop next-generation safeguards, collaborating with other research teams for collective advancement.

About the Role

We are looking for a passionate Senior Researcher focused on AI safety and red-teaming. In this role, you will design and execute innovative attacks, contribute to adversarial evaluations, and deepen our understanding of how safety measures can fail and how they can be improved. Your findings will significantly impact OpenAI's product releases and long-term safety strategies.

Key Responsibilities

Create and implement worst-case demonstrations that clarify AGI alignment risks for stakeholders, particularly in critical use cases.
Develop comprehensive adversarial and system-level evaluations based on these demonstrations, promoting their integration across OpenAI.
Design automated tools and frameworks to enhance our red-teaming and stress-testing capabilities.

Apr 28, 2026

Research Resident

Perplexity

Full-time|$220K/yr|On-site|San Francisco

The Perplexity Research Residency stands as our premier initiative designed to empower outstanding research talent from diverse fields to influence the future of artificial intelligence (AI). This program opens avenues for exceptional researchers, engineers, and analysts from disciplines outside traditional AI research to make significant contributions to the advancement of AI and its implications for users. We welcome applications from theoretical physicists, cognitive scientists, biochemists, quants, mathematicians, philosophers, and distinguished researchers in any other relevant field.

For comprehensive details on the Perplexity Research Residency and the application process, please visit our program homepage. We encourage you to review the “What We’re Looking For” section for the specific criteria that will guide our selection of candidates.

The annual cash compensation for this position is set at $220,000, prorated for a three-month term.

Feb 20, 2026

Research Engineer, Interpretability

Anthropic

On-site|San Francisco, CA

Join Anthropic as a Research Engineer focused on AI Interpretability, where you'll explore the inner workings of advanced language models. Our Interpretability team is committed to reverse-engineering trained models to enhance safety and trust. If you're passionate about mechanistic interpretability and eager to contribute to the understanding of neural networks, we want to hear from you. Engage with cutting-edge research and collaborate with experts in the field to make AI systems more reliable and interpretable.

Jan 29, 2026

Research Operations Specialist in Economic Research

Anthropic

Full-time|Remote|San Francisco, CA

Join Anthropic as a Research Operations Specialist focused on Economic Research. In this role, you will facilitate the smooth execution of research projects and support our team in analyzing and interpreting economic data. Your contributions will play a crucial part in driving our mission to create safe and beneficial AI systems.

Mar 16, 2026

Researcher, Alignment

OpenAI

Full-time|Hybrid|San Francisco

Join Our Innovative Team

At OpenAI, our Alignment team is committed to building AI systems that prioritize safety, trustworthiness, and alignment with human values, even as these systems evolve and grow in complexity. We are at the forefront of AI research, developing advanced methodologies to ensure that AI adheres to human intent across diverse scenarios, including high-stakes and adversarial environments. Our focus is on tackling the most critical challenges, addressing areas where AI can have profound impacts. By quantifying risks and making meaningful improvements, we aim to prepare our models for the complexities of real-world applications.

Our approach is built on two foundational pillars: (1) integrating enhanced capabilities into alignment, ensuring our techniques evolve positively with increasing capabilities, and (2) centering human input through the development of mechanisms that allow humans to communicate their intent and effectively monitor AI systems, even in intricate situations.

Your Role in Shaping the Future

As a Research Engineer / Scientist on our Alignment team, you will play a pivotal role in ensuring our AI systems align with human intent in complex and unpredictable contexts. Your responsibilities will include designing and implementing scalable solutions that maintain alignment as AI capabilities expand, while incorporating human oversight into AI decision-making processes.

This position is based in San Francisco, CA, and follows a hybrid work model of three days in the office each week. We also offer relocation assistance to new team members.

Key Responsibilities:

Develop and assess alignment capabilities that are context-sensitive, subjective, and challenging to quantify.
Create evaluations to accurately measure risks and alignment with human values and intentions.
Construct tools and evaluations to examine model robustness across various scenarios.
Design experiments to explore how alignment scales with compute resources, data, context lengths, actions, and adversarial influences.
Innovate new Human-AI interaction frameworks and scalable supervision methods that enhance human engagement and understanding of AI systems.

Aug 27, 2024

Research Scientist

OpenAI

Full-time|On-site|San Francisco

Join OpenAI as a Research Scientist and explore cutting-edge machine learning innovations. In this role, you will be at the forefront of developing groundbreaking techniques while advancing our team's research initiatives. Collaborate with talented peers across various teams to discover transformative ideas that scale effectively. We seek individuals who are passionate about pushing the boundaries of AI and want to contribute to our unified research vision.

Apr 5, 2025

Research Manager

Cloudflare, Inc.

Full-time|Hybrid|Hybrid

Join Cloudflare as a Research Manager and play a pivotal role in driving innovative research initiatives that enhance our security and performance solutions. You will lead a team of skilled researchers, collaborating closely with cross-functional teams to identify market trends and develop groundbreaking strategies that align with our business objectives.

Your responsibilities will include overseeing research projects from inception to completion, analyzing data to derive actionable insights, and presenting findings to stakeholders. You will foster a culture of creativity and critical thinking, ensuring that our research efforts remain at the forefront of industry standards.

Feb 6, 2026

Senior Lead, Research & Evaluation

aiedu

Full-time|On-site|San Francisco, United States

Join aiedu as a Senior Lead in Research & Evaluation, where you will drive impactful research initiatives that shape educational practices and policies. In this role, you will lead a team of researchers in designing and executing comprehensive evaluations that inform our strategic direction. Your expertise will be critical in analyzing data, generating insights, and communicating findings to stakeholders.

Mar 13, 2026

Bilingual Medical Interpreter/Translator - Spanish

Stanford Medicine Children's Health

Full-time|On-site|San Francisco

Stanford Medicine Children's Health seeks a Bilingual Medical Interpreter/Translator with Spanish language skills for its San Francisco team. This role centers on supporting clear communication between healthcare providers and Spanish-speaking patients, ensuring medical information is conveyed accurately in both directions.

Key Responsibilities

Interpret spoken conversations during appointments, procedures, and consultations between medical staff and Spanish-speaking patients.
Translate written medical documents, instructions, and forms between English and Spanish.
Assist patients in understanding diagnoses, treatment plans, and care instructions.
Contribute to a welcoming and informed experience for patients and families from diverse backgrounds.

Role Impact

By providing accurate interpretation and translation, this position supports the quality of care for Spanish-speaking patients and families. Effective communication helps ensure that everyone involved understands essential medical information, which contributes to improved health outcomes and patient satisfaction.

Apr 23, 2026

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAI

Full-time|Hybrid|San Francisco

About Our Team

Join the Foundations Research team, where we tackle ambitious and innovative projects that could redefine the future of AI. Our mission is to enhance the science behind our training and scaling initiatives, focusing on pioneering frontier models. We are dedicated to advancing data utilization, scaling methodologies, optimization strategies, model architectures, and efficiency enhancements to accelerate our scientific breakthroughs.

About the Position

We are on the lookout for a dynamic technical research lead to spearhead our embeddings-focused retrieval initiatives. You will oversee a talented team of research scientists and engineers committed to developing foundational technologies that enable models to access and utilize the right information precisely when needed. This includes crafting innovative embedding training objectives, architecting scalable vector storage, and implementing adaptive indexing techniques.

This pivotal role will contribute to various OpenAI products and internal research initiatives, offering opportunities for scientific publication and significant technical influence.

This position is located in San Francisco, CA, where we embrace a hybrid work model, requiring three days in the office weekly, and we provide relocation assistance for new hires.

Your Responsibilities

Lead cutting-edge research on embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning.
Supervise a team of researchers and engineers in building an end-to-end infrastructure for training, evaluating, and integrating embeddings into advanced models.
Drive advancements in dense, sparse, and hybrid representation techniques, metric learning, and retrieval systems.
Work collaboratively with Pretraining, Inference, and other Research teams to seamlessly integrate retrieval throughout the model lifecycle.
Contribute to OpenAI's ambitious vision of developing AI systems with robust memory and knowledge access capabilities rooted in learned representations.

You Will Excel in This Role If You Possess

A proven track record of leading high-performance teams of researchers or engineers within ML infrastructure or foundational research.
In-depth technical knowledge in representation learning, embedding models, or vector retrieval systems.
Familiarity with transformer-based large language models and their interaction with embedding spaces and objectives.
Research experience in areas such as contrastive learning and retrieval-augmented generation.

Jun 16, 2025

Research Operations Analyst

Listen Labs

Full-time|On-site|San Francisco, CA

Overview

Join Listen Labs as a Research Operations Analyst, where you will be pivotal in ensuring the integrity and effectiveness of our research studies. Your role will involve real-time monitoring of study performance, identifying potential problems early, and ensuring that our research processes remain on track.

About Listen Labs

Listen Labs is an innovative AI-driven research platform dedicated to helping teams swiftly derive insights from customer interviews. Our technology accelerates the analysis of conversations, identifies key themes, and supports rapid, informed product development decisions.

Why Join Us?

Outstanding Team: Our founders are experienced entrepreneurs with a successful AI exit, bringing together talent from industry leaders such as Jane Street, Twitter, Stripe, and Goldman Sachs.
Fast Growth: Our dynamic 40-member team is backed by Sequoia, achieving an impressive $14M run-rate in under a year. We prioritize quality craftsmanship and value ownership.
Notable Success: We are experiencing rapid growth across various sectors, securing significant contracts with major enterprises including Google, Microsoft, and Nestlé.
Exceptional Performance: Our highly differentiated product contributes to an industry-leading win rate.
Market Validation: We continue to gain traction across all segments, with substantial agreements leading to swift expansions.
Viral Product: Our interviews reach tens of thousands of viewers, driving organic growth and attracting attention from Fortune 500 companies.

Your Responsibilities

Serve as the primary oversight for all ongoing studies, maintaining a comprehensive understanding of their real-time status.
Track recruitment progress, response rates, completion timelines, and quality indicators on a daily basis.
Identify and escalate issues promptly when studies appear to deviate from expected parameters, providing clear context to relevant stakeholders.
Conduct standardized quality assessments on incoming data, including completion and screen-through rates.
Keep internal trackers and dashboards updated, ensuring the team has immediate access to study health information.

Mar 4, 2026

Post-Training Applied Researcher

Baseten

Full-time|Remote|San Francisco

Join Baseten as a Post-Training Applied Researcher, where you will be at the forefront of innovative research applications. Your expertise will help bridge the gap between training and real-world applications, making a tangible impact in the industry.

Mar 17, 2026
