Research Engineer, Frontier Red Team (Autonomy)

Anthropic, San Francisco, CA

On-site Full-time

Qualifications

A Master's degree or Ph.D. in Computer Science, AI, Robotics, or a related field. Proven experience in AI research and development, particularly in autonomous systems and adversarial AI. Strong programming skills in Python and experience with machine learning frameworks. Familiarity with cybersecurity principles and practices is preferred. Excellent problem-solving abilities and a collaborative mindset.

About the job

About Anthropic

At Anthropic, we are dedicated to developing AI systems that are reliable, interpretable, and controllable. Our mission is to ensure that AI benefits individuals and society as a whole. We are a rapidly expanding team of passionate researchers, engineers, policy experts, and business leaders united in our efforts to create safe and beneficial AI systems.

About the Team

The Frontier Red Team (FRT) is a specialized technical research group within Anthropic's Policy division. Our mission is to enhance global safety in the age of advanced AI by thoroughly understanding the capabilities of these systems and developing effective defenses against potential threats.

In 2026, we are concentrating on research aimed at ensuring the safety of self-improving, highly autonomous AI systems, particularly those with cyberphysical capabilities. Explore our previous work on cyberdefense, robotics, and Project Vend. This is groundbreaking research with the potential for significant impact.

About the Role

As a Research Engineer on our team, you will tackle the critical challenge of defending against the potential adversarial use of powerful, autonomous, self-improving AI systems.

Your role will involve constructing and evaluating model organisms of autonomous systems and developing the defensive mechanisms necessary to counteract them. This work lies at the intersection of AI capabilities research, security, and policy; what we discover will directly influence how Anthropic and the wider world prepare for advanced AI.

This is applied research with substantial implications. Your contributions will inform decisions at the highest echelons of the company, aid in public demonstrations that shape policy discussions, and help develop technical defenses that could be crucial as AI systems evolve.

What You Will Do

Design and construct autonomous AI systems capable of utilizing tools and operating in varied environments, creating model organisms that enhance our understanding and defenses against advanced adversarial AI.
Develop evaluations and training environments to influence agent behavior in beneficial ways.
Create defensive agents that can detect, disrupt, or outmaneuver adversarial AI systems in realistic scenarios.
Integrate Claude with hardware platforms (e.g., robotics, physical systems) to assess cyberphysical risks and defenses.

About Anthropic

Anthropic is at the forefront of AI innovation, committed to building AI technologies that prioritize safety, transparency, and user control. Our diverse team combines expertise from various domains to create AI systems that are not only advanced but also aligned with societal values. Join us in our mission to shape the future of AI for the betterment of all.

Similar jobs

Autonomous Code Validation Research Engineer

Greptile

Full-time|On-site|San Francisco

Join us at Greptile, where we are pioneering the development of agents that autonomously validate code changes. Currently, our AI technology reviews an astounding 1 billion lines of code monthly for over 3,000 companies, ensuring quality and compliance by identifying bugs and enforcing coding standards.

Challenges We're Excited To Tackle

Can we create agents that learn coding standards through experience, much like a new employee would?
How can we tailor pull request feedback for each customer using sample-efficient reinforcement learning to enhance the signal-to-noise ratio?
What if we could deploy feature branches autonomously and utilize agents to run tests and identify bugs before they reach production?

Our Growth Path

Over 7,000 satisfied customers
Secured $30 million in funding from esteemed investors including Benchmark, Y Combinator, Paul Graham, and Initialized Capital

Our Team

We are a compact, highly skilled team with experience scaling critical engineering functions at industry leaders such as Stripe, Google, Figma, and LinkedIn.

Desired Qualifications

Bachelor's degree in Computer Science or a related field
Experience in research, particularly with machine learning, language models, or agent-based systems
Proficient programming skills coupled with a strong intuition for product development

Key Responsibilities

Explore and implement cutting-edge advancements in agents and language models to enhance our product capabilities
For instance, you may investigate multi-agent architectures, prototype a multi-agent code review system, and collaborate with the team to integrate successful solutions into our production environment
Stay abreast of the latest research in large language models, information retrieval, and developer tools

Nov 19, 2025

Perception Sensor Validation Engineer

Bedrock Robotics

Full-time|On-site|San Francisco, CA

Be a Part of Our Team Revolutionizing Autonomy in Construction

At Bedrock Robotics, we are dedicated to transitioning artificial intelligence from theoretical frameworks to practical applications. Our talented team, comprising industry leaders who played pivotal roles in launching Waymo, driving Segment's $3.2 billion acquisition, and propelling Uber Freight to $5 billion in revenue, is on a mission to implement autonomous systems in heavy construction machinery throughout the nation. By streamlining billion-dollar infrastructure projects and enhancing safety on job sites, we are responding to the urgent need for housing, data centers, and manufacturing facilities amid the construction industry's labor shortage. In this role, you will bridge the gap between algorithms and heavy machinery, collaborating with seasoned construction professionals and elite engineers to tackle real-world challenges that traditional simulations cannot address. If you’re eager to leverage cutting-edge technology to make a significant impact alongside a skilled team, we invite you to join us.

We are in the process of launching our inaugural fleet of retrofitted autonomous construction vehicles, and we seek a Perception Sensor Validation Engineer to develop and enhance our sensor pipelines, including Lidar, cameras, IMUs, and GPS.

Jan 31, 2026

Senior Platform DevOps Engineer (Cloud + On-Prem) at code-metal | San Francisco

code-metal

Full-time|On-site|San Francisco, California, United States

Join code-metal as a Senior Platform DevOps Engineer, where you will play a pivotal role in enhancing our cloud and on-premises infrastructure. You will be responsible for deploying, managing, and optimizing systems to ensure high availability and performance. This position offers an exciting opportunity to work with cutting-edge technologies and collaborate within a dynamic team.

Apr 3, 2026

Reinforcement Learning Engineer at Code Metal AI | Remote

Code Metal AI

Full-time|Remote|Remote — San Francisco, California, United States

Join Code Metal AI's elite team, composed of talents from MIT, OpenAI, and other esteemed organizations, as we lead the charge in pioneering large language models (LLMs) and advanced code generation techniques. Our innovative projects engage with top-tier chip manufacturers, leveraging cutting-edge AI to tackle significant, real-world challenges.

This position serves as a critical link between two essential domains:

Production Responsibilities:

Establish and uphold resilient distributed training systems utilizing PyTorch (2+ years of experience required).
Design and execute scalable data curation and quality assurance pipelines to ensure high-quality training datasets.
Create orchestration tools that streamline complex workflows for large-scale AI model training and evaluation.

Research Responsibilities:

Lead the innovation in developing evaluation frameworks and reinforcement learning solutions, emphasizing recent advancements in Reinforcement Learning from Human Feedback (RLHF).
Engage with cutting-edge research through open-source contributions and potential publications, focusing on applying RLHF to LLMs, particularly in code generation tasks.

Qualifications:

Minimum of 2 years of experience in distributed training, preferably using PyTorch.
Strong foundation in reinforcement learning, with recent RLHF experience being highly preferred.
Demonstrated ability to construct data curation and quality assurance pipelines.
Experience in developing evaluation frameworks.
Ideally, familiarity with both data pipeline and orchestration aspects.
Eligibility for TS/SCI clearance.

Preferred Qualifications:

Contributions to open-source AI or ML initiatives.
Published research or experience in relevant fields.
Hands-on experience applying RLHF to LLMs, especially for code generation.
Experience in large-scale synthetic data generation.

Benefits:

Comprehensive healthcare plan with 100% premium coverage, including medical, dental, and vision.
401k plan with 5% matching contribution.
Unlimited Paid Time Off, along with Sick leave and Public Holidays.
Flexible hybrid work arrangement.
Relocation assistance for eligible employees.

Aug 11, 2025

Research Engineer in Calibration

Waabi

Full-time|On-site|San Francisco, CA

Waabi builds AI-driven autonomous transportation systems, with a focus on trucks and robotaxis. The team includes specialists in AI, automotive, logistics, and deep technology, working across locations in Toronto, San Francisco, Dallas, and Pittsburgh. More details about the company can be found at www.waabi.ai.

Role overview

The Research Engineer in Calibration will contribute to the development of advanced calibration systems for autonomous vehicles. This position collaborates closely with scientists and engineers to deliver scalable solutions that enhance the safety and efficiency of self-driving technology.

What you will do

Collaborate with a multidisciplinary team to design scalable calibration systems for various sensors, including camera, lidar, radar, IMU, and GNSS.
Create calibration algorithms using both classical and learning-based approaches in 3D geometry, prioritizing computational efficiency, reliability, and accuracy for safety-critical use cases.
Develop and implement real-time monitoring algorithms that assess calibration integrity, supporting safe operation throughout a diverse fleet of autonomous vehicles.
Oversee the deployment of calibration processes at scale across the entire vehicle fleet.

Apr 22, 2026

ASIC Validation Engineer

Block, Inc.

Full-time|On-site|Bay Area, CA, United States of America

Role Overview

Block, Inc. is looking for an ASIC Validation Engineer in the Bay Area, CA. This role focuses on validating ASIC designs and confirming their performance meets expectations. The work involves collaborating with teams across different functions and running thorough validation tests.

What You Will Do

Validate ASIC designs using established and custom test procedures
Work closely with engineering and product teams to optimize design outcomes
Contribute to projects that influence future technology directions

Apr 17, 2026

Head of Marketing at Code Metal | San Francisco

Code Metal

Full-time|Remote|San Francisco, California, United States

About Code Metal

At Code Metal, we are pioneering the transformation of code translation in critical sectors, empowering partners in defense, automotive, and semiconductor industries to accelerate their journey from algorithm to silicon with unmatched reliability. We seek an innovative marketing leader to take charge of our positioning and demand generation efforts.

The Role:

In this pivotal position, you will be responsible for building our brand identity, shaping our narrative, enhancing our visibility, and formulating actionable product marketing strategies. Collaborating closely with our executive team, you will oversee product and content marketing, translating the complexities of Code Metal’s technology into compelling value propositions.

This position is ideal for a hands-on strategist who excels at the intersection of advanced technology and storytelling. As the first dedicated marketing leader, you will initially function as a solo marketing team, developing scalable programs without the support of a large team.

Key Responsibilities:

Formulate and implement Code Metal’s marketing strategy in collaboration with sales leaders across the defense, automotive, aerospace, and semiconductor sectors.
Work closely with senior leadership on branding and storytelling, ensuring these elements are reflected across our web presence, branding materials, and messaging.
Design and execute a measurable product marketing strategy to drive engagement.
Conduct thorough research to uncover potential customers, market opportunities, and industry trends.
Establish partnerships and channel relationships, as well as define our event strategy.
Lead inbound and outbound marketing campaigns aligned with our growth objectives.

Why Choose Code Metal?

Purpose-Driven Mission: Join us in accelerating innovation in mission-critical industries with proven AI solutions.
Agile Environment: Work in tight feedback loops with small teams: set a strategy in the morning and execute it by evening.
Ownership and Impact: Take the reins of our marketing, branding, and storytelling efforts without any spectators.

Nov 12, 2025

Robotics Research Engineer at Physical Intelligence | San Francisco

Physical Intelligence

Full-time|On-site|San Francisco

About Physical Intelligence

Physical Intelligence is building general-purpose AI for the physical world. The team brings together engineers, scientists, roboticists, and entrepreneurs focused on foundational models and learning algorithms for robots and interactive devices.

Role Overview

The Robotics Research Engineer works at the intersection of hardware, software, and large-scale model training. The goal: develop efficient autonomous robot policies that move the field forward.

What You Will Do

Design robotic systems and data collection pipelines to generate high-quality training data
Develop learning algorithms that turn collected data into reliable, effective robot policies
Contribute to vision-language-action models, from concept through implementation
Help shape datasets, research infrastructure, and the direction of robotics research at Physical Intelligence

Location

San Francisco

Apr 13, 2026

Manager of Quality Engineering & AI Validation

Accordion

Full-time|On-site|Atlanta; Boston; Charlotte; Chicago; Dallas; Los Angeles; New York; San Francisco

Accordion is seeking a Manager of Quality Engineering & AI Validation to guide a team focused on upholding quality standards in AI projects. This position is based in several major cities, including Atlanta, Boston, Charlotte, Chicago, Dallas, Los Angeles, New York, and San Francisco.

Role overview

This leadership role centers on shaping and maintaining AI validation processes. The manager will play a key part in developing and refining methodologies that support reliable, high-quality AI solutions for clients.

What you will do

Lead a quality engineering team dedicated to AI-driven initiatives
Oversee and improve quality assurance protocols
Implement thorough testing strategies to ensure product reliability
Collaborate with cross-functional teams to boost product performance

Requirements

Strong background in quality engineering
Experience with AI technologies
Demonstrated leadership skills
Interest in developing and managing validation processes

Apr 29, 2026

Senior Systems Verification & Validation Engineer

Server Robotics

Full-time|On-site|San Francisco Bay Area

Join Server Robotics as a Senior Systems Verification & Validation Engineer, where you will play a pivotal role in ensuring the integrity and performance of our cutting-edge robotic systems. In this position, you'll be responsible for designing, implementing, and executing verification and validation strategies that guarantee our products meet the highest standards of quality and reliability.

Apr 6, 2026

Software Engineer for Autonomous Delivery Network

Zipline

Full-time|$170K/yr - $210K/yr|On-site|South San Francisco, California, USA

Software Engineer, Delivery Network Platform

Join Zipline, where we are revolutionizing logistics with an autonomous delivery network. As part of the Delivery Network Platform team, you will develop the foundational systems that enable aircraft, sites, and infrastructure to operate seamlessly in live delivery scenarios. Your work will involve creating software solutions that provide operators with real-time insights and control, designing orchestration systems that manage fleet movements, and developing validation platforms to ensure the network's reliability as it scales.

Your Responsibilities

You will be responsible for software systems that are pivotal to fleet operations, including:
Network Operating Center software for real-time visibility and interventions across aircraft, sites, missions, weather, and demand.
Fleet orchestration systems for assignment, routing, scheduling, and rebalancing tasks.
Maintenance and asset health systems linking issue detection to service readiness.
Simulation and validation platforms to assess topology, load, and policy changes prior to production.
Platform interfaces and configurable control planes that empower other teams to safely extend the network.

Tackling Complex Challenges

Unlike typical software roles focused on digital experiences, this position plays a critical role in managing a live autonomous logistics network. You'll address challenges such as:
Maintaining an accurate real-time view of aircraft and essential site assets across the network.
Ensuring the network remains operational amidst shifting demand, changing weather conditions, infrastructure issues, or capacity constraints.
Creating user-friendly operator control interfaces that facilitate quick and accurate decision-making under pressure.
Simulating potential future network behaviors to mitigate risks before they impact production.

These systems directly affect operational performance. You will own significant components of the platform, make critical technical and product decisions, and have a substantial impact on the network's effectiveness.

Team Dynamics

Our team operates with a strong emphasis on ownership, trust, and high technical standards. Engineers are expected to identify significant problems, develop a clear vision for system functionality, and drive solutions from conception to production. Additionally, we encourage engineers to leverage AI tools to enhance exploration, implementation, and debugging processes while upholding strong engineering principles, judgment, and accountability.

Mar 23, 2026

Lead Staff Engineer in Autonomous Driving Systems

AeroVect

Full-time|On-site|San Francisco

AeroVect is seeking a highly skilled Staff Engineer to elevate our autonomous driving systems within structured, low-speed environments. In this pivotal role, you will spearhead the growth and development of our core autonomy software team in a dynamic, early-stage startup environment. Your extensive experience in building production-grade systems will drive the AeroVect Driver to new heights, establishing industry-leading vehicle autonomy tailored for the airport operational design domain.

Your responsibilities will encompass leading the design and implementation of vital enhancements across core modules such as planning, prediction, perception, localization, and controls. You will integrate both innovative techniques and reliable, off-the-shelf solutions to optimize all autonomous driving and towing functionalities, delivering exceptional value and efficiency to the modern supply chain.

This role presents an exciting opportunity for a technically adept, hands-on leader to contribute to a groundbreaking enterprise product that merges autonomous vehicle technology with a robotics-as-a-service (RaaS) business model. You will report directly to our co-founders.

Key Responsibilities:

Define, lead, and manage the development of an autonomous driving stack specifically designed for structured logistics environments. Anticipate spending 70% of your time on hands-on development while dedicating the remaining 30% to defining project requirements, managing schedules, overcoming challenges, mentoring the team towards ambitious goals, and building a team of elite engineers.
Oversee a team of engineers responsible for the deployment of all components essential for dependable autonomous driving operations in the airside environment, including vehicle corridors and apron areas.
Design, implement, test, and maintain all facets of ground vehicle autonomy, incorporating planning, prediction, perception, localization, controls, and infrastructure subsystems.
Ensure all subsystems are qualified using objective metrics, with a strong focus on functional safety and adherence to systems engineering best practices.
Collaborate with vehicle engineering teams to forge an integrated system, addressing sensor and computing selection and integration.
Stay informed of the latest advancements in the field.

Oct 12, 2021

Strategic Projects Coding Internship

AfterQuery

Internship|On-site|San Francisco

Join AfterQuery as a Strategic Projects Intern, where you will play a crucial role in shaping the future of artificial intelligence. This full-time internship in San Francisco during the Spring 2026 semester offers a unique opportunity to collaborate directly with our co-founder and CEO, Spencer, on pivotal operations and go-to-market strategies.

Company Overview

AfterQuery is a pioneering research lab at the forefront of artificial intelligence, exploring innovative datasets and conducting groundbreaking experimentation. We cater to leading foundation model labs and work closely with cutting-edge AI organizations.

Headquartered in San Francisco, CA, we have attracted investment from renowned backers such as Y Combinator and BoxGroup, alongside influential figures from Google DeepMind and Meta GenAI.

Our founding team comprises experts from prestigious institutions including Jane Street, Meta, Citadel Securities, Google, Goldman Sachs, Morgan Stanley, Silver Lake, Berkeley Artificial Intelligence Research (BAIR), and Stanford Artificial Intelligence Laboratory (SAIL).

Key Responsibilities

Assist in the creation of expert-generated datasets.
Recruit and manage teams of domain experts across various specialized fields, including healthcare, software engineering, finance, and law.
Work collaboratively with internal teams to advance innovative AI research projects such as UI-Bench and FinanceQA.
Contribute to the design of effective sales processes and strategies, including sourcing potential new customers and developing targeted messaging and value propositions.
Provide operational support to the founding team across various initiatives.

Nov 5, 2025

Compiler Code Generation Engineer

Lemurian Labs

Full-time|On-site|SF Bay Area

Join Lemurian Labs on our ambitious mission to harness the potential of artificial intelligence while minimizing our ecological impact. Our commitment to responsible innovation drives us to create sustainable AI solutions that benefit society and the environment alike. After all, innovation should empower the world, not compromise it.

We are developing a cutting-edge, high-performance compiler that enables developers to 'build once, deploy anywhere.' This means seamless cross-platform compatibility, allowing you to train your models in the cloud and deploy them at the edge, all while ensuring optimal resource efficiency and scalability.

If you are passionate about scaling AI sustainably and making AI development both powerful and accessible, we invite you to be a part of our team at Lemurian Labs. Collaborate with us as we build the future responsibly and innovatively.

Mar 13, 2025

Research Engineer in Economic Research

Anthropic

Full-time|On-site|San Francisco, CA

Join Anthropic as a Research Engineer focusing on Economic Research. In this role, you will leverage your analytical skills to conduct in-depth economic analysis and contribute to innovative projects aimed at enhancing our understanding of economic models and their implications.

Mar 12, 2026

Senior Electrical Engineer - Autonomous Robotics

DoorDash, Inc.

Full-time|$170K/yr - $250K/yr|On-site|San Francisco, CA

DoorDash Labs is an innovative team at DoorDash, dedicated to developing autonomous delivery robots and cutting-edge autonomy solutions that power DoorDash's delivery platform, utilized by millions worldwide. If you are passionate about the intersection of robotics and service technology, we would love to connect with you!

About the Role

We are on the lookout for a highly skilled and hands-on Senior Electrical Engineer to spearhead the design and development of electrical systems for our four-wheeled autonomous delivery robot. This pivotal role involves delivering high-quality, production-ready hardware that encompasses high-speed digital design, PCB assembly, RF integration, harnessing, and board-level packaging.

This position is highly technical, ideal for individuals who can navigate both system-level concepts and intricate circuit designs with ease.

What You’ll Do

Design and innovate electrical systems for a complex mobile robotic platform, from initial concept to production.
Lead high-speed digital design initiatives, including PCIe, MIPI, Ethernet, DDR, and multi-gigabit SERDES.
Oversee PCBA development, including schematic capture, stack-up definition, layout guidance, impedance control, and design for manufacturability.
Drive RF integration efforts including Wi-Fi, LTE/5G, GNSS, Bluetooth, RADAR, and LiDAR, focusing on antenna placement, layout constraints, and signal integrity.
Develop and assess cable harness designs, connector selection, grounding strategies, and environmental robustness.
Collaborate closely with mechanical engineers on board packaging, thermal management, vibration resistance, and environmental sealing.
Define and implement power distribution architectures, battery interfaces, protection circuits, and system monitoring.
Ensure adherence to EMI/EMC and regulatory standards (FCC/CE) including CISPR-25 and FCC Part 15 Class A.
Support prototype builds and troubleshoot hardware/firmware interface issues, quickly identifying root causes and driving corrective actions.
Establish design standards and conduct comprehensive design reviews within the electrical engineering team.
Engage cross-functionally with firmware, robotics, systems, manufacturing, and reliability teams.

Feb 23, 2026

Senior Planning Engineer - Autonomous Driving Systems

AeroVect

Full-time|On-site|San Francisco

AeroVect is seeking a highly skilled Senior Planning Engineer to contribute to the design of cutting-edge planner systems for autonomous driving in structured, low-speed environments.

In this pivotal role, you will take ownership of, enhance, and scale a critical planning module within our fast-paced, innovative startup. Drawing on your expertise in developing production-grade planners, you will elevate the AeroVect Driver to adeptly manage a variety of driving scenarios, establishing a benchmark for vehicle autonomy in airport operational design.

Your responsibilities will encompass leading the system design and executing key enhancements to the existing AeroVect planner, which includes the global mission planner, behavior planner, and motion planner.

As a generalist engineer, you will be integral to the core autonomy team, focusing on system requirements and validating autonomous driving capabilities across multiple areas of the autonomy stack. Your key tasks will involve designing, implementing, testing, and documenting robotic systems and features in C/C++ across both desktop and embedded platforms.

This is an exciting opportunity for a technically adept and hands-on team leader to help forge a market-defining enterprise product that merges autonomous vehicle technology with a robotics-as-a-service (RaaS) business model. You will work closely with our co-founders and the autonomy engineering team.

Key Responsibilities

Define, implement, and take ownership of practical enhancements to the core planner module, focusing on achieving milestones, expanding the team, and collaborating with internal and external partners. Expect to dedicate 80% of your time to hands-on development while leading the planning module roadmap, managing schedules, eliminating obstacles, and mentoring the team to achieve ambitious goals.
Lead a team of engineers in deploying all components essential for reliable autonomous driving in airside environments, including vehicle corridors and aprons.
Design, implement, test, and support all facets of ground vehicle autonomy, which includes planning, prediction, perception, localization, controls, and infrastructure subsystems.
Qualify all subsystems using objective metrics, with a strong emphasis on functional safety and adherence to systems engineering best practices.
Collaborate with vehicle engineering teams to develop an integrated system, encompassing sensor and compute selection and integration.
Stay informed about the latest advancements in the field.

Nov 27, 2021

Research Engineer, Frontier Red Team (Autonomy)

Anthropic

Full-time|On-site|San Francisco, CA

Jan 29, 2026


Software Engineer (Generalist) at greptile | San Francisco

greptile

Full-time|On-site|San Francisco

Join our innovative team at greptile, where we are developing cutting-edge agents that autonomously validate code changes. Our advanced AI technology reviews pull requests on GitHub, identifies bugs, and enforces coding standards. Currently, we are analyzing nearly 1 billion lines of code monthly for over 3,000 companies.

Exciting Challenges Ahead

Can we create agents that learn coding standards as intuitively as a new hire might absorb them?
How can we optimize PR feedback based on each customer's preferences, possibly utilizing sample-efficient reinforcement learning?
Is it possible to autonomously deploy feature branches and utilize agents to rigorously test applications, identifying potential bugs?

Growth and Vision

Serving over 7,000 customers and counting.
Secured $30 million in funding from top-tier investors like Benchmark, Y Combinator, and Initialized Capital.

Our Team

Our team is composed of highly skilled individuals who have previously scaled essential functions at renowned companies such as Stripe, Google, and Figma.

Your Responsibilities:

Tackle complex challenges, including LLM memory, multi-language codebase indexing, and semantic search for expansive codebases.
Design, implement, test, and deploy comprehensive features.
Gather user feedback to refine and enhance features.

Nov 19, 2025


Staff AI Product Engineer - Code

Semgrep

Full-time|$202K/yr - $238K/yr|On-site|San Francisco, Boston, New York, Denver

About Semgrep

Semgrep stands at the forefront of code security, enabling developers to innovate seamlessly. We empower teams to identify, flag, and resolve real issues before deployment, utilizing adaptive security that evolves as development progresses. With Semgrep, code is secured in real-time, providing developers with the freedom to work swiftly while maintaining security integrity. Designed for developers and endorsed by security teams, Semgrep integrates into the developer's workflow, offering solutions without disrupting productivity, while giving security teams essential visibility and control. Our AI-driven approach minimizes false positives and prioritizes actionable vulnerabilities, earning the trust of 95% of security reviewers across over 6 million findings. Semgrep is making the dream of zero false positives a reality, enabling AppSec teams to manage 80% fewer false positives across Code and Supply Chain, significantly reducing backlog.

Founded in San Francisco and supported by top-tier investors including Menlo Ventures, Felicis Ventures, Lightspeed Venture Partners, Redpoint Ventures, and Sequoia Capital, Semgrep has gained recognition from Gartner in Application Security Testing and is relied upon by industry leaders such as Snowflake, Dropbox, and Figma. Discover more at semgrep.dev.

About the Role

As a Staff AI Product Engineer within Semgrep’s Code team, you will leverage cutting-edge AI/ML technologies from leading companies (including OpenAI, Anthropic, Hugging Face, Amazon, Google) to develop user-centric security tools that accelerate the process of writing and deploying secure software.

The Semgrep Code product enhances the software development lifecycle by pinpointing genuine vulnerabilities without hindering productivity. Unlike other security solutions that inundate developers with irrelevant alerts, we provide clear, actionable, and intuitive insights. The advancements in AI are already transforming how we minimize noise, and we believe there’s even more to unlock in the future.

You will gain insights into the application-security domain, mentor fellow engineers, collaborate with product managers, security researchers, and application developers, while contributing to features that delight our customers. Within Semgrep’s culture of transparency, you’ll observe and impact the decisions that drive a startup’s success. Your contributions will be pivotal in establishing Semgrep as the leading code analysis initiative and a trusted security platform.

Your Responsibilities:

Integrate AI platform APIs into the Semgrep Code product
Develop and optimize LLM prompt chains for real-world developer scenarios
Explore the latest advancements in AI/ML and evaluate their potential for product integration
Collaborate with cross-functional teams to enhance product features and functionality

Sep 23, 2025
