About the job
About the Team
The Codex Core Agent team is at the forefront of developing the foundational elements of Codex. Our mission is to enhance the agent's capabilities, expedite research efforts, and ensure these advancements are implemented effectively for our users.
This involves collaborating across various systems that empower Codex to operate seamlessly in the real world. We focus on optimizing production performance metrics such as token management, latency, reliability, cost efficiency, and capacity. Our work encompasses the core execution loop and interfaces that translate models into actionable behaviors, as well as the shared infrastructure that supports other teams in leveraging Codex. Additionally, we establish feedback mechanisms that refine models and agent behaviors based on real-world usage over time.
About the Role
We are seeking passionate engineers to develop the infrastructure that fuels Codex agents in production environments. This role centers on the systems that ensure models can execute code securely, interact with various tools, complete complex, multi-step tasks, and maintain reliability and efficiency at scale.
You will be responsible for designing and managing the infrastructure that supports sandboxed execution, orchestration, stateful workflows, application server and SDK boundaries, as well as model rollouts. Working at the intersection of distributed systems, developer tools, and AI, you will create the core components that enhance Codex's performance, safety, and reliability, making it easier for teams across the organization to build on its capabilities.
What You’ll Do
Design and implement execution environments tailored for AI agents, incorporating features like sandboxing, isolation, and reproducibility.
Develop orchestration systems for agents that handle multi-step processes and tool utilization.
Create infrastructure for the execution, testing, and debugging of code generated by models.
Establish state and memory systems that enable agents to maintain context during extended tasks.
Optimize production metrics including tokens, latency, reliability, and cost across the Codex deployment.
Assist in model rollouts, capacity planning, and managing the essential trade-offs between quality, speed, and cost to effectively handle a fleet of advanced agents at scale.
Develop shared platform capabilities that facilitate the work of product teams, partner teams, and the open-source community contributing to Codex.
You Might Be a Good Fit If You
Possess substantial experience in distributed systems or infrastructure engineering.
Have experience building systems involving containers, sandboxing, or virtualization.
Are adept at working across backend systems and collaborating with diverse teams to drive project success.

