About the job
Are you enthusiastic about crafting secure, high-performance, and dependable infrastructure? Do you excel at troubleshooting and resolving complex challenges in systems designed for the most demanding GraphQL workloads worldwide? If so, we want to speak with you about the opportunity to contribute to the operation and evolution of next-generation GraphQL runtime infrastructure.
The GraphOS organization is responsible for a critical tier of Apollo's platform — the foundational engine that makes GraphOS a highly performant, secure, and reliable self-service GraphQL platform. As an integral member of the GraphOS North America team, you will play a pivotal role in this mission by managing the Router and empowering engineering teams to accelerate their development processes and enhance user experiences.
Key Responsibilities
Develop, test, and maintain fault-tolerant infrastructure for GraphQL runtime platforms primarily in idiomatic Rust, ensuring top-tier security, performance, and reliability.
Diagnose and resolve escalations from enterprise clients operating some of the largest GraphQL deployments globally, with hundreds of subgraphs handling trillions of requests monthly.
Manage and enhance durable, stable public APIs utilized by the most demanding GraphQL workloads.
Engage directly with users, whether community members or enterprise customers, to understand their needs, debug issues, and provide feedback for platform evolution.
Design scalable and observable systems that seamlessly integrate into various customer infrastructure stacks, utilizing self-driven research and real-world insights.
Collaborate with engineers across teams through supportive communication and constructive code reviews to deliver cohesive solutions.
Guide and mentor teammates in architecting and crafting idiomatic Rust code while considering the broader implications of changes across domains.
Lead architectural discussions and cross-team initiatives, driving impactful changes as a technical leader or coaching others to step into leadership roles.
Create comprehensive technical designs and documentation that address cost efficiency, security, and observability.
Participate in on-call rotations as a fundamental part of the role, ensuring the reliability of mission-critical systems.

