About the job
Be a part of revolutionizing finance.
At Robinhood, we are committed to making finance accessible for everyone. With an estimated $124 trillion poised to transition to younger generations over the next two decades, we offer you the opportunity to be at the forefront of this monumental shift in cultural and financial landscapes. If you are ready to join us, we invite you to learn more.
About the Team & Role
The Cloud Networking team at Robinhood is dedicated to creating scalable, secure, and dependable networking infrastructure that facilitates communication across all our services. Our mission is to empower engineering teams to seamlessly build and operate microservices by providing foundational networking capabilities that are both resilient and transparent.
We are seeking a Staff Software Platform Engineer to architect, develop, and enhance our foundational platform for large-scale services, with a focus on AWS, Kubernetes (K8s) on Amazon EKS, modern networking strategies, and a robust Istio service mesh to ensure secure, reliable, and high-performance systems. This role is integrated into the Network Service Discovery and Communication (SDC) team, which is responsible for service discovery, traffic management, and resilient service-to-service communication across our platform.
This position is based in our Toronto office, aligning with our in-office work culture. For further details about our in-office philosophy and expectations, please consult with your recruiter.
Your Responsibilities
- Lead the technical strategy and implementation for platform networking, traffic management, and multi-tenant/multi-cluster EKS environments.
- Design and manage resilient service mesh topologies using Istio, including mTLS, traffic policies, ingress/egress rules, rate limiting, and observability, with Envoy-based extensions as necessary.
- Establish and enforce platform standards for HTTP/REST and gRPC APIs, covering versioning, contract governance, and performance/SLO baselines.
- Develop automation, tools, and streamlined processes in Python and Go (Golang) to enhance developer productivity, operational safety, and release processes.

