Qualifications
Your Impact:
You will focus on three core areas that define Databricks’ networking stack:
Ingress Control Plane: Develop the control plane for Databricks’ global ingress layer. This includes programming API gateways with both static and dynamic endpoints, streamlining service onboarding, and securely exposing APIs across clouds.
Service-to-Service Communication: Create scalable solutions for service discovery and load balancing across many clusters. Provide networking abstractions so product teams can focus on their core functionalities without worrying about underlying connectivity issues.
Overload Protection: Design intelligent rate limiting and admission control systems to safeguard critical services during high load. Ensure reliability and consistent performance for both customer-facing and internal workloads.
What We Look For:
Bachelor's degree (or higher) in Computer Science or a related field.
5+ years of experience in designing and building large-scale distributed systems.
Strong proficiency in programming languages such as Java, Scala, Go, or C++.
Experience with service-oriented architectures and large-scale distributed systems.
Familiarity with cloud platforms (AWS, Azure, GCP) and container/orchestration technologies (Kubernetes, Docker).
Proven track record of delivering infrastructure that supports mission-critical workloads at scale.
Preferred: Experience in service discovery, DNS, load balancing, Envoy, or related networking systems.
About the job
Join Databricks, where we are dedicated to empowering data teams to tackle the world's most challenging problems, from enhancing AI research to driving next-generation applications. We achieve this by constructing and managing the premier data and AI infrastructure platform globally. Founded by engineers and fueled by a commitment to our customers, we take on the most challenging technical tasks, whether it's scaling distributed systems across multiple clouds or ensuring reliable, low-latency communication between thousands of services. And this is just the beginning.
As a Senior Software Engineer on the Application Traffic team, you will design and develop the systems that facilitate Databricks’ service-to-service communication across thousands of clusters in a multi-cloud environment. You will also contribute to creating abstractions that simplify networking complexities for product teams, ensuring seamless connectivity, discovery, and reliability by default.
About Databricks
At Databricks, our mission is to simplify the complexities of data and AI infrastructure, enabling organizations to harness the power of their data. With a focus on innovation and customer satisfaction, we strive to solve the most difficult challenges in the tech industry and lead the way in data analytics and machine learning.