About the job
At Databricks, we are driven by a passion for empowering data teams to tackle some of the world's most challenging problems—whether it's revolutionizing transportation or accelerating medical advancements. Our mission is to build and operate the premier data and AI infrastructure platform, enabling our clients to harness deep data insights to enhance their businesses.
Founded by engineers and deeply committed to our customers, we eagerly embrace every opportunity to confront technical challenges. From designing next-gen UI/UX for data interfaces to scaling our services and infrastructure across millions of virtual machines, we are just getting started.
As part of our ongoing commitment, we are embarking on a multi-year journey to develop the best Lakehouse Platform. While we are building on a solid foundation, our aim is to create significantly improved products. This involves re-evaluating every component to deliver our customers the fastest, easiest to use, and most secure data platform for all their data workloads.
As a Software Engineer, you will play a vital role as a founding member of our London site and the core team in our journey towards achieving the Lakehouse vision. You will be engaged in the complete development lifecycle and will embody the core values that define Databricks.
Your Impact:
Our backend teams tackle diverse challenges across essential service platforms. You may work on:
- Complex problems that encompass product and infrastructure components, including distributed systems, scalable service architecture, monitoring, workflow orchestration, and enhancing developer experience.
- Developing reliable, secure, and high-performance services and client libraries for managing vast amounts of data on cloud storage backends such as AWS S3, GCS, and Azure Blob Store.
- Creating product features that empower our customers to seamlessly store and access their data.
- Addressing reliability issues related to Lakebase.
- Proactively identifying downtime causes and systematically eliminating root issues.
- Assisting the organization in defining SLIs, achieving SLOs, and driving long-term reliability improvements.

