1 - 20 of 10,799 Jobs

Search for Senior Software Engineer (Infrastructure) - HyperDX


ClickHouse
Full-time|Remote|United Kingdom (remote)

About ClickHouse: Featured on the 2025 Forbes Cloud 100 list, ClickHouse is a pioneering and rapidly expanding private cloud company. With over 3,000 clients and an annual recurring revenue (ARR) growth exceeding 250% year-over-year, ClickHouse is at the forefront of real-time analytics, data warehousing, observability, and AI workloads. The company’s remarkable growth was recently underscored by a successful $400M Series D funding round. In recent months, high-profile clients such as Capital One, Lovable, Decagon, Polymarket, and Airwallex have either adopted our platform or expanded their existing implementations. These additions complement our established clientele of AI trailblazers and global giants like Meta, Cursor, Sony, and Tesla. Our mission is to revolutionize how organizations leverage data. Join us in this exciting journey! NOTE: This position is open to candidates residing in any EMEA/UK country where ClickHouse is authorized to hire. Join us in reshaping Observability for Developers! We aim to reinvent how engineers monitor, debug, and scale their production applications with HyperDX, now a part of ClickHouse. HyperDX is an open-source platform that converts telemetry data into actionable insights. Visualize a realm where logs, metrics, traces, and session replays harmoniously converge to swiftly identify root causes. If you've ever been roused in the middle of the night, exasperated with Grafana, Datadog, or Elastic for not providing the necessary insights, you'll understand the challenge we are addressing. Now, you can be part of the solution!

Mar 4, 2026
ClickHouse
Full-time|Remote|United Kingdom (remote)

About ClickHouse: Ranked among the top innovators on the 2025 Forbes Cloud 100 list, ClickHouse is a trailblazer in the private cloud sector, experiencing phenomenal growth with over 3,000 satisfied customers and an annual recurring revenue (ARR) increase exceeding 250% year-over-year. Our expertise lies in real-time analytics, data warehousing, observability, and AI workloads. Our rapid growth was underscored by a significant $400 million Series D funding round. Recently, esteemed clients like Capital One, Lovable, Decagon, Polymarket, and Airwallex have chosen our platform or expanded their existing deployments, joining the ranks of AI pioneers and global giants such as Meta, Cursor, Sony, and Tesla. Join us on our mission to transform the way companies harness data! We invite you to be part of our exciting journey in revolutionizing Observability for Developers! As part of ClickHouse, you will contribute to HyperDX, an open-source platform that turns telemetry data into actionable insights. Imagine a cohesive environment where logs, metrics, traces, and session replays converge to swiftly identify root causes. If you’ve ever been frustrated at 2 AM with existing tools like Grafana, Datadog, or Elastic, you’ll understand the challenge we are tackling, and now you can help us conquer it. We are looking for a Senior Full Stack Engineer to play a vital role in developing a petabyte-scale, high-performance observability platform, emphasizing an exceptional developer experience (the DX in HyperDX).

Mar 25, 2026
Graphcore
Full-time|On-site|Bristol, UK; Cambridge, UK

About Graphcore: Graphcore is at the forefront of innovation in Artificial Intelligence computing, dedicated to developing cutting-edge hardware, software, and systems infrastructure that will catalyze the next wave of AI advancements and facilitate the widespread integration of AI solutions across diverse industries. As a proud member of the SoftBank Group, Graphcore is part of a distinguished family of companies that are pioneering transformative technologies. Together, we share a bold aspiration: to enable Artificial Super Intelligence and make its advantages universally accessible. Our teams consist of individuals from various backgrounds, each contributing a unique set of skills and perspectives. Comprised of AI research experts, silicon designers, software engineers, and systems architects, Graphcore fosters a culture of continuous learning and relentless innovation. Job Summary: Become an integral part of our vibrant Software Infrastructure team, where you will play a crucial role in scaling and managing our infrastructure. You will design and develop essential tools and services that empower our larger software team, significantly improving the build, test, deployment, and productization processes of our Machine Learning Software components. Gain hands-on experience working with our High-Performance Computing (HPC) AI platforms and expand your knowledge in distributed systems. The Team: The Software Infrastructure team is responsible for delivering vital platforms and services that support software development teams throughout the organization. Our duties encompass managing the CI platform and services, build engineering, component integration, and packaging and release systems. We operate in squads, promoting a culture of service ownership and empowering our engineers, with a focus on long-term engineering solutions to minimize repetitive tasks.
Responsibilities and Duties: Develop, oversee, and maintain tools and services to streamline the software build and release process. Deploy and maintain services utilizing Kubernetes and Docker. Manage our Cloud Infrastructure with tools such as Terraform. Mentor and support the technical development of junior and graduate engineers. Exemplify strong engineering discipline to ensure high reliability and reduced toil.

Mar 13, 2026
Roku, Inc.
Full-time|On-site|Manchester, United Kingdom

Join Roku as a Senior Software Engineer specializing in Infrastructure and Efficiency. In this role, you will be instrumental in enhancing our platforms, ensuring optimized performance, and driving innovative solutions. Collaborate with cross-functional teams to design and implement scalable systems that support our growing user base.

Apr 9, 2026
Kraken
Full-time|Remote|United Kingdom

Shape the Future of Cryptocurrency. At Kraken, our team of dedicated professionals is driven by a shared conviction in the transformative power of cryptocurrency and blockchain technology. We are committed to unlocking the full potential of these innovations. What Sets Us Apart? Kraken is a purpose-driven organization grounded in crypto principles. As a member of our team, you will contribute to our mission of fostering global cryptocurrency adoption, enabling financial freedom and inclusivity for all. Over the past decade, our unwavering commitment to our mission and crypto values has attracted some of the brightest minds in the industry. Before you submit your application, we encourage you to visit our Kraken Culture page to gain insights into our values and internal culture. Additionally, we expect candidates to familiarize themselves with the Kraken app. Learn how to create a Kraken account here. As a fully remote organization, we proudly employ Krakenites across over 70 countries, communicating in more than 50 languages. Our team members are industry trailblazers who design top-tier cryptocurrency products catering to seasoned traders, institutions, and newcomers alike. Kraken is resolute in its commitment to world-class security, crypto education, and exceptional client support through offerings such as Kraken Pro, Desktop, Wallet, and Kraken Futures. Join Kraken and Help Build the Future of Cryptocurrency! The Team: The AI Infrastructure team is responsible for developing and maintaining the production systems that drive intelligent agents at scale. This team forms the backbone of the agent platform, enabling cutting-edge advancements in artificial intelligence.

Feb 24, 2026
Roku, Inc.
Full-time|On-site|Cambridge, United Kingdom

Join Roku, a leading streaming platform, as a Senior Software Engineer focused on enhancing our infrastructure, efficiency, and productivity. In this role, you will play a key part in architecting and developing innovative solutions that drive our technology forward. Collaborate with cross-functional teams to optimize system performance and ensure robust reliability in our services.

Apr 9, 2026
Roku, Inc.
Full-time|On-site|Cambridge, United Kingdom

Collaboration Fuels Innovation. Roku is transforming the way the world experiences television. As the leading TV streaming platform in the U.S., Canada, and Mexico, Roku aims to power every television globally. We pioneered streaming technology, and our mission is to be the ultimate platform connecting the entire TV ecosystem. We link consumers to their favorite content, enable publishers to grow and monetize vast audiences, and provide advertisers with unique engagement opportunities. From day one at Roku, your contributions will be significant and appreciated. Join our dynamic public company where every team member plays an active role. You’ll have the chance to inspire millions of TV viewers while gaining invaluable experience across diverse areas. About the Role: We are developing a next-generation observability and cloud platform that is high-performance, cost-effective, secure, and scalable across multi-region, multi-cloud infrastructures. You will take the lead in architecting and evolving Roku’s observability and cloud infrastructure stack, which encompasses metrics, logs, traces, telemetry pipelines, service mesh, developer experience, and system reliability that support thousands of services and millions of devices. You will be instrumental in driving a vision where developers achieve profound visibility with minimal overhead, onboarding is effortless, and insights are accessible in real-time. Your contributions will directly empower Roku...

Apr 2, 2026
ClickHouse
Full-time|Remote|London (Remote)

About ClickHouse: Featured on the prestigious 2025 Forbes Cloud 100 list, ClickHouse stands as a pioneering and rapidly expanding private cloud enterprise. With an impressive client base exceeding 3,000 and an annual recurring revenue (ARR) that has surged over 250% year-on-year, ClickHouse is at the forefront of real-time analytics, data warehousing, observability, and AI workloads. Our remarkable growth trajectory was recently underscored by a substantial $400 million Series D funding round. In the last quarter alone, notable clients like Capital One, Lovable, Decagon, Polymarket, and Airwallex have either adopted or enhanced their use of our platform. They join a diverse roster of innovators in AI and global brands such as Meta, Cursor, Sony, and Tesla. Join us on our mission to revolutionize data utilization for businesses. Be a part of our exciting journey! About the Team: The Cloud Infrastructure Engineering team is dedicated to constructing and maintaining the essential components of ClickHouse Cloud's data plane. This encompasses computing, networking, security, and a multi-cloud, multi-region architecture designed to deliver a reliable and scalable managed ClickHouse experience for our customers. We are in search of highly skilled and experienced cloud infrastructure software engineers to join our team, responsible for the design, deployment, and maintenance of our infrastructure.

Mar 5, 2026
physicsx
Full-time|On-site|London

Join physicsx as a Senior Software Engineer specializing in Site Reliability Engineering (SRE). In this pivotal role, you will be responsible for enhancing our core infrastructure, ensuring reliability, scalability, and performance of our software systems. You will work with a talented team to design, implement, and maintain robust solutions that meet the needs of our users. As a senior engineer, you will also mentor junior team members, contribute to architectural discussions, and drive best practices in software development and operations.

Mar 16, 2026
Palantir Technologies Inc.
Full-time|On-site|London, United Kingdom

Join Palantir Technologies as a Senior Backend Software Engineer focused on Infrastructure. In this crucial role, you will design and implement robust backend systems that power our cutting-edge software solutions. Collaborate with a talented team to develop scalable and efficient solutions that meet the needs of our clients.

Apr 6, 2026
Contentful
Full-time|On-site|London, England, United Kingdom

Join Contentful as a Senior Backend & Infrastructure Software Engineer and play a pivotal role in shaping the future of our platform. In this role, you will design and implement robust backend solutions that enhance our infrastructure capabilities, ensuring scalability and performance. Collaborate with cross-functional teams to tackle complex technical challenges and drive innovation within the organization.

Mar 27, 2026
Axon
Full-time|Hybrid|London, England, United Kingdom

Become a Force for Good with Axon. At Axon, we are committed to the mission of Protecting Life. We tackle society's most urgent safety and justice challenges through our innovative devices and cloud-based software. Our collaborative spirit drives our success as we engage with diverse perspectives from our customers, communities, and each other. Working at Axon is fast-paced, fulfilling, and engaging. You will have the opportunity to take ownership and enact real change while growing in a mission-driven environment where your contributions are valued. Your Impact: As a Senior Software Engineer within the Infrastructure Services team at Dedrone, you will be tasked with designing, building, and enhancing the core systems that support our developer platform and critical backend services. Your robust software engineering skills coupled with extensive infrastructure knowledge will ensure our platform remains scalable, reliable, and secure. This role transcends traditional infrastructure operations; you will create production-ready backend services, design cloud-native systems, and manage the CI/CD and runtime environments that underpin our product ecosystem. Your influence will shape engineering standards, enhance developer experiences, and steer architectural decisions across Dedrone’s product teams. What You’ll Do. Work Location: This position is situated in our London office following a hybrid work model. We emphasize in-person collaboration, requiring team members to be on-site from Tuesday to Friday, with the flexibility for remote work on Mondays, unless a workplace accommodation is in place.
We believe that connection drives innovation, and our in-office culture is crafted to promote meaningful teamwork, mentorship, and collective success. Reports to: Engineering Manager, Infrastructure Services. Backend & Platform Service Engineering: Design, construct, and sustain production-grade backend services (primarily using Java, Go, or Python) that facilitate deployment orchestration, internal APIs, and developer platform functionalities. Contribute to architectural decisions encompassing service boundaries, API design, persistence layers, scalability strategies, and fault tolerance. Enhance performance, reliability, and maintainability of platform services through rigorous testing practices and meticulous system design. Manage and evolve shared services governed by the Infrastructure Services team. CI/CD & Developer Platform: Own, design, and refine CI/CD systems that support Dedrone’s distributed product ecosystem.

Apr 8, 2026
OpenAI
Full-time|On-site|London, UK

About Our Team: Join the Applications Engineering team, a dynamic group that collaborates across research, engineering, product management, and design to deliver cutting-edge AI solutions for consumers and businesses alike. As a member of our team, you will play a vital role in managing the essential infrastructure that underpins products like ChatGPT and our API. This encompasses our Kubernetes clusters, infrastructure deployment, networking architecture, cloud abstractions, and much more. We are committed to learning from our deployments and spreading the advantages of AI while ensuring its responsible and safe application. For us, safety takes precedence over unrestricted growth. Role Overview: Our cloud infrastructure team is dedicated to constructing and sustaining infrastructure abstractions that empower OpenAI to deliver products efficiently and at scale. Key Responsibilities: Design and develop robust development and production platforms that ensure reliability and security at scale. Guarantee that our infrastructure is poised to scale for future demands. Foster a diverse, equitable, and inclusive environment that encourages openness, welcome, and the challenging of conventional thinking. Participate in the overall responsibility for system reliability, including an on-call rotation for critical incidents. Ideal Candidate Profile: 5+ years of experience in building core infrastructure systems. Skilled in operating orchestration systems, particularly Kubernetes, on a large scale. Experience in creating abstractions over various cloud platforms. Pride in developing and managing scalable, reliable, and secure systems. Comfortable navigating ambiguity and adapting to rapid changes. About OpenAI: OpenAI is a pioneering AI research and deployment organization committed to ensuring that general-purpose artificial intelligence serves the greater good of humanity.
We continuously push the boundaries of AI capabilities and are focused on the safe deployment of these technologies through our innovative products. Our mission is to prioritize safety and human needs while embracing diverse perspectives, ensuring that our AI tools are developed responsibly.

Oct 1, 2025
Waymo LLC
Full-time|On-site|London, UK

Waymo is at the forefront of autonomous driving technology, dedicated to becoming the world's most trusted driver. Originally initiated as the Google Self-Driving Car Project in 2009, our team has been relentlessly working to enhance mobility access and save lives by developing the Waymo Driver: The World’s Most Experienced Driver™. With over ten million successful rider-only trips facilitated by our technology, which has driven over 100 million miles on public roads and tens of billions of miles in simulations across more than 15 U.S. states, we are shaping the future of transportation. The Simulation ML Infrastructure team is responsible for creating scalable AI/ML infrastructure that propels our Simulator team towards innovative, state-of-the-art simulations of realistic environments for the Waymo Driver's training and testing. By leveraging large foundation models trained on extensive datasets, we enhance the authenticity and control of our simulations, including realistic representations of vehicles, pedestrians, cyclists, roads, traffic control systems, and weather conditions. We are currently seeking a highly skilled Senior Machine Learning Infrastructure Engineer to spearhead the development of advanced AI/ML infrastructure for multi-billion parameter foundation models tailored for ML accelerator-friendly simulations. Your extensive knowledge in massive model scaling, ML accelerators, and distributed training will be vital in designing and optimizing our systems.

Feb 10, 2026
Algolia
Full-time|Hybrid|London, England; Paris, France

At Algolia, we are proud to be at the forefront of AI Search technology, enabling over 17,000 businesses to provide lightning-fast, predictive search and browsing experiences on a global scale. We handle more than 30 billion search requests each week, surpassing the combined total of Microsoft Bing, Yahoo, Baidu, Yandex, and DuckDuckGo. In 2021, we secured $150 million in Series D funding, elevating our valuation to $2.25 billion. This strong financial backing allows us to continuously enhance our industry-leading platform and serve esteemed clients like Under Armour, PetSmart, Stripe, Gymshark, and Walgreens. Algolia was designed to empower users to offer an intuitive search-as-you-type functionality on their websites and mobile applications. Our search API is utilized by thousands of clients across more than 100 countries, facilitating billions of search queries every month thanks to the code we deploy into production daily. We are on the lookout for a Senior Software Engineer to join our Metis team, which is responsible for the cloud-based scalable architecture of NeuralSearch, our AI-powered search engine that integrates both keyword and vector search capabilities. Metis comprises distributed components that manage the construction and storage of indices containing customer data and querying that data to deliver relevant search results. This role demands a deep understanding of the complexities associated with distributed systems. Our team is composed of engineers, with most working remotely, bringing diverse skill sets and backgrounds. Your unique experiences and insights will contribute to our collective diversity, helping us to create impactful products.

Feb 18, 2026
Perplexity
Full-time|On-site|London

Join Perplexity, a pioneering company based in London, as we transform the way users search and engage with the internet. We are seeking seasoned Infrastructure Engineers to become integral members of our dynamic team. In this role, you will lead the design, implementation, and scaling of innovative tools, systems, and platforms that empower web, mobile, and browser engineers to develop cutting-edge products. Your contributions will be vital in enabling our product, AI, and development teams to innovate swiftly while ensuring utmost reliability, security, and performance at scale. Our Tech Stack: Python | Go | TypeScript | PostgreSQL | DynamoDB | Redis | FastAPI | React | Bazel | GitHub | AWS. Teams Hiring. Senior/Staff Platform: The Platform team is fundamental to ensuring product reliability, scalability, and performance at Perplexity. This elite team is responsible for developing and maintaining the critical infrastructure, from backend systems such as authentication, real-time data flows, and service orchestration to frontend frameworks that guarantee fast, reliable, and secure user experiences. By upholding stringent standards for code quality, uptime, and developer productivity, the Platform team enables all of Perplexity to innovate rapidly on a solid, well-designed foundation. Senior/Staff DevX: The Developer Experience (DevX) team is dedicated to empowering engineers at Perplexity to build, ship, and iterate faster than ever before. They manage internal platforms for source control, build, test, and deployment, effectively removing bottlenecks from each phase of product development. This team designs seamless onboarding, ultra-fast CI/CD pipelines, and developer tools that enhance creativity and safety at scale.
Through collaborative efforts and significant autonomy, the DevX team amplifies the contributions of every engineer, facilitating rapid, dependable innovation across the organization. Senior/Staff Cloud Infra: The Cloud Infrastructure team architects and manages the essential cloud infrastructure that powers Perplexity’s global platform. This team designs and scales computing, storage, and networking systems to support high-throughput, low-latency workloads, balancing proactive project initiatives with operational support. Collaboration with product, AI research, security, and other teams ensures that our services remain consistently available, reliable, and responsive to user demands.

Feb 10, 2026
tem
Full-time|On-site|United Kingdom

Join tem as a Senior Staff Engineer specializing in Pricing Infrastructure, where you'll lead innovative projects that enhance pricing systems and drive strategic initiatives. Collaborate with cross-functional teams to deliver scalable solutions that meet business needs.

Mar 2, 2026
Palantir Technologies
Full-time|On-site|London, United Kingdom

Join a Pioneering Company. At Palantir, we are at the forefront of developing transformative software that empowers data-driven decisions and operations. Our platforms enable partners to revolutionize industries by facilitating groundbreaking discoveries in healthcare, enhancing supply chain resilience, and aiding critical social initiatives. The Opportunity: As a Backend Software Engineer at Palantir, you will play a crucial role in crafting scalable software solutions that redefine organizational data usage. You will engage in every phase of the product lifecycle, from conceptualization and design to prototyping and delivery. Collaborating closely with both technical and non-technical team members, you will gain deep insights into customer challenges and develop innovative solutions. Our culture promotes cross-team collaboration, allowing you to broaden your technical expertise across various technologies and product domains. You'll work independently while receiving support from a vibrant community that encourages your growth as a technical contributor and engineering leader. Our Product Development teams consist of small, focused groups of Software Engineers, each dedicated to distinct product facets. The infrastructure teams specialize in the foundational layers of our software stack, emphasizing database technologies, distributed systems, large-scale data architectures, security, and application infrastructure. As an infrastructure Software Engineer, you will produce high-quality code that underpins Palantir Foundry and Gotham, ensuring they are performant, secure, and scalable. Your work will empower critical applications utilized by researchers, engineers, analysts, and forecasters across the globe. We seek engineers who are passionate about tackling real-world challenges and enhancing the productivity of both developers and end-users.
If you are driven to create reliable, high-performance systems and robust APIs, this position offers a unique chance to impact our products and their users significantly. Exclusive Learning Experience: Successful candidates may also participate in Frontline, a distinctive program that embeds engineers directly with customers. This short-term assignment provides invaluable insights into user experiences and the complexities our customers encounter, differentiating it from traditional engineering roles.

Feb 20, 2024
Gradle Inc.
Full-time|Remote|Europe

About Us: Develocity is an innovative toolchain observability and acceleration platform, uniquely designed to assist software teams in adopting and enhancing their DORA capabilities, which include continuous delivery. We empower organizations to achieve software delivery excellence through a combination of build and test acceleration and in-depth observability for tools including Gradle Build Tool, Apache Maven™, sbt, npm, and Python. Our solutions cater to both CI and local builds, effectively providing an operational layer across a company’s toolchains to improve speed, troubleshooting, and optimization of both local developer and remote CI feedback loops. Our software is trusted by some of the world's premier software organizations, including Netflix, Airbnb, SAP, and several top ten banks, alongside numerous other significant clients across various sectors. We engage closely with these users to continuously enhance our product offerings. In addition, we collaborate with esteemed organizations such as the Apache Software Foundation, the Commonhaus Foundation, the Scala Center, the Micronaut Foundation, and other open-source projects like Spring, Quarkus, Kotlin, JUnit, and AndroidX to extend the values of Develocity to the open-source community.

Jan 15, 2026
Graphcore
Full-time|On-site|Bristol, UK; Cambridge, UK

Join Graphcore: Pioneers in AI Innovation. At Graphcore, we are at the forefront of Artificial Intelligence computing, dedicated to developing cutting-edge hardware, software, and systems that will drive the next wave of AI advancements. Our mission is to facilitate the broad adoption of AI solutions in various sectors and enhance the transformative potential of this technology. As a proud member of the SoftBank Group, Graphcore belongs to a remarkable group of companies that are reshaping the technological landscape. We share a vision of enabling Artificial Super Intelligence and making its benefits universally accessible. Our team comprises talented professionals from various disciplines, including AI researchers, silicon designers, software engineers, and systems architects. We foster a culture of continuous learning and innovation, making Graphcore an exciting place to advance your career. Your Role: In our vibrant Software Infrastructure team, you'll play a crucial role in scaling and managing our infrastructure. You'll create essential tools and services that empower our software teams, enhancing the build, test, deployment, and productization processes for our Machine Learning Software components. Collaborate with our High-Performance Computing (HPC) AI platforms and gain invaluable experience in distributed systems. The Team's Mission: The Software Infrastructure team is integral to our software development ecosystem, delivering key platforms and services. We manage CI platforms, build engineering, component integration, and release systems, operating in squads that emphasize service ownership and engineer empowerment. Our focus is on long-term engineering solutions and minimizing repetitive tasks to enhance productivity.

Mar 13, 2026
