About the job
Join Algolia, a trailblazer and leader in AI Search, serving over 17,000 businesses with lightning-fast, predictive search and browsing capabilities on an internet scale. Our platform processes over 30 billion search requests weekly—four times the volume of Microsoft Bing, Yahoo, Baidu, Yandex, and DuckDuckGo combined.
In 2021, we secured $150 million in Series D funding, boosting our valuation to $2.25 billion. This solid foundation allows us to continually enhance our market-leading platform while providing outstanding service to clients such as Under Armour, PetSmart, Stripe, Gymshark, and Walgreens.
At Algolia, we aim to empower every organization to develop exceptional Search and Discovery experiences through an API-first methodology. Performance and scalability are central to our mission: we facilitate 1.5 trillion searches annually for over 10,000 customers globally.
If you are a creative problem solver who thrives in collaborative environments and is eager to both mentor others and learn from them, this is your opportunity!
The Team
The Fleet team, dedicated to Site Reliability Engineering, is focused on one primary objective: ensuring the availability of our Search products. To achieve this, the Fleet team devises pragmatic solutions that optimize product availability and cost-effectiveness at scale, while addressing the needs of customers, product teams, and the various engineering teams contributing to a unique Search Experience.
The Opportunity
We are seeking an individual with experience in building and managing scalable architectures. You will play a critical role in delivering solutions that enable other engineering teams, directly influencing the success of Algolia's Search products.
In this position, you will design and implement systems centered around reliability, scalability, and cost efficiency, while also having opportunities for personal growth and team collaboration.
Your role will include:
- Operating Search products and developing self-healing and automated incident response mechanisms.
- Creating components that enhance reliability and performance.
- Monitoring and analyzing the Service Level Objectives (SLO) and error budgets of the products you manage.
- Minimizing toil and technical debt through task automation and improving the quality of existing components.
- Managing incidents and addressing customer requests.

