OUR DNA Join Scaleway in Building the European Sovereign Cloud!Founded in 1999, Scaleway is the cloud subsidiary of the Iliad Group, a leading telecommunications provider in Europe. Our mission? To create a more responsible digital industry by empowering developers and businesses to build, deploy, and adapt their applications across any infrastructure. With headquarters in Paris, Lille, Toulouse, Bordeaux, and Lyon, we design and operate a sovereign cloud ecosystem that is utilized daily by our teams and embraced by 25,000 clients including Photoroom, Mistral, H, Ministry of National Education, Paris 8 Universities, Dysflexis, Lacroix, Little Big Connection, Mon Petit Placement, Radio France, Hachette Livres. Our offerings include: A seamless and intuitive user experience Multi-AZ redundancy ensuring high availability and resilience Carbon-neutral data centers Native tools for multi-cloud architecturesOur solutions cater to all needs from bare metal to containerization and serverless architectures, providing a high-performance European alternative for all types of clients and use cases. Join a team of nearly 600 passionate professionals from diverse backgrounds in a technology-driven, innovative, and collaborative environment!THE ROLEAs a Site Reliability Engineer (SRE), you will play a pivotal role in ensuring the robustness and performance of our services.Reporting to a Lead SRE (Engineering Manager), your contributions will include:- Continuously enhancing the reliability and scalability of our platforms.- Automating infrastructure to optimize deployments and minimize human intervention.- Collaborating with Development, Product, and Operations teams to ensure high-performance and resilient services.You will also become part of the SRE Guild, a collective dedicated to best practices and technical innovation. Your ResponsibilitiesAutomation & Tools- Develop tools and frameworks to streamline deployments and infrastructure management.- Automate repetitive tasks to enhance efficiency and reliability.Monitoring & Alerting- Implement key performance indicators (SLO, KPI) to monitor service performance.- Optimize monitoring and alerting systems to reduce alert fatigue.Incident Management- Lead incident response efforts to quickly resolve issues and restore service functionality.Join us in making a significant impact in the cloud industry!
Mar 6, 2025