About the job
About Tyk
Tyk is revolutionizing the way organizations connect their systems and services through our innovative API Management platform. We are at the forefront of enabling seamless connections across various industries, including retail, finance, telecommunications, healthcare, and media. Whether you’re banking online, checking the news via an app, or navigating a connected vehicle, Tyk’s APIs are making these experiences possible.
Established in 2015, with offices in London (UK), London (Ontario), Atlanta, and Singapore, Tyk serves thousands of users globally. Our diverse clientele includes renowned brands such as Lotte, Bell, T Mobile, RBS, Capital One, and Vinci, with users spanning every continent, including Antarctica.
Our Vision
Tyk aims to connect every system worldwide by providing a robust API Management platform.
Work Culture
We promote total flexibility with default remote work options and offer unlimited paid holidays for all employees. This flexible work environment allows our team to achieve their best results, regardless of location or working hours.
Role Overview:
As a Senior Site Reliability Engineer (SRE) at Tyk, you will play a critical role in enhancing our software solutions, ensuring high availability and performance for our growing user base. We are looking for an innovative thinker and a collaborative team player who is eager to optimize, automate, and improve our systems using insights from large-scale real-time data.
Key Responsibilities:
- Take the lead in maintaining and optimizing our global Cloud platform, adhering to defined SL(A/I/O)s.
- Collaborate on SRE strategy and translate it into actionable technical plans through SCRUM methodologies.
- Identify reliability challenges, conduct root cause analyses, and implement effective solutions within your team.
- Drive performance tuning and fault analysis by examining OS and application metrics.
- Design and implement automation for routine operational tasks and cloud operations workflows.
- Develop proactive monitoring and alerting strategies, including relevant dashboards and KPIs.
