About the job
Join amo at a pivotal time as we prepare for an unprecedented surge in traffic. In the initial months, you will collaborate closely with the founding team, progressively assuming ownership of critical system components.
As a Lead Site Reliability Engineer, you will play a key role in ensuring our systems not only manage high traffic volumes but excel under strain. Performance and reliability are our cornerstones; thus, monitoring latency as a key metric is paramount.
Collaboration with the backend team is essential as we work together to deploy code into production. Our mantra is clear: automate repetitive tasks to enhance efficiency across our operations.
Your Responsibilities:
Vision and Leadership
You will establish and drive the long-term vision and roadmap for our infrastructure, ensuring that amo’s systems are reliable, scalable, and secure as we expand globally. You'll lead a dynamic team of SRE engineers, providing clear direction while participating in hands-on execution. Fostering a culture of reliability, ownership, and technical excellence will be essential to our success.
System Design and Management
Design and maintain distributed databases at scale across multiple clusters (ScyllaDB). Your role will involve testing, scaling, and designing high-throughput monitoring infrastructure capable of handling over 1 million series per second.
Automation and Efficiency
Our goal is to eliminate every manual process. Automate everything from monitoring dashboards to database installations and cluster management. Utilize automated Kubernetes Pod Autoscaler to achieve near 90% CPU utilization per machine while ensuring high-quality service delivery and cost control.
Collaboration with Backend Teams
Working closely with software engineers is part of our daily routine, making you an integral member of the engineering team. Engage with partners like GCP, ScyllaDB, and Redpanda to maximize their products’ potential.
Software Engineering and Development
As an SRE at amo, you will also engage in coding, build CI/CD pipelines, and introduce new tools to accelerate our growth.

