About the job
About Us
AMCS Group, a leader in sustainability software, is based in Ireland with a global presence in Europe, the USA, and Australasia. With a dedicated workforce of over 1,300 professionals spanning 22 countries, we are committed to providing innovative technology solutions that contribute to a carbon-neutral future.
Our Mission
We offer advanced SaaS solutions designed to enhance efficiency and sustainability in resource-intensive sectors. Our Performance Sustainability software serves more than 5,000 clients across 23 countries, delivering effective solutions that drive both profitability and environmental resilience worldwide.
Your Role
We are looking for a dynamic and experienced Cloud Infrastructure Manager to spearhead and refine our Site Reliability Engineering, Cloud Operations, and FinOps teams. This pivotal role will ensure that our infrastructure is reliable, secure, and cost-effective, enabling robust product delivery at scale. You will shape operational strategies, enhance organizational maturity, and foster cross-functional collaboration, taking charge of both team leadership and technical guidance in these vital areas.
Key Responsibilities
Strategic Leadership:
- Formulate and implement a cohesive strategy across Site Reliability Engineering (SRE), Cloud Operations, Cloud Security, and FinOps.
- Develop and execute a cloud optimization roadmap that promotes cost efficiency while maintaining high standards of reliability and performance.
- Provide leadership, mentorship, and performance management for diverse teams.
- Collaborate with senior engineering leadership to align platform and reliability goals with overarching business objectives.
Site Reliability Engineering:
- Oversee SRE teams tasked with ensuring reliability, availability, performance, and operational excellence.
- Drive the observability strategy to enhance metrics, logs, traces, dashboards, and alerting accuracy.
- Promote SRE principles such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), error budgets, and toil reduction.
Cloud Operations:
- Manage cloud infrastructure provisioning, governance, and cost optimization across Azure, AWS, and GCP.
- Advocate for automation-first operational practices to minimize manual interventions.
Cloud Security:
- Implement secure-by-default cloud architecture, governance, and controls.
- Collaborate with IT security teams to integrate policy-as-code and identity-based access models.

