About the job
Join our innovative team at nawy-real-estate as a Senior Site Reliability Engineer (SRE). We are looking for an experienced SRE with a solid software engineering background, particularly in using Terraform for Infrastructure as Code (IaC) to provision and manage cloud resources on AWS.
Your responsibilities will include:
- Designing, implementing, and managing cloud infrastructure using Terraform and other IaC tools.
- Proactively monitoring, troubleshooting, and optimizing cloud environments to ensure high availability and efficiency.
- Implementing and maintaining CI/CD pipelines that automate code deployment and infrastructure changes.
- Developing and managing the data stack, including its infrastructure resources, implementation, and data lake setup.
- Responding to critical alerts and incidents, coordinating swift response efforts to minimize impact and downtime.
- Conducting thorough root cause analysis (RCA) for incidents to identify and rectify underlying issues, while developing preventive solutions.
- Documenting and maintaining comprehensive guides for system configurations and operational procedures, fostering knowledge sharing and operational excellence.
