Qualifications
Key Responsibilities:• Design, build, and operate observability platforms utilizing Grafana and Prometheus.• Define and maintain standards for metrics, dashboards, alerts, and SLOs.• Enhance signal quality: reduce alert noise, optimize thresholds, and refine runbooks.• Assist with incident response by delivering actionable telemetry and conducting post-incident analysis.• Integrate metrics, logs, and traces across distributed systems.• Collaborate with engineering teams to correctly instrument services.• Automate observability configuration through infrastructure as code.• Contribute to reliability improvements via capacity planning and performance analysis. Required Skills and Experience:• Extensive experience with Prometheus (scraping, federation, recording rules, alerting).• Strong expertise in Grafana (dashboards, alerting, templating, RBAC).• Solid understanding of Linux and networking fundamentals.• Experience managing observability stacks in Kubernetes environments.• Familiarity with infrastructure as code (preferably Terraform).• Knowledge of incident management and on-call practices.• Proficiency in debugging production systems using metrics and logs.
About the job
Cision is hiring a Staff Site Reliability & DevOps Engineer focused on Observability. This remote position is open to candidates based in Hungary or Sofia, Bulgaria.
This role centers on designing, operating, and improving observability platforms. The main focus areas are metrics, logging, and alerting, using tools such as Grafana and Prometheus. The goal is to keep production systems observable, reliable, and scalable.
Key Responsibilities
- Design and maintain observability platforms with an emphasis on metrics, logging, and alerting.
- Operate and evolve systems using Grafana and Prometheus.
- Work closely with platform, infrastructure, and application teams to ensure effective monitoring and reliability.
About Cision
Cision values curiosity, teamwork, and innovation. Team members are encouraged to share insights and ideas, contributing to the growth of the brands supported. The company fosters an environment where each person’s perspective is valued and recognized.