Lead Observability Engineer - Remote (15:00 - 23:00 CET)
N-iX
N-iX is a leading global software development company, established in 2002, that unites over 2,400 skilled tech professionals across more than 40 countries. We specialize in delivering cutting-edge technology solutions in cloud computing, data analytics, artificial intelligence, embedded software, IoT, and more, serving global industry leaders and Fortune 500 companies. Join us to create transformative technology that makes a genuine difference for businesses and individuals worldwide. As a Lead Observability Engineer, you will elevate our observability platform, focusing on ClickHouse as the primary telemetry storage solution. You will spearhead the transition from a custom Cosmos telemetry system to ClickHouse, ensuring robust alerting, notifications, and telemetry functionalities. This hybrid role requires an individual contributor who reports to the Senior Manager. Our client values transparency and strives to create a more agreeable environment for employees, customers, and communities. Here, you will have the opportunity to voice your ideas, share insights openly, and make a meaningful contribution as part of a team that impacts the globe. About the Team Our Cloud Engineering team thrives on collaboration, curiosity, and innovation. We develop mission-critical cloud solutions that serve millions of users and businesses globally. We embrace agile principles, DevOps practices, and infrastructure as code, focusing on reliability, scalability, and security in everything we do. Responsibilities Lead the migration and transformation of telemetry storage from custom Cosmos DB solutions to ClickHouse, creating a scalable and reliable end-to-end observability platform. Architect, implement, and maintain alerting and notification systems integrated with ClickHouse for critical services and applications. Develop, deploy, and operate high-throughput telemetry pipelines, ensuring accurate and actionable monitoring across cloud environments. Collaborate with engineering and product teams to establish and advocate for observability best practices. Design and build dashboards and visualization tools to facilitate proactive monitoring, detection, and resolution of incidents. Work with DevOps and development teams to automate the collection, ingestion, and retention policies for logs, metrics, and traces. Drive continuous improvement in system performance, stability, and reliability through effective observability. Participate in on-call rotations, incident response, and root cause analysis to enhance monitoring and alerting capabilities.
Apr 17, 2026