About the job
Join MongoDB, a leader in the database market, as we revolutionize the way developers create and manage applications. Our distributed systems support mission-critical services for thousands of customers worldwide.
We are seeking a skilled Software Engineer to become part of our Cloud Core Alerts Platform team. You will be instrumental in designing scalable, fault-tolerant, and highly available systems that can process millions of events and alerts in real-time.
This role is based in Dublin, where we embrace a hybrid working model.
Our domain includes:
- Alert Streaming Systems: Real-time alerting pipelines using Apache Flink and Amazon Kinesis, delivering critical insights for MongoDB Atlas customers.
- Event Systems: SOA-based event platforms utilizing gRPC and streaming architectures to efficiently manage the MongoDB activity feed at scale.
- Communication Systems: Distributed services that ensure reliable delivery of alert notifications across multiple channels (email, SMS, Slack, PagerDuty).
- Third-Party Integrations: Secure management of customer credentials and seamless integration with external observability providers, ensuring encryption both at rest and in transit.
- Cross-team Collaboration: Working closely with the broader Customer Observability teams to provide unified telemetry and monitoring experiences.
What you'll do:
- Design and develop distributed systems capable of processing millions of events per second while maintaining high availability and low latency.
- Lead projects from design to production, ensuring scalability, observability, and operational excellence.
- Enhance our streaming alerting and eventing platforms to improve reliability, throughput, and the overall developer experience.
- Collaborate with cross-functional teams to integrate alerting, event, and communication services into the broader customer observability ecosystem.
- Create secure, multi-tenant integrations with third-party providers, handling sensitive customer credentials with care.
- Promote best practices in distributed systems design, focusing on resilience, scalability, and fault tolerance.
- Mentor and guide fellow engineers, sharing expertise in distributed architectures and streaming technologies.

