About the job
Role Overview
Sigma Computing is growing its engineering team in New York City. The Senior Software Engineer - Observability and Reliability will help build technology that makes data accessible for all. This role focuses on improving how systems are monitored, measured, and maintained for reliability.
What You Will Do
- Design and build observability tools and platforms, including metrics collection, logging, distributed tracing, dashboarding, alerting, and application performance management.
- Work with technologies such as Go, Open Telemetry, and Kubernetes.
- Take part in on-call rotations to help maintain high service uptime.
- Develop runtime tools and processes that support cloud triaging and help minimize downtime.
- Define and promote best practices for monitoring and measuring systems and services.
- Collaborate with engineers and stakeholders through design and code reviews, with a strong emphasis on hands-on coding.

