About the job
About Okta
Okta stands as the leader in identity solutions, empowering individuals to securely engage with any technology, on any device, and through any application. Our versatile products, including the Okta Platform and Auth0 Platform, ensure safe access and authentication, placing identity at the forefront of security and business growth.
At Okta, we embrace diverse perspectives and experiences. We are not searching for someone who checks all the boxes; rather, we value lifelong learners who can enrich our team with their unique backgrounds.
Join us in crafting a future where identity is truly yours.
Position Overview:
We are looking for a highly skilled Senior Observability Site Reliability Engineer with a focus on Splunk to take ownership and enhance our Splunk ecosystem. In this role, you will go beyond traditional monitoring, creating a comprehensive and scalable Observability Platform that empowers our SRE teams and business stakeholders. You will treat infrastructure as code, leveraging Terraform alongside proficient coding skills in Go, Python, or Ruby to automate deployment across complex distributed systems.
Key Responsibilities
- Automated Infrastructure: Design, build, and maintain scalable observability infrastructure utilizing tools like Terraform.
- Splunk Engineering: Enhance the collection, processing, and storage of log data to ensure our Splunk services are highly reliable and low-latency.
- Incident Response: Engage in on-call rotations and lead post-incident reviews to drive systemic improvements and promote 'observability-driven development.'
- Automation: Minimize 'toil' by automating the deployment and scaling of observability agents and collectors.
About Okta, Inc.
Okta is a leading identity solutions provider, dedicated to enabling secure technology access and authentication for individuals and businesses alike. Our innovative platforms, Okta and Auth0, are designed to prioritize identity in business security and expansion.

