Senior Site Reliability Engineer - Observability

Okta, Inc., Bellevue, Washington
On-site Full-time $147K/yr - $202K/yr

Experience Level

Senior

Qualifications

Required Skills & Experience:

  • Log Management: Over 5 years of experience managing Splunk Cloud at scale (1000+ services), including expertise in Workload Management (WLM) and HTTP Event Collector (HEC) optimization.
  • Visualization: Proficiency in developing intuitive, actionable Splunk dashboards that integrate data from multiple sources.
  • SRE Mindset: At least 3 years of experience in an SRE, DevOps, or Systems Engineering role focusing on high-availability systems.

About the job

About Okta

Okta stands as the leader in identity solutions, empowering individuals to securely engage with any technology, on any device, and through any application. Our versatile products, including the Okta Platform and Auth0 Platform, ensure safe access and authentication, placing identity at the forefront of security and business growth.

At Okta, we embrace diverse perspectives and experiences. We are not searching for someone who checks all the boxes; rather, we value lifelong learners who can enrich our team with their unique backgrounds.

Join us in crafting a future where identity is truly yours.

Position Overview:

We are looking for a highly skilled Senior Observability Site Reliability Engineer with a focus on Splunk to take ownership of and enhance our Splunk ecosystem. In this role, you will go beyond traditional monitoring, creating a comprehensive and scalable Observability Platform that empowers our SRE teams and business stakeholders. You will treat infrastructure as code, using Terraform alongside strong coding skills in Go, Python, or Ruby to automate deployment across complex distributed systems.

Key Responsibilities

  • Automated Infrastructure: Design, build, and maintain scalable observability infrastructure utilizing tools like Terraform.
  • Splunk Engineering: Enhance the collection, processing, and storage of log data to ensure our Splunk services are highly reliable and low-latency.
  • Incident Response: Engage in on-call rotations and lead post-incident reviews to drive systemic improvements and promote 'observability-driven development.'
  • Automation: Minimize 'toil' by automating the deployment and scaling of observability agents and collectors.

About Okta, Inc.

Okta is a leading identity solutions provider, dedicated to enabling secure technology access and authentication for individuals and businesses alike. Our innovative platforms, Okta and Auth0, are designed to prioritize identity in business security and expansion.
