Replit logoReplit logo

Staff Site Reliability Engineer

ReplitFoster City, CA (Hybrid) In office M,W,F
Hybrid Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior

Qualifications

• Proven experience as a Site Reliability Engineer or similar role in a high-scale environment. • Strong knowledge of monitoring, logging, and tracing solutions, with a demonstrated ability to architect observability frameworks. • Expertise in defining and implementing Service Level Objectives (SLOs) and Service Level Indicators (SLIs). • Experience leading incident management and response, including conducting post-mortems and implementing preventative measures. • Proficiency in automation and Infrastructure as Code (IaC) practices, including tools like Terraform or Pulumi. • Excellent problem-solving skills and a passion for improving system reliability and performance. • Strong communication skills and experience in mentoring team members.

About the job

At Replit, we are at the forefront of transforming software development by empowering users to create applications using natural language. Our platform has millions of users globally, including over 500,000 businesses, as we strive to eliminate obstacles in application creation. As a Senior Staff Site Reliability Engineer, you will play a pivotal role in our Site Reliability Engineering team, ensuring the reliability, scalability, and performance of Replit's infrastructure that supports developers around the world. You will act as a vital link between development and operations, applying your expertise in automation and best practices to help our platform scale efficiently while ensuring high availability. We are looking for passionate individuals who are committed to building and maintaining resilient systems. Your mission will be to proactively identify and analyze reliability issues across our tech stack and design innovative software and systems for substantial improvements. You will create robust observability solutions, drive incident responses, automate operational tasks, and enhance our infrastructure's reliability while mentoring and guiding the engineering team to embrace reliability as a core value.

About Replit

Replit is a revolutionary software creation platform that democratizes the way applications are built. By allowing users to interact with our platform using natural language, we have opened up the world of software development to millions. With a robust user base, including over 500,000 businesses, we are dedicated to reducing barriers in application creation and empowering developers everywhere.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages. View directory listings: all jobs, search results, location & role pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.