About the job
ABOUT THE TEAM
The Production Engineering team at Rubrik plays a crucial role in ensuring the availability and reliability of mission-critical platforms across expansive, multi-cloud environments. We serve as the cornerstone of operational excellence, managing incident responses, outages, observability, and continuous enhancement.
Our team collaborates closely with Site Reliability Engineering (SRE) and Engineering units to proactively identify risks, minimize operational toil, mitigate outages, and construct resilient systems. We thrive in complex, high-pressure situations, utilizing sound technical judgment while relentlessly improving through learning, ownership, and accountability.
ABOUT THE ROLE
As a Senior Engineering Manager for Production Engineering, you will lead and nurture a high-impact team responsible for sustaining highly available, business-critical services. You will drive the technical roadmap, strategy, and execution while acting as a senior escalation point during significant incidents.
Your role involves setting a compelling vision for the team, coaching engineers to handle high-pressure scenarios confidently. This position demands strong technical expertise, decisive decision-making skills, and the capability to collaborate across teams to enhance system reliability and operational maturity.
What You’ll Do
- Lead the Production Engineering team, supporting critical infrastructure and services across multi-cloud environments, aligning with EST timezone (4:00 PM – 1:00 AM IST).
- Own the operational excellence for production environments by establishing robust processes, standards, and accountability for availability and reliability.
- Promote a metrics-driven culture focused on continuous improvement, driving initiatives to enhance operational KPIs such as MTTA/MTTR.
- Encourage a strong collaborative mindset by working closely with leadership to align operational priorities with business objectives and reliability targets.
- Develop team members through coaching, mentoring, and career advancement, empowering senior contributors to achieve maximum impact.
- Plan and manage on-call rotations, escalation protocols, and resource availability to guarantee sustainable support for mission-critical systems.
- Implement strong execution rigor through sprint planning, prioritization, and accountability, holding teams to elevated standards of delivery and performance.
Experience You’ll Need
- A passionate leader dedicated to building technically proficient teams, with over 12 years of experience in Software Development Engineering.

