Coupang logoCoupang logo

Director of Site Reliability Engineering - Coupang Pay

CoupangSeoul, South Korea
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Experience Level

Senior Level Manager

Qualifications

Qualifications Bachelor’s or Master’s degree in Computer Science or a related field. Extensive experience in operating and supporting an internet software portfolio. Proficiency in software configuration management and release engineering. Expertise in scaling internet software using modern practices and solutions. Strong knowledge of infrastructure and associated technologies. Experience with cloud service platforms (AWS/GCP/Azure) and familiarity with best practices for troubleshooting. Hands-on experience with observability platforms, including metrics, logging, and tracing. Exceptional problem-solving abilities and leadership skills during incident response, with a proven ability to mentor others. Demonstrated experience in leading a team and driving technical initiatives.

About the job

Director of Site Reliability Engineering – Coupang Pay

We seek an experienced leader to join our Site Reliability Engineering (SRE) team at Coupang Pay. In this pivotal role, you will exemplify exceptional operational and engineering excellence, ensuring the scalability and reliability of our FinTech capabilities across our extensive array of applications, platforms, and infrastructure. You will work closely with key stakeholders to define the technical strategy for reliability and lead a talented team of engineers and DBAs to execute this vision. Your ability to recruit and nurture top engineering and operational talent is essential, as is your skill in managing and delivering complex projects with multiple dependencies. As an effective leader and communicator, you will be instrumental in driving our success.

  • Lead and direct a team of skilled engineers and DBAs in the development and management of our production and development environments.
  • Formulate a comprehensive technology strategy focused on availability, resilience, and incident response across our FinTech portfolio.
  • Oversee the Observability Platform and establish best practices for instrumentation, enabling teams to monitor and respond to the health of their applications.
  • Drive the achievement of resilience objectives through rigorous scale and chaos testing.
  • Continuously enhance procedures based on insights gained from real experiences and simulated drills.
  • Establish baseline resilience requirements, instrumentation standards, and operational readiness checkpoints while tracking adherence.
  • Collaborate closely with DevOps and Data teams to provide seamless developer experiences.
  • Recruit, develop, and mentor exceptional individuals in SRE and related domains.

About Coupang

Coupang is a leading e-commerce platform renowned for its innovative technology and commitment to customer satisfaction. We are dedicated to creating a seamless shopping experience and empowering our teams to achieve excellence in all aspects of our operations.

Similar jobs

Browse all companies, explore by city & role, or SEO search pages.

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.