Tailoring 0 resumes…

We'll move completed jobs to Ready to Apply automatically.

Site Reliability Engineer - Google Cloud Platform at Carousell Group | Ho Chi Minh City | RoboApply Jobs

This job posting is no longer active and is not accepting applications.

Site Reliability Engineer - Google Cloud Platform

Carousell GroupHo Chi Minh City

On-site Full-time

No Longer Active

Experience Level

Mid to Senior

Qualifications

Minimum Qualifications (Essential):Technical Skills: Demonstrated experience with Python and proficiency in at least one of Bash/Perl/Golang.Cloud & Orchestration: Proven experience in a cloud ecosystem (GCP or AWS) and relevant tools, with experience in managing, scaling, and troubleshooting containerized workloads using Kubernetes or similar orchestration tools.Infrastructure as Code: Practical experience with Terraform or alternative tools such as Ansible/Chef.Fundamental Knowledge: Strong understanding of operating systems, databases, and the fundamentals of distributed systems.DevOps Understanding: Comprehensive knowledge of DevOps culture, principles, and practices.Soft Skills: Self-driven, detail-oriented, and capable of independent problem-solving.

About the job

Why You Will Enjoy Being Part of Our Team:

Blameless Culture: We address incidents collaboratively. Our policy is clear: Resolve the issue first, analyze the root cause later — no blame, only solutions.
Fully Cloud-Based & Large-Scale Operations: Operating entirely within the Google Cloud Platform (GCP) ecosystem and Google Kubernetes Engine (GKE), we manage seamless auto-scaling during peak traffic events.
AI-Driven Processes: We utilize AI to enhance daily operations, automate log analysis and troubleshooting, and expedite software releases.
Empowerment Through Trust: Your access rights start minimal but expand as you demonstrate your skills. Master our systems, and you’ll earn the highest access privileges.

Key Responsibilities (50% Automation / 50% Operations):

This pivotal role demands robust engineering expertise, practical experience, and hands-on implementation skills. You will:

Serve as the primary point of contact for incident management, swiftly addressing issues as they arise.
Guarantee optimal performance, availability, and scalability of production systems.
Automate infrastructure provisioning in the cloud, including systems and software setups.
Design and manage build & release pipelines, configuration management, and code deployments across various environments.
Collaborate closely with the development team to refine deployment processes and strategies.
Identify and tackle challenges or opportunities in critical high-impact areas.

Your First 6 Months:

Months 1-2 (Learning Phase): Focus on understanding Chợ Tốt's core infrastructure. We will sponsor your learning through Coursera to obtain necessary Google Cloud / K8s certifications and familiarize you with our infrastructure across all three environments.
Months 3-6 (Execution Phase): Achieve mastery in the infrastructure, particularly in Production. You will manage support requests from Engineers, take on Group-level assignments, and engage in on-call responsibilities.

About Carousell Group

Chợ Tốt is advancing its technology foundation to fuel our next phase of growth, impacting millions of Vietnamese users. Our Site Reliability Engineering (SRE) team engages daily with open-source CNCF projects, constructing resilient platforms, automation workflows, and data engineering pipelines that facilitate the continual release of numerous microservices.Join us to tackle complex distributed system challenges in a dynamic, agile environment. We handle hundreds of millions of requests and manage data pipelines processing over a billion messages daily. Being part of the larger Carousell Group means your technical solutions can resonate on a regional scale!

This job posting is no longer active and is not accepting applications.

Site Reliability Engineer - Google Cloud Platform

Carousell GroupHo Chi Minh City

On-site Full-time

No Longer Active

Experience Level

Mid to Senior

Qualifications

About the job

Why You Will Enjoy Being Part of Our Team:

Blameless Culture: We address incidents collaboratively. Our policy is clear: Resolve the issue first, analyze the root cause later — no blame, only solutions.
Fully Cloud-Based & Large-Scale Operations: Operating entirely within the Google Cloud Platform (GCP) ecosystem and Google Kubernetes Engine (GKE), we manage seamless auto-scaling during peak traffic events.
AI-Driven Processes: We utilize AI to enhance daily operations, automate log analysis and troubleshooting, and expedite software releases.
Empowerment Through Trust: Your access rights start minimal but expand as you demonstrate your skills. Master our systems, and you’ll earn the highest access privileges.

Key Responsibilities (50% Automation / 50% Operations):

This pivotal role demands robust engineering expertise, practical experience, and hands-on implementation skills. You will:

Serve as the primary point of contact for incident management, swiftly addressing issues as they arise.
Guarantee optimal performance, availability, and scalability of production systems.
Automate infrastructure provisioning in the cloud, including systems and software setups.
Design and manage build & release pipelines, configuration management, and code deployments across various environments.
Collaborate closely with the development team to refine deployment processes and strategies.
Identify and tackle challenges or opportunities in critical high-impact areas.

Your First 6 Months:

Months 1-2 (Learning Phase): Focus on understanding Chợ Tốt's core infrastructure. We will sponsor your learning through Coursera to obtain necessary Google Cloud / K8s certifications and familiarize you with our infrastructure across all three environments.
Months 3-6 (Execution Phase): Achieve mastery in the infrastructure, particularly in Production. You will manage support requests from Engineers, take on Group-level assignments, and engage in on-call responsibilities.

Site Reliability Engineer - Google Cloud Platform

Experience Level

Qualifications

About the job

About Carousell Group

Revenue Operations Specialist

Quality Assurance and Health, Safety, and Environment Coordinator

Compliance Officer

Remote Lead Software Engineer

Senior Software Engineer - Remote Opportunity

Senior Software Engineer - Fully Remote

Remote Senior Developer at jobgether

Senior Software Developer - Fully Remote

Experienced Remote Senior Software Developer

Lead Software Engineer - Remote Opportunity

Senior Remote Software Engineer

Senior Software Engineer - Remote Opportunity

Senior Software Engineer - Fully Remote

Lead Product Designer - Remote Opportunity

Senior Design Leader - Remote Opportunity

Remote Senior Director of Product Design

Senior Director of Design - Remote

Remote Product Design Director

Senior Design Director - Remote Opportunity

Senior Director of Design - Remote Opportunity

Site Reliability Engineer - Google Cloud Platform

Experience Level

Qualifications

About the job

About Carousell Group

Site Reliability Engineer - Google Cloud Platform

Experience Level

Qualifications

About the job

About Carousell Group

Site Reliability Engineer - Google Cloud Platform

Experience Level

Qualifications

About the job

About Carousell Group