companyTensorwave Cloud logo

Site Reliability Engineer at Tensorwave | Las Vegas

Tensorwave CloudLas Vegas, Nevada
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Candidates should possess a Bachelor's degree in Computer Science, Computer Engineering, or a related technical field, or have equivalent practical experience, alongside over 5 years in DevOps or Infrastructure Engineering roles. Proficiency in low-level programming languages such as Rust or Go, coupled with extensive experience in Linux systems and configuration management, is essential. Hands-on experience with infrastructure tools like Terraform and Kubernetes is required, along with a solid grasp of systems programming and performance tuning.

About the job

At Tensorwave Cloud, we are dedicated to creating a seamless, secure, and resilient AI infrastructure on a large scale, breaking down barriers and redefining the norms to empower innovators and nurture AI advancements.

Role Overview

We are looking for a proactive Site Reliability Engineer with a robust software engineering background, tasked with the design, construction, and maintenance of highly scalable, secure, and resilient infrastructure.

In this pivotal role, you will engage in low-level systems design, automate infrastructure using contemporary tools, and ensure platform reliability.

This position is perfect for individuals who thrive at the intersection of systems programming and DevOps, proficient in coding with Go, JavaScript, Rust, C, or Zig while managing infrastructure with NixOS, Kubernetes, and Terraform.

Key Responsibilities

  • Design, build, and sustain infrastructure systems utilizing Linux and NixOS.

  • Utilize Terraform for infrastructure-as-code to provision and scale resources effectively.

  • Architect and operate Kubernetes clusters with an emphasis on performance, security, and automation.

  • Develop high-performance tools and internal utilities in Go or Rust.

  • Create and manage CI/CD pipelines for infrastructure and code deployments.

  • Monitor system performance, troubleshoot issues, and enhance reliability through observability tools.

  • Work collaboratively with engineering teams to support deployment strategies and development workflows.

Required Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience.

  • 5+ years of experience in DevOps, Site Reliability, or Infrastructure Engineering roles.

  • Proficiency in one or more low-level programming languages such as Rust or Go.

  • Extensive experience with Linux systems and configuration management.

  • Hands-on experience with Terraform, Kubernetes, and containerized environments.

  • Strong understanding of systems programming, performance tuning, and operating system internals.

  • Familiarity with CI/CD practices and infrastructure monitoring/alerting tools.

About Tensorwave Cloud

Tensorwave Cloud is at the forefront of AI infrastructure innovation, dedicated to building robust, secure, and scalable solutions that empower developers and support groundbreaking AI initiatives. Our mission is to eliminate barriers and redefine the standards for AI infrastructure.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.