company

Infrastructure Engineer

HappyRobotSan Francisco
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Must-Have QualificationsMinimum of 3 years of hands-on experience in debugging production systems, including logs, traces, incidents, etc. Demonstrated strong problem-solving capabilities and the ability to navigate unfamiliar backend codebases. Proficient in Go and Kubernetes. Familiar with observability and monitoring tools, such as Datadog, Prometheus, and Sentry. Ability to communicate clearly and calmly under pressure, especially during live incidents.

About the job

About HappyRobot

HappyRobot is pioneering the AI-native operating system for the real economy, bridging the gap between intelligence and action. By harnessing real-time truths, specialized AI workers, and orchestrating intelligence, we empower enterprises to manage complex, mission-critical operations with unprecedented autonomy.

Our AI OS accumulates knowledge, optimizes processes at every level, and evolves continually. Our initial focus is on supply chain and industrial-scale operations, where resilience, speed, and ongoing improvement are paramount—liberating humans to engage in strategy, creativity, and other high-value endeavors.

To explore our vision further, check out our Manifesto. To date, HappyRobot has successfully raised $62 million, including a recent $44 million in Series B funding in September 2025, with support from esteemed investors like Y Combinator (YC), Andreessen Horowitz (a16z), and Base10—partners dedicated to our mission of redefining enterprise operations. We are using this investment to build a world-class team of individuals with relentless drive, exceptional problem-solving skills, and a passion for pushing boundaries in a dynamic, high-intensity environment. If this resonates with you, we invite you to join us at HappyRobot.

About the Role

We are in search of an Infrastructure Engineer to spearhead the enhancement of our operational resilience as we scale. You will be responsible for the stability, observability, and debugging processes that ensure our systems operate seamlessly. As the primary troubleshooter for complex failures in real-time, you will design tools that transform chaos into clarity and assist in transitioning our operations from reactive to proactive.

This role carries significant impact and trust, as you will influence how we approach reliability—reducing incident frequency, creating internal tools, and directly enhancing developer focus and system uptime. If you thrive on uncovering the root causes of challenging issues and fortifying systems (and teams), this is your opportunity.

About HappyRobot

HappyRobot is at the forefront of developing an AI-native operating system that revolutionizes how enterprises operate, enhancing their ability to execute complex tasks with autonomy and efficiency. With significant backing from prominent investors, we are committed to building a powerful team that thrives in fast-paced environments.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.