About the job
About Radiant Industries
Radiant Industries, headquartered in El Segundo, CA, is pioneering the development of the world’s first mass-produced, portable nuclear microreactors. Our flagship product, Kaleidos, is a revolutionary 1-megawatt microreactor designed for transportability and can operate continuously for up to five years without the need for refueling. This innovative solution aims to replace conventional diesel generators, providing essential energy support for hospitals, data centers, remote locations, and military bases. We leverage cutting-edge software engineering practices to deliver safe, factory-built microreactors utilizing established, high-quality materials. Established in 2020, Radiant Industries is gearing up to test its inaugural reactor at the Idaho National Laboratory next year, with initial customer deliveries set to commence in 2028.
Position Overview
We are looking for an accomplished Technical Lead DevOps Engineer to spearhead our software infrastructure, deployment, and automation initiatives. In this pivotal role, you will collaborate with teams across the organization, working closely with the software development team to design scalable, secure, and resilient DevOps practices, tools, and systems. The infrastructure you oversee, the pipelines you construct, and the analytical tools you develop will be integral to the design, operation, and analysis of our groundbreaking reactor, which represents a major advancement in nuclear technology over the past 50 years. This is a unique opportunity to establish a robust foundation for our production environment.
Key Responsibilities:
Architect, implement, and maintain infrastructure across AWS and on-premises Linux environments, ensuring high availability, security, and optimal performance for mission-critical systems.
Oversee and enhance Kubernetes and Docker container orchestration to efficiently execute simulations, internal tools, and engineering applications at scale.
Design, build, and optimize CI/CD pipelines using Git, Argo, and other automation tools to facilitate rapid and reliable software delivery across engineering teams.
Develop internal tools that enhance build systems, testing frameworks, deployment automation, and developer environments.
Create computational analysis tools for our digital twin platform, implementing automated profiling and monitoring to optimize high-performance computing workloads for reactor modeling.
Implement and manage infrastructure-as-code practices using Terraform or similar tools to ensure reproducible, version-controlled infrastructure deployments.
Establish and maintain observability, monitoring, and logging infrastructure to guarantee system reliability and performance.

