About the job
This Site Reliability Engineer position at Cathexis is based in Tysons, Virginia, and requires an active Top Secret clearance. The role centers on managing and optimizing Kubernetes clusters for both internal teams and government clients, supporting digital transformation and AI-driven projects.
Key responsibilities
- Maintain and scale Kubernetes clusters in production settings
- Monitor system health to ensure uptime and meet performance goals
- Support cloud infrastructure operations and automation through Infrastructure as Code (IaC)
- Assist with deploying data-driven AI solutions for government clients
Requirements
- Active Top Secret security clearance
- Extensive experience administering and troubleshooting Kubernetes environments
- Strong knowledge of cloud infrastructure, whether public or private
- Proven skills with Infrastructure as Code tools and practices
Cathexis provides program and project management, data analytics, and audit services to government agencies. The team values integrity, accountability, and collaborative growth, fostering an environment where employees can build their strengths and work toward shared goals.
