About the job
Join akeno as a Staff Platform/DevOps Engineer and work with our DevOps Lead to improve our platform’s delivery, automation, and observability across deployment environments. We build B2B software for industrial clients, with a strong emphasis on reliability, traceability, and maintainability throughout the entire software lifecycle.
You will work closely with engineering teams to design and manage the CI/CD pipelines, runtimes, and tools that move software from development to production. You will own the CI/CD architecture and define how code progresses from commit to production safely and consistently. This includes overseeing pipeline management, implementing GitOps workflows, and establishing clear protocols for rollback and change control.
Furthermore, you will spearhead our observability initiatives by optimizing our existing stack, defining effective telemetry standards, and ensuring all services provide actionable insights with meaningful service-level objectives (SLOs). As our architecture evolves, you will assess and integrate cutting-edge tools and practices that enhance delivery speed, reliability, and traceability, always considering operational constraints and trade-offs.
Your Key Responsibilities:
- CI/CD: Design, implement, and manage pipelines using GitHub Actions, Terraform, and Ansible to shorten feedback cycles and ensure reproducibility.
- Lifecycle Integration: Automate the full software lifecycle from development to operations, embedding validation, observability, and continuous improvement.
- GitOps: Standardize repository structures, manage environment promotions, and document where GitOps methodologies apply.
- Runtime & Deployments: Oversee and enhance the runtime stack, selecting appropriate rollout strategies based on risk assessment.
- Observability: Collaborate with engineering teams to integrate robust logging and metrics across services.
- Architecture Evolution: Lead the adoption of modern DevOps tools and practices that align with team needs.
- Security & Resilience: Improve procedures for secrets management and backup/restoration, ensuring baseline security requirements are met.
- Reliability in Practice: Identify issues and resolve them through well-tested updates, implementing tools and practices that meet operational needs.

