Senior/Principal DevOps Engineer

AghanimLisbon

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

We are seeking candidates with the following qualifications:Strong experience with cloud platforms, particularly Google Cloud Platform (GCP). Proficient in Infrastructure as Code tools, particularly Terraform. Experience with container orchestration, specifically Kubernetes. Knowledge of observability tools like Datadog. Excellent problem-solving skills and a proactive approach to incident management. Strong communication skills to collaborate effectively across teams.

About the job

Aghanim is hiring a Senior/Principal DevOps Engineer in Lisbon. This position centers on owning and improving a fully cloud-native platform, built on Google Cloud Platform (GCP) and Cloudflare, and monitored through Datadog. The infrastructure is managed with Infrastructure as Code and automated CI/CD pipelines via GitHub Actions.

Role Overview

This is a hands-on role with significant responsibility. The Senior/Principal DevOps Engineer ensures the platform stays reliable during heavy traffic and rapid growth. The work includes meeting strict SLA/SLO targets, supporting scaling from 10 to 50 times current loads, and optimizing for both efficiency and cost as the company and its microservices expand.

Main Responsibilities

Cloud Infrastructure Management

Oversee and improve production infrastructure on GCP and Cloudflare (cloud-only, no on-premises systems).
Maintain high availability and performance for a SaaS platform serving both B2B and B2C customers.

Scalability and Highload Management

Design and operate systems that handle sudden traffic spikes, with increases up to 10–20 times within seconds.
Develop strategies for scaling compute, network, and data layers: autoscaling, capacity planning, and safe degradation.

SLA/SLO and Incident Management

Monitor and take responsibility for reliability metrics: availability, latency, and error rates as defined by SLA/SLO.
Lead incident response, from detection through mitigation, postmortem analysis, and implementing permanent solutions.

Infrastructure as Code and Kubernetes Operations

Build and maintain Infrastructure as Code using Terraform and Terragrunt when needed.
Manage Kubernetes clusters on GKE, including upgrades, scaling, and security improvements.
Create and maintain Helm charts and Kubernetes manifests.

Observability with Datadog

Implement and maintain observability systems in Datadog: metrics, logs, APM, dashboards, monitoring, and alerting.

About Aghanim

Aghanim is a forward-thinking technology company focused on delivering innovative SaaS solutions. We prioritize reliability and scalability, providing a robust platform that empowers both businesses and consumers. Join us to be part of a dynamic team that values creativity and technical excellence.

Similar jobs

1 - 20 of 440 Jobs

Search for Senior Principal Devops Engineer

440 results

Select all on this page (20)

Apply

Senior/Principal DevOps Engineer

Aghanim

Full-time|On-site|Lisbon

Aghanim is hiring a Senior/Principal DevOps Engineer in Lisbon. This position centers on owning and improving a fully cloud-native platform, built on Google Cloud Platform (GCP) and Cloudflare, and monitored through Datadog. The infrastructure is managed with Infrastructure as Code and automated CI/CD pipelines via GitHub Actions. Role Overview This is a hands-on role with significant responsibility. The Senior/Principal DevOps Engineer ensures the platform stays reliable during heavy traffic and rapid growth. The work includes meeting strict SLA/SLO targets, supporting scaling from 10 to 50 times current loads, and optimizing for both efficiency and cost as the company and its microservices expand. Main Responsibilities Cloud Infrastructure Management Oversee and improve production infrastructure on GCP and Cloudflare (cloud-only, no on-premises systems). Maintain high availability and performance for a SaaS platform serving both B2B and B2C customers. Scalability and Highload Management Design and operate systems that handle sudden traffic spikes, with increases up to 10–20 times within seconds. Develop strategies for scaling compute, network, and data layers: autoscaling, capacity planning, and safe degradation. SLA/SLO and Incident Management Monitor and take responsibility for reliability metrics: availability, latency, and error rates as defined by SLA/SLO. Lead incident response, from detection through mitigation, postmortem analysis, and implementing permanent solutions. Infrastructure as Code and Kubernetes Operations Build and maintain Infrastructure as Code using Terraform and Terragrunt when needed. Manage Kubernetes clusters on GKE, including upgrades, scaling, and security improvements. Create and maintain Helm charts and Kubernetes manifests. Observability with Datadog Implement and maintain observability systems in Datadog: metrics, logs, APM, dashboards, monitoring, and alerting.

Apr 15, 2026

Apply

Senior DevOps Engineer

inetum2

Full-time|On-site|Lisbon

We are seeking a talented and experienced Senior DevOps Engineer to join our dynamic team at inetum2. In this role, you will play a crucial part in designing, implementing, and managing our infrastructure and CI/CD pipelines. Your expertise will help us optimize our software development processes and ensure seamless deployment.As a key member of our team, you will collaborate with developers, system administrators, and other stakeholders to enhance our system's reliability and performance. You will also be responsible for troubleshooting and resolving issues in our development and production environments.

Oct 24, 2025

Apply

Senior DevOps Engineer

Altersolutions

Full-time|On-site|Lisbon

Join Altersolutions as a Senior DevOps Engineer and play a pivotal role in enhancing our cloud infrastructure. You will collaborate with cross-functional teams to design, implement, and maintain scalable software systems. Your expertise will ensure our operations run smoothly and efficiently.

Jun 5, 2025

Apply

DevOps Engineer

inetum2

Full-time|On-site|Lisbon

Join our dynamic team at inetum2 as a DevOps Engineer and contribute to innovative projects that enhance our operational efficiency. As a key player in our technology team, you'll collaborate closely with developers and operations to ensure seamless integration and deployment of applications.

May 14, 2025

Apply

Senior DevOps Engineer in the Telecom Sector

Devoteam

Full-time|On-site|Lisboa

Join our dynamic team at Devoteam as a Senior DevOps Engineer in the exciting Telecom Sector. In this role, you will leverage your expertise to streamline our cloud operations, enhance deployment processes, and ensure high availability of our services. You will collaborate with cross-functional teams to implement innovative solutions that drive our projects forward.

Mar 6, 2026

Apply

DevOps Engineer

inetum2

Full-time|On-site|Lisbon

Join our dynamic team at inetum2 as a DevOps Engineer, where you will play a crucial role in automating processes, improving system reliability, and enhancing the deployment pipeline. Your expertise will help our teams deliver high-quality software more efficiently.

Mar 27, 2026

Apply

Mobile DevOps Engineer

airapps

Full-time|On-site|Lisbon

airapps is looking for a Mobile DevOps Engineer in Lisbon to strengthen its technology team. This position centers on managing and improving deployment workflows for mobile applications. Role overview The Mobile DevOps Engineer will oversee mobile app deployment pipelines, focusing on smooth integration and delivery. The role calls for attention to detail and a drive to streamline processes. Key responsibilities Manage and optimize deployment processes for mobile applications Implement automation to support continuous integration and delivery Work with cloud services to improve operational efficiency Requirements Experience with automation tools and cloud platforms Background in mobile application deployment and integration

Apr 29, 2026

Apply

Senior DevOps/Site Reliability Engineer

Renesas Electronics Corporation

Full-time|On-site|Lisbon

About the Role Renesas Electronics Corporation is hiring a Senior DevOps/Site Reliability Engineer in Lisbon. This position focuses on strengthening infrastructure and maintaining dependable services across the organization. What You Will Do Work closely with development teams to coordinate deployments and support ongoing projects. Drive improvements in system performance and reliability. Develop and implement monitoring solutions to catch issues early and maintain service uptime. Location Lisbon

Apr 17, 2026

Apply

DevOps/SRE Engineer

inetum2

Full-time|On-site|Lisbon

Join inetum2 as a DevOps/SRE Engineer and play a crucial role in optimizing and automating our software development and deployment processes. You will collaborate with cross-functional teams to improve system reliability, scalability, and performance.

Oct 24, 2025

Apply

Senior DevOps Engineer (Kubernetes) - On-Site in Lisbon

DaCodes

Full-time|On-site|Lisbon, Lisbon, Portugal

Join our dynamic team at DaCodes, a leading software and digital transformation firm making significant impacts across various industries.With over a decade of experience, we pride ourselves on delivering innovative, technology-driven solutions through our talented team of over 220 #DaCoders, which includes developers, architects, UX/UI designers, project managers, and quality assurance testers. We collaborate with diverse clients across LATAM and the United States, consistently achieving outstanding results.At DaCodes, you will have the chance to advance your career, engage in diverse projects, and contribute to the development of cutting-edge, high-performance iOS applications.Our DaCoders are integral to our success, and you will have the opportunity to work with innovative startups and established global brands, applying your expertise to impactful projects.

Jun 30, 2025

Apply

DevOps Engineer - Backend Specialist

airapps

Full-time|On-site|Lisbon

airapps is looking for a DevOps Engineer with a focus on backend systems to join the team in Lisbon. This position centers on building and maintaining backend infrastructure that supports the company’s applications and services. Role overview The DevOps Engineer - Backend Specialist will take charge of implementing backend solutions that scale and remain reliable. Collaboration with development teams is a key part of the job, especially when refining deployment workflows and improving infrastructure. What you will do Develop and maintain backend systems to support business needs Work alongside developers to streamline deployment processes Contribute to infrastructure enhancements for better reliability and scalability Help ensure smooth, uninterrupted operation of applications and services Location This role is based in Lisbon.

Apr 29, 2026

Apply

DevOps Engineer - Cloud & Infrastructure Management

1GLOBAL

Full-time|On-site|Lisbon, Lisbon, Portugal

Join Our Team at 1GLOBALAt 1GLOBAL, we are at the forefront of mobile connectivity, providing advanced solutions for enterprises and consumers worldwide. With our cutting-edge telecom platform, including our proprietary global mobile core network and innovative eSIM technology, we deliver seamless communication services across 40 countries.We proudly serve a diverse clientele, including leading banks, global consumer goods giants, and innovative digital businesses. Our robust infrastructure connects over 70 million users and devices, empowering our partners to thrive in the mobile ecosystem.As a rapidly expanding and profitable enterprise, we surpassed US$200 million in revenue in 2025, with profits exceeding US$25 million. Our growth allows us to continually invest in our platform and global reach. Founded in 2022 by seasoned technology entrepreneurs, we are reshaping the telecommunications landscape as a regulated Mobile Virtual Network Operator (MVNO) in 12 nations and as a telecom operator in another 28.With headquarters in the Netherlands and R&D centres in Lisbon, Berlin, and São Paulo, our team of nearly 500 professionals is dedicated to redefining global mobile connectivity through technological excellence and innovation.About the RoleWe are seeking a skilled DevOps Engineer focused on Cloud and On-premises Infrastructure to enhance our Technology Department. In this role, you will design, deploy, and manage our runtime infrastructure, ensuring it remains secure, scalable, and cost-effective. You will be responsible for implementing container orchestration and service mesh architecture, maintaining multi-account AWS Organization setups, and automating processes to reduce manual intervention. Your expertise will also include system monitoring, maintenance, and ensuring optimal performance of our systems and applications.

Mar 24, 2025

Apply

Mid-Level/High-Level DevOps / SRE Engineer

Aghanim

Full-time|On-site|Lisbon

Aghanim is hiring a Mid-Level/High-Level DevOps / SRE Engineer in Lisbon. This role focuses on managing and improving our production platform, which runs on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE). Cloudflare sits at the front, Datadog provides observability, and CI/CD pipelines run through GitHub Actions. Work closely with Senior and Principal engineers to strengthen reliability, expand monitoring, and reduce manual operational work. The systems you support handle high loads and must be ready for sudden traffic spikes. What You Will Do Platform Operations (GCP/GKE) Manage and support production systems on GCP, with a focus on GKE and other managed services. Carry out platform enhancements and operational tasks as directed by more senior engineers. Infrastructure as Code & Delivery Enablement Apply infrastructure changes using Terraform and, where needed, Terragrunt. Develop and maintain Helm charts and Kubernetes manifests. Improve reliability of GitHub Actions and CI/CD workflows, including deployment automation. Monitoring & Observability (Datadog) Create and manage Datadog dashboards and monitors to ensure effective alerting. Find and address monitoring gaps in key system components. Refine alerts to cut noise and improve signal quality. Incident Management Participate in incident response and operational support: triage, mitigation using runbooks, escalation, and follow-up remediation. Contribute to postmortem reviews with clear facts, timelines, and actionable remediation steps. Security Fundamentals (DevSecOps) Set up and operate security tools and monitoring systems. Help triage findings and implement solutions under supervision. Promote secure-by-default practices such as secrets management, access control, and baseline hardening. Cost Awareness Understand and manage operational costs for the platform.

Apr 15, 2026

Apply

Senior Reliability Engineer - Data Infrastructure

ComplyAdvantage

Full-time|On-site|Lisbon, Portugal

What You'll Do: Join our dynamic Data Infrastructure team as a Senior Site Reliability Engineer (SRE). In this pivotal role, you'll ensure the reliability, availability, and performance of our essential data systems hosted on AWS and GCP. Your expertise in cloud infrastructure, automation, and operational excellence will play a key role in supporting our product for a diverse global clientele. As a Senior Site Reliability Engineer, your responsibilities will include: Designing, implementing, and maintaining robust and reliable data infrastructure services, encompassing SQL, NoSQL, Kafka, and Spark-based data layers. You will define and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs). Participating in an on-call rotation to swiftly address incidents and ensure quick resolution of production issues. Conducting thorough post-incident reviews to pinpoint root causes and implement preventative measures. Managing and automating cloud infrastructure using Terraform and Helm, following GitOps principles. Implementing and sustaining comprehensive monitoring, logging, and tracing solutions to proactively identify and resolve performance and reliability issues. Monitoring and managing data infrastructure capacity, planning for future growth, and optimizing performance through tuning and automation. Developing and maintaining automation scripts and tools to streamline operational tasks, enhance efficiency, and minimize manual effort. Ensuring the security and compliance of data infrastructure services by implementing best practices for access control, data protection, and vulnerability management. Collaborating with development and data engineering teams to facilitate smooth deployments and operational support while maintaining thorough documentation of infrastructure configurations, processes, and procedures. Managing and maintaining distributed databases within a Kubernetes environment. Our Tech Stack: Cloud-Based Infrastructure: Fully cloud-based with a Kubernetes-focused tech stack. Compute workloads operate in Kubernetes clusters across multiple regions. Infrastructure Management: Extensive use of Terraform and Helm, adhering to GitOps paradigms for managing cloud infrastructure and Kubernetes applications. Core Technologies: Significant utilization of Kafka, distributed PostgreSQL and Cassandra QL, Elasticsearch, and Databricks/Spark. Development of inter-cloud failover options to support multi-cloud strategies. Diverse Applications: Teams develop and deploy containerized applications for low-latency APIs, machine learning models, and data processing pipelines.

Mar 5, 2026

Apply

DevOps Consultant

inetum2

Full-time|On-site|Lisbon

Join our dynamic team at inetum2 as a DevOps Consultant, where you will leverage your expertise to enhance our clients' operational efficiency through innovative solutions. You will collaborate with cross-functional teams to implement best practices in DevOps, ensuring seamless integration and delivery.

Feb 18, 2025

Apply

Senior Data Engineer

inetum2

Full-time|On-site|Lisbon

Join our innovative team at inetum2 as a Senior Data Engineer, where you will play a pivotal role in designing and implementing data solutions that drive business insights and enhance decision-making processes. You will leverage your expertise in data architecture, ETL processes, and database management to build robust data pipelines and ensure data integrity across various platforms.

Jul 7, 2025

Apply

Senior Data Engineer

Altersolutions

Full-time|On-site|Lisbon

Join Altersolutions as a Senior Data Engineer and play a pivotal role in shaping the future of data management. In this dynamic position, you will leverage your expertise to design, develop, and maintain robust data pipelines and architectures, ensuring seamless data flow and accessibility across various platforms. You will collaborate with cross-functional teams to transform data into actionable insights, driving decision-making processes and enhancing business outcomes.

Jun 3, 2025

Apply

Senior Network Engineer

inetum2

Full-time|On-site|Lisbon

Join our dynamic team as a Senior Network Engineer at inetum2, where your expertise will play a crucial role in designing, implementing, and maintaining our network infrastructure. You will collaborate with cross-functional teams to ensure optimal performance, security, and scalability of our network services.

Sep 30, 2025

Apply

Senior Antibot Engineer

NielsenIQ

Full-time|On-site|Lisbon, Porto

Join our dynamic team at NielsenIQ as a Senior Antibot Engineer, where you will be at the forefront of developing cutting-edge solutions to enhance our digital security measures. In this role, you will leverage your expertise to analyze and combat bot-related threats, ensuring the integrity of our platforms. Work collaboratively with cross-functional teams to design and implement robust anti-bot frameworks that protect our data and user experience.

Apr 13, 2026

Apply

Principal Data Scientist at NielsenIQ | Lisbon

NielsenIQ

Full-time|On-site|Lisbon

NielsenIQ is seeking a talented and experienced Principal Data Scientist to join our dynamic team in Lisbon. In this role, you will leverage your expertise in data science to drive innovative solutions and insights that influence strategic business decisions. You will collaborate with cross-functional teams to develop predictive models, analyze complex data sets, and create actionable insights that support our clients in achieving their objectives.

Mar 19, 2026

Create account — see all 440 results