Qualifications
Proficiency in Azure cloud services and infrastructure
Strong understanding of site reliability engineering principles
Experience with automation and monitoring tools
Ability to troubleshoot complex systems and resolve issues efficiently
Excellent communication and collaboration skills
Fluency in English (B2 level or higher)
About the job
Join Altersolutions as a Site Reliability Engineer specializing in Azure, where you will play a pivotal role in ensuring the reliability and performance of our cloud services. You will work closely with cross-functional teams to implement best practices and enhance system resilience. Your expertise will contribute to building scalable and efficient infrastructure, ensuring that our services are robust and available at all times.
About Altersolutions
Altersolutions is a forward-thinking tech company based in Lisbon, dedicated to delivering innovative solutions in cloud computing and IT services. We pride ourselves on fostering a vibrant workplace culture that emphasizes collaboration, growth, and continuous improvement.
Join our dynamic team as a Site Reliability Engineer, focusing on enhancing our cloud infrastructure. You will be responsible for ensuring the reliability and efficiency of our systems, collaborating closely with development and operations teams. Your expertise will help us maintain optimal performance while implementing innovative solutions.
About Us at GoCardless
GoCardless is a pioneering global bank payment solutions provider, trusted by over 100,000 businesses ranging from innovative startups to well-established enterprises. Our platform enables seamless collection and transfer of payments through direct debit, real-time payments, and open banking technology.
We process over US$130 billion in payments annually across more than 30 countries, simplifying the collection of both recurring and one-off payments without the hassle, stress, or burdensome fees. Leveraging AI-driven solutions, we enhance payment success rates while minimizing fraud. Our open banking connectivity with over 2,500 banks empowers our customers to make quicker, more informed financial decisions.
Headquartered in the UK with offices in London and Leeds, our team also operates in Australia, France, Ireland, Latvia, Portugal, and the United States.
At GoCardless, we prioritize supporting you! Our hiring process is designed to be inclusive and accessible. If you require additional support or adjustments, please connect with your Talent Partner; we are here to assist! Remember: while we have certain requirements, we encourage anyone excited about this role to apply!
Platform Engineering at GoCardless
The Platform Engineering team is a diverse, globally distributed group. Currently, we are positioned in London and Riga, with a new hub opening in Lisbon. We collaborate closely with all engineering teams to enable them to build, release, manage, and scale their products effectively.
Our focus combines strategic project delivery with operational excellence, with responsibilities including:
Project Delivery: Creating new platform components from initial design through to deployment.
Operational Support: Maintaining the health and stability of our systems through on-call rotations and effective incident management.
Business as Usual (BAU): Ongoing maintenance, improvements, and support for audits and compliance.
Our technology stack comprises Golang, Python, Ruby, Terraform, Atlantis, AWS, GCP, Kubernetes, GKE, GitHub, GitHub Actions, ArgoCD, Grafana, Prometheus, and Elastic.
What You'll Do:
Join our dynamic Data Infrastructure team as a Senior Site Reliability Engineer (SRE). In this pivotal role, you'll ensure the reliability, availability, and performance of our essential data systems hosted on AWS and GCP. Your expertise in cloud infrastructure, automation, and operational excellence will play a key role in supporting our product for a diverse global clientele.
As a Senior Site Reliability Engineer, your responsibilities will include:
Designing, implementing, and maintaining robust and reliable data infrastructure services, encompassing SQL, NoSQL, Kafka, and Spark-based data layers.
Defining and monitoring Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Participating in an on-call rotation to swiftly address incidents and ensure quick resolution of production issues.
Conducting thorough post-incident reviews to pinpoint root causes and implement preventative measures.
Managing and automating cloud infrastructure using Terraform and Helm, following GitOps principles.
Implementing and sustaining comprehensive monitoring, logging, and tracing solutions to proactively identify and resolve performance and reliability issues.
Monitoring and managing data infrastructure capacity, planning for future growth, and optimizing performance through tuning and automation.
Developing and maintaining automation scripts and tools to streamline operational tasks, enhance efficiency, and minimize manual effort.
Ensuring the security and compliance of data infrastructure services by implementing best practices for access control, data protection, and vulnerability management.
Collaborating with development and data engineering teams to facilitate smooth deployments and operational support while maintaining thorough documentation of infrastructure configurations, processes, and procedures.
Managing and maintaining distributed databases within a Kubernetes environment.
Our Tech Stack:
Cloud-Based Infrastructure: Fully cloud-based with a Kubernetes-focused tech stack. Compute workloads operate in Kubernetes clusters across multiple regions.
Infrastructure Management: Extensive use of Terraform and Helm, adhering to GitOps paradigms for managing cloud infrastructure and Kubernetes applications.
Core Technologies: Significant utilization of Kafka, distributed PostgreSQL and Cassandra QL, Elasticsearch, and Databricks/Spark. Development of inter-cloud failover options to support multi-cloud strategies.
Diverse Applications: Teams develop and deploy containerized applications for low-latency APIs, machine learning models, and data processing pipelines.
Join our dynamic and forward-thinking Tech Team as a Site Reliability Engineering Manager focused on Data Infrastructure. In this pivotal role, you will lead and inspire a talented group of Site Reliability Engineers (SREs) while collaborating closely with Engineering, Product, and Security teams. Your mission is to enhance the resilience, scalability, and security of our platforms, playing a critical part in the execution of our strategy to combat financial crime.
Key Responsibilities:
Oversee the growth and development of your team, including hiring and onboarding new members.
Foster a thriving environment that encourages innovation and high-quality outcomes.
Act as a mentor and coach, embodying a learning mindset and embracing new technologies.
Set strategic direction for your team in alignment with our overarching technology vision, making accountable tech decisions.
Utilize your expertise in cloud systems to inform technical decision-making.
Engage with stakeholders across engineering to ensure your team’s services meet the needs of internal customers.
Collaborate within your team and across the organization to maintain industry standards in implementation.
This role reports to the Director of Infrastructure and involves managing a team focused on our Stateful/Data layer technologies that power all services in both development and production environments. Our technology stack includes YugaByte (sharded Postgres), Kafka (via Strimzi), Elasticsearch (via ECK), Redis, and Spark/data warehousing on GCP and AWS utilizing their PaaS systems. Given the foundational nature of this technology stack, a collaborative mindset is essential.
Technology Stack Overview:
ComplyAdvantage operates entirely on a cloud-based architecture with a modern Kubernetes-centric tech stack. All computing workloads are managed in Kubernetes, with clusters distributed across multiple regions to cater to our global clientele. Our production services are strategically designed to be multi-cloud, currently hosted in both AWS and GCP.
We leverage Terraform and Helm for defining our infrastructure and services, adhering to GitOps paradigms, ensuring that both production and non-production environments are version-controlled in Git, with changes managed through this system.
Join Our Team at GoCardless
At GoCardless, we are revolutionizing the way businesses manage their payments. As a leading global bank payment provider, we empower over 100,000 businesses, from innovative start-ups to established enterprises, to efficiently collect and send payments using direct debit, real-time payments, and open banking solutions.
With an impressive track record of processing over US$130bn in payments annually across more than 30 countries, we simplify the payment process for our customers, enabling them to handle both recurring and one-off payments seamlessly. Our AI-driven technologies enhance payment success rates while minimizing fraud risks. Partnering with over 2,500 banks, we provide our clients with the tools to make faster, data-driven financial decisions.
Our headquarters are located in the UK, with additional offices in London, Leeds, Australia, France, Ireland, Latvia, Portugal, and the United States.
At GoCardless, we prioritize support and inclusivity throughout our hiring process. If you require any assistance or adjustments, please connect with your Talent Partner. We are here to ensure you have all the support you need!
Don’t worry if you don’t meet every single requirement; if this role excites you, we encourage you to apply!
Platform Engineering at GoCardless
The Platform Engineering team is a diverse, globally distributed unit, currently based in London and Riga, and expanding to our new hub in Lisbon. We collaborate with all engineering teams to empower them in building, releasing, operating, and scaling their products effectively.
Our focus lies in delivering strategic projects while maintaining operational excellence. We strive for a harmonious balance between:
Project Delivery: Crafting new platform components from conception to deployment.
Operational Support: Ensuring our systems are robust and efficient.
Join Altersolutions as a Site Reliability Engineer (SRE) in Lisbon, where you will play a pivotal role in ensuring the reliability and performance of our cutting-edge systems. As an SRE, you will collaborate with cross-functional teams to enhance the stability and scalability of our services, while implementing best practices in monitoring, automation, and incident response.
Your expertise will help us to strive for excellence in our product offerings, guaranteeing that our customers receive the highest level of service. If you are passionate about solving complex problems and want to be part of a dynamic team, we want to hear from you!
About the Role
Renesas Electronics Corporation is hiring a Senior DevOps/Site Reliability Engineer in Lisbon. This position focuses on strengthening infrastructure and maintaining dependable services across the organization.
What You Will Do
Work closely with development teams to coordinate deployments and support ongoing projects.
Drive improvements in system performance and reliability.
Develop and implement monitoring solutions to catch issues early and maintain service uptime.
Location
Lisbon
About the Role
As a critical member of the Site Reliability Engineering team at iCapital Network, you will play a pivotal role in ensuring that our platform consistently delivers dependable services to our esteemed clients. In this position, you will bridge the gap between software engineering and operational excellence by applying engineering principles to address infrastructure challenges. You will be tasked with designing and implementing scalable systems, architecting observability solutions for actionable insights, and developing automation strategies that enhance platform reliability. This role demands a systematic thinker who can effectively translate business needs into technical solutions and is passionate about fortifying complex systems.
Responsibilities:
Establish, implement, and refine service level objectives (SLOs) and service level indicators (SLIs) that align with customer and business expectations.
Standardize monitoring and alerting practices through “monitors as code” (preferably using Terraform), incorporating quality gates such as severity, ownership, and runbook links.
Develop and maintain observability standards encompassing metrics, logs, and traces, including instrumentation and dependency mapping patterns (OpenTelemetry where applicable).
Lead technical evaluations and proofs of concept for observability platforms and integrations; set success criteria and outline the migration strategy for adoption.
Define and implement reliability and operability standards for Kubernetes-based services, addressing scaling patterns, resource constraints, rollout safety, and establishing baseline dashboards and alerts during service onboarding.
Drive automation efforts to reduce toil, enhance repeatability, and expedite recovery processes (incident workflows, runbooks, and remediation where suitable).
Act as Incident Commander for high-severity incidents, facilitate postmortems, and promote continuous improvement through actionable items and measurable follow-through using established tooling workflows.
Join Air Apps as a Site Reliability Engineer (SRE)
At Air Apps, we are innovators at heart, committed to transforming the landscape of personal and entrepreneurial planning with our groundbreaking AI-powered Personal & Entrepreneurial Resource Planner (PRP). Established in 2018 in Lisbon, our family-founded company has grown to achieve over 100 million downloads globally, with offices in both Lisbon and San Francisco.
We thrive on challenging the status quo and are passionate about leveraging AI to create solutions that have a meaningful impact on people's lives. Here, you will have the opportunity to unleash your creativity and contribute to products that empower users worldwide.
We invite you to join our mission to redefine resource management and make a difference in the lives of individuals and entrepreneurs.
Your Role
As a Site Reliability Engineer (SRE), you will ensure the reliability, availability, and scalability of our systems. You will operate at the intersection of software development and operational excellence, implementing automation and performance optimization strategies to enhance system resilience.
Key Responsibilities
Design and implement scalable, reliable, and fault-tolerant systems in cloud environments.
Develop and maintain observability tools for monitoring, logging, and alerting (e.g., Prometheus, Grafana, Datadog, ELK).
Automate infrastructure provisioning and incident response using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
Enhance system performance, scalability, and incident response workflows to maximize uptime.
Collaborate closely with development and DevOps teams to improve system design and reliability.
Conduct root cause analysis (RCA) and implement preventative measures to reduce failures.
Ensure high availability through effective load balancing, failover, and disaster recovery strategies.
Improve CI/CD pipelines to accelerate deployment speed while maintaining stability.
Optimize cloud cost and resource usage for AWS, Azure, or Google Cloud Platform (GCP).
Participate in on-call rotations to respond to incidents and maintain system integrity.
Join Our Team at 1GLOBAL
At 1GLOBAL, we are at the forefront of mobile connectivity, providing advanced solutions for enterprises and consumers worldwide. With our cutting-edge telecom platform, including our proprietary global mobile core network and innovative eSIM technology, we deliver seamless communication services across 40 countries.
We proudly serve a diverse clientele, including leading banks, global consumer goods giants, and innovative digital businesses. Our robust infrastructure connects over 70 million users and devices, empowering our partners to thrive in the mobile ecosystem.
As a rapidly expanding and profitable enterprise, we surpassed US$200 million in revenue in 2025, with profits exceeding US$25 million. Our growth allows us to continually invest in our platform and global reach. Founded in 2022 by seasoned technology entrepreneurs, we are reshaping the telecommunications landscape as a regulated Mobile Virtual Network Operator (MVNO) in 12 nations and as a telecom operator in another 28.
With headquarters in the Netherlands and R&D centres in Lisbon, Berlin, and São Paulo, our team of nearly 500 professionals is dedicated to redefining global mobile connectivity through technological excellence and innovation.
About the Role
We are seeking a skilled DevOps Engineer focused on Cloud and On-premises Infrastructure to enhance our Technology Department. In this role, you will design, deploy, and manage our runtime infrastructure, ensuring it remains secure, scalable, and cost-effective. You will be responsible for implementing container orchestration and service mesh architecture, maintaining multi-account AWS Organization setups, and automating processes to reduce manual intervention. Your expertise will also include system monitoring, maintenance, and ensuring optimal performance of our systems and applications.
The Platform Infrastructure team at iCapital Network plays a crucial role in designing, standardizing, and advancing the foundational infrastructure that drives our global technology stack. In your position as Vice President of Platform Infrastructure Architecture, you will act as a key senior architect, shaping and refining the technical standards that support our Kubernetes platforms, cloud networking solutions, edge services, and API access methodologies.
This role prioritizes architectural design while also involving hands-on tasks to validate designs through reference implementations and proof-of-concept projects. You will collaborate closely with Platform Infrastructure engineers and application teams to guarantee that platform capabilities are scalable, secure, and consistent across various regions, all while ensuring operational practicality.
Role overview
inetum2 seeks an IT Cloud Infrastructure Administrator based in Lisbon. The position centers on managing cloud infrastructure and making sure services remain reliable and efficient.
What you will do
Oversee daily operations of cloud infrastructure and identify areas for improvement.
Collaborate with teams throughout the company to support cloud-based services.
Contribute to maintaining high service reliability and strong performance.
inetum2 is seeking a highly skilled Senior Infrastructure Engineer to join our dynamic team in Lisbon. In this role, you will be responsible for designing, implementing, and maintaining our infrastructure systems, ensuring optimal performance and security. You will collaborate with cross-functional teams to drive innovative solutions and enhance our technological capabilities.
The ideal candidate will possess a strong background in infrastructure management, cloud technologies, and network security. If you are passionate about technology and thrive in a fast-paced environment, we would love to hear from you!
inetum2 is seeking a highly skilled Senior Network and Cloud Infrastructure Architect to join our innovative team in Lisbon. In this pivotal role, you will be responsible for designing and implementing robust cloud infrastructure solutions that meet the needs of our diverse clientele. You will leverage your expertise in networking and cloud technologies to ensure optimal performance, security, and scalability of our systems.
As a Senior Architect, you will collaborate closely with cross-functional teams to deliver top-notch solutions that drive business success. Your ability to communicate complex concepts clearly and your passion for technology will be key to helping us achieve our strategic objectives.
Join inetum2 as a Junior Product Manager specializing in Infrastructure & Cloud services. In this role, you will work closely with our product team to develop and enhance cloud-based services that meet the evolving needs of our clients. Your contributions will help shape the future of our product offerings and drive innovation within the company. As a key member of the team, you will assist in defining product requirements, conducting market research, and ensuring the successful launch of new features. If you're passionate about technology and eager to grow your career in product management, we want to hear from you!
SoSafe aspires to be the premier provider of human risk management solutions in Europe. Our award-winning awareness platform drives behavioral change through engaging and effective training and simulations focused on cybersecurity and data protection. With cybercrime costing the global economy over $10 trillion annually and increasing by 15% each year, we invite you to be part of the solution!
Your Impact:
Design, develop, and manage a robust cloud infrastructure that supports our SaaS offerings, prioritizing security and operational efficiency.
Implement and utilize cutting-edge cloud services and solutions, such as serverless architectures, containerization, and orchestration technologies.
Work alongside development teams to enhance CI/CD processes, improving deployment strategies and automation.
Foster a culture of continuous learning and collaboration with team members and strategic stakeholders.
Research and incorporate emerging cloud technologies and tools, contributing to the ongoing enhancement of our cloud environment.
Join inetum2 as a Senior Cloud Engineer and be part of a dynamic team dedicated to transforming cloud solutions. We are looking for experienced professionals who can design, implement, and manage cloud infrastructure, ensuring optimal performance and security.
Join inetum2 as a Senior Cloud Systems Engineer and take your career to the next level! In this pivotal role, you will leverage your expertise in cloud technologies to design, implement, and maintain innovative cloud solutions that drive our business forward.
As a key member of our engineering team, you will collaborate with cross-functional teams to optimize cloud infrastructure and ensure the highest levels of security and performance. Your contributions will directly impact our clients' success and help shape the future of technology at inetum2.
About Tripadvisor
The Tripadvisor Group connects individuals to unforgettable experiences, aiming to be the most trusted source for travel and adventures globally. Leveraging our extensive brands and technology, we engage our worldwide audience with partners through rich content, insightful travel guidance, and two-sided marketplaces for experiences, accommodations, restaurants, and various travel categories. Tripadvisor, Inc. (Nasdaq: TRIP) encompasses a diverse portfolio of travel brands, including Tripadvisor, Viator, and TheFork.
We are seeking a proactive Cloud Security Engineer II (AWS) to serve as the primary defense for the Tripadvisor Experiences platform. This vital mid-level role integrates proactive security engineering with reactive incident response. You will immerse yourself in our cloud environment, continuously monitoring for threats, addressing security incidents, automating defenses, and collaborating with our engineering teams to enhance platform resilience.
Job Location: This is a remote or hybrid position based in Portugal. Occasional travel to company offices may be required.
Mar 19, 2026