About the job
As a Linux System Administrator at Inetum, you will play a crucial role in ensuring the availability, performance, and security of Linux server environments for our diverse clientele. You will deliver operations, automation, and Level 2/3 support while participating in a rotating on-call schedule to maintain service continuity.
Context:
- Join a shared services team of five administrators dedicated to supporting multiple client environments in a multi-tenant architecture.
- Take responsibility for the operational availability, performance, and security of our Linux infrastructure across both on-premises and cloud platforms.
- Take part in a rotating on-call schedule to uphold service continuity in line with contractual SLAs.
Main Responsibilities:
System Administration:
- Install, configure, and maintain Linux servers (such as Ubuntu, Debian, CentOS, RHEL).
- Perform regular patching, upgrades, and system hardening.
- Manage users, groups, permissions, and access policies effectively.
- Ensure system stability and performance, and carry out capacity planning.
Shared Services and Platforms:
- Oversee shared services (e.g., DNS, NTP, SMTP relays, web servers, file servers).
- Provide support to application teams utilizing the shared Linux infrastructure.
- Maintain and optimize virtualization and/or container platforms (VMware, KVM, Docker, Kubernetes, based on the environment).
- Contribute to the standardization of environments and configurations across teams.
Monitoring, Incident Management, and On-call:
- Configure and maintain monitoring and alerting tools (e.g., Zabbix, Prometheus, Nagios, Grafana).
- Investigate and resolve incidents affecting shared systems and services.
Participate in a rotating on-call duty schedule (24/7 support):
- Respond to alerts within defined SLAs.
- Conduct first-level analysis and mitigation, and escalate issues as necessary.
- Communicate effectively with stakeholders during major incidents.
- Contribute to post-incident reviews (RCA) and ongoing improvement actions.
Security and Compliance:
- Implement security best practices and hardening guidelines on Linux systems.
- Collaborate with security teams to identify, prioritize, and remediate vulnerabilities.
- Manage firewalls and VPN access, and ensure secure remote access (SSH, bastion/jump hosts).
- Ensure compliance with internal policies and external regulations when applicable.
Backup, Recovery, and Continuity:
- Regularly test recovery and disaster scenarios.
- Assist in defining and meeting RPO/RTO targets for critical services.
Automation and Industrialization:
- Utilize scripting languages (Bash, Python, etc.) to automate repetitive operational tasks.
- Employ configuration management tools (e.g., Ansible, Puppet, Chef, SaltStack) to maintain consistent environments.
- Contribute to infrastructure as code and standard operating procedures.

