Job Title: Systems Linux Engineer
Role Summary The Systems Linux Engineer is responsible for the administration, maintenance, and optimization of Linux-based systems across the organization. This role ensures system stability, performance, and security through proactive monitoring, patching, automation, and operational support, while adhering to organizational and industry standards.
Key Responsibilities
Linux System Administration
- Install, configure, and maintain Linux servers (e.g., Red Hat, CentOS, Ubuntu).
- Manage system resources, user accounts, file systems, and permissions.
- Ensure system availability, reliability, and optimal performance.
Patching & Upgrades
- Plan and execute OS patching, upgrades, and vulnerability remediation.
- Ensure systems are up to date with security patches and compliance requirements.
- Perform impact analysis and rollback planning for updates.
Performance Monitoring & Tuning
- Monitor system performance (CPU, memory, disk, network).
- Identify bottlenecks and optimize system performance.
- Implement capacity planning and resource optimization strategies.
Security Hardening
- Implement system hardening in line with security best practices (CIS benchmarks).
- Manage access controls, SSH configurations, firewall rules, and auditing.
- Work closely with security teams to remediate vulnerabilities.
Automation & Scripting
- Develop and maintain scripts for automation of routine tasks (Shell, Python).
- Implement configuration management and automation tools (Ansible, Puppet, Chef).
- Drive infrastructure-as-code practices where applicable.
Backup & Recovery
- Manage system backups and ensure data integrity.
- Perform regular backup testing and recovery drills.
- Support disaster recovery (DR) and business continuity planning (BCP).
Incident & Problem Management
- Troubleshoot and resolve system-related incidents and outages.
- Perform root cause analysis (RCA) for recurring issues.
- Provide L2/L3 support for Linux-related escalations.
Monitoring & Logging
- Configure and maintain monitoring and alerting tools.
- Analyze logs and system events to detect issues proactively.
- Ensure integration with enterprise monitoring solutions.
Documentation & Compliance
- Maintain system documentation, configurations, and SOPs.
- Ensure compliance with internal policies and regulatory requirements.
- Support audits by providing required system and security documentation.
Requirements
Technical Skills
- Expertise in Linux OS (RHEL, CentOS, Ubuntu, SUSE).
- Strong command-line and shell scripting skills.
- Experience with configuration management tools (Ansible, Puppet, Chef).
- Familiarity with virtualization (VMware, KVM) and cloud platforms (AWS, Azure, GCP).
- Knowledge of networking concepts (TCP/IP, DNS, NFS, SSH).
- Experience with monitoring tools (Nagios, Zabbix, Prometheus).
- Understanding of containers (Docker, Kubernetes) is a plus.
Soft Skills
- Strong troubleshooting and analytical skills.
- Good communication and collaboration abilities.
- Ability to work in a fast-paced, operations-driven environment.
- Attention to detail and proactive mindset.
Preferred Certifications
- Red Hat Certified System Administrator (RHCSA) / RHCE
- Linux Professional Institute Certification (LPIC)
- Cloud certifications (AWS/Azure – optional)