Hybrid, 3 days onsite, 2 days remote
We are unable to provide sponsorship, as this is a permanent, full-time role.
A prestigious company is looking for a Manager - Linux Operations. The manager will lead a team of Linux professionals focused on the stability, performance, security, and reliability of Linux infrastructure and operations. This role requires experience with technologies such as Ansible, Puppet, Red Hat, Docker, Kubernetes, Terraform, and CI/CD.
Responsibilities:
- Manage and mentor a team of Linux administrators, providing technical leadership and guidance.
- Oversee the daily operations and maintenance of Linux servers (on-prem and cloud).
- Develop and implement strategies for monitoring, performance tuning, and capacity planning.
- Ensure system availability, uptime, and business continuity through best practices and incident management.
- Lead root cause analysis of critical issues and implement preventive measures.
- Oversee patch management, system upgrades, and configuration management (e.g., Ansible, Puppet).
- Develop and maintain operational documentation, policies, and standard operating procedures.
- Collaborate with DevOps, security, and network teams to align infrastructure goals.
- Evaluate and recommend new tools, technologies, and methodologies to support business needs.
- Ensure compliance with security policies and data protection standards.
- Participate in budgeting, resource planning, and vendor management as needed.
Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
- 7+ years of experience in Linux systems administration, with at least 2 years in a leadership or managerial role.
- Deep understanding of Linux (Red Hat) operating systems and internals.
- Linux Experience: Advanced system administration, operational support, and problem resolution for a large, complex Linux computing environment, including both virtualized and physical servers. Ability to create and patch AMIs, work with pull requests, and write automation code using tools such as Ansible and Terraform.
- Cloud Experience: Strong knowledge of secure cloud infrastructure design and components such as servers, operating systems, networks, IAM, and storage. Cloud certifications, specifically AWS certification, are preferred.
- Infra Automation: Expert knowledge of the core automation development toolchain, including Terraform, Ansible, Jenkins, Git, and Harness.
- CI/CD Experience: Mastery of CI/CD best practices in a large organization (GitOps/DevOps, secure builds, secure code promotion, deployments with Harness/Argo, automated testing for applications and infrastructure, integration of policy frameworks, cost optimization, and SLSA best practices).
- Strong experience with virtualization (e.g., VMware, KVM), cloud platforms (AWS, Azure), and containerization (e.g., Docker, Kubernetes).
- Hands-on experience with infrastructure automation and configuration management tools.