We are unable to sponsor for this permanent Full time role
Position is bonus eligible
Prestigious Financial Company is currently seeking a Manager of Linux Operations and Administration. Candidate will lead and oversee our Linux infrastructure operations team. This role is responsible for ensuring the stability, performance, and security of all Linux-based systems and services, managing day-to-day operations, and driving strategic initiatives to optimize infrastructure performance and reliability.
Responsibilities:
- Manage and mentor a team of Linux administrators, providing technical leadership and guidance.
- Oversee the daily operations and maintenance of Linux Servers (on-prem and cloud).
- Develop and implement strategies for monitoring, performance tuning, and capacity planning.
- Ensure system availability, uptime, and business continuity through best practices and incident management.
- Lead root cause analysis of critical issues and implement preventive measures.
- Manage patch management, system upgrades, and configuration management (eg, Ansible, Puppet).
- Develop and maintain operational documentation, policies, and standard operating procedures.
- Collaborate with DevOps, security, and network teams to align infrastructure goals.
- Evaluate and recommend new tools, technologies, and methodologies to support business needs.
- Ensure compliance with security policies and data protection standards.
- Participate in budgeting, resource planning, and vendor management as needed.
Qualifications:
- Deep understanding of Linux (Red Hat) operating systems and internals.
- Strong experience with virtualization (eg, VMware, KVM), cloud platforms (AWS, Azure), and containerization (eg, Docker, Kubernetes).
- Hands-on experience with infrastructure automation and configuration management tools.
- Solid understanding of networking concepts, security best practices, and monitoring systems.
- Proven ability to manage large-scale production environments and lead teams through incident response and recovery.
- Excellent communication, leadership, and organizational skills.
- Preferred Linux certifications (eg, RHCE)
- Preferred Experience with ITIL frameworks and service management tools.
- Preferred Background in supporting CI/CD pipelines and agile workflows.
- Linux Experience: Provide advanced system administration, operational support and problem resolution for a large complex Linux computing environment, including both virtualized and physical servers. Create and Patch AMIs, perform pull requests, write Automation code using tools such as Ansible, Terraform, etc.
- Cloud Experience - Strong knowledge of secure cloud infrastructure design and components, such as: servers, operating systems, networks, IAM, and storage. Cloud Certifications, specifically AWS Cloud certification would be preferred.
- Infra Automation - Expert knowledge in core automation development toolchain including Terraform, Ansible, Jenkins, Git, Harness.
- CICD Experience - Mastery of CICD best practices in a large organization. (GitOps/DevOps, secure builds, secure code promotion, deployments (Harness\Argo), automated testing (app and infra), integration of policy frameworks, cost-optimization, SLSA best practices)
- Resilient Design - Experience with architecting, implementing and maintaining highly available mission critical environments for 24/7 availability.
- Communication Skills - Great communication skills, especially with working with diverse and distributed teams
- Strong analytical and problem-solving skills
- Deliver on commitments - Ability to work independently as well as lead a team to solve complex problems in a timely manner.
- Bachelor's degree in computer science, Information Technology, or related field (or equivalent experience).
- 7+ years of experience in Linux systems administration, with at least 2 years in a leadership or managerial role.