System Operations Lead

Marietta, Georgia

Leidos
Job Expired - Click here to search for similar jobs
Description

Leidos is at the forefront of technology and innovation, proudly serving both government and commercial sectors. Our commitment to customer success and our ethical values guide our operations. We are currently seeking a highly skilled System Operations Lead to optimize and manage our CDC center's hybrid cloud and on-prem infrastructure, contingent upon contract award.

As a candidate, you must be a US Citizen or Green Card holder with the ability to obtain a Public Trust security clearance and be based in the Atlanta metro area for hybrid onsite/telecommuting work.

If you thrive in a dynamic environment and are passionate about technology, we want to hear from you. The System Operations Lead will oversee our cloud and on-prem services. You will collaborate closely with product management, development, and training teams, managing operations, security, content management, and system engineering staff to ensure our environments are secure, scalable, reliable, and cost-effective. Your role will encompass daily operations management, including monitoring, provisioning, and performance optimization while enforcing best practices for security and compliance.

Primary Responsibilities:
  • Supervise on-prem and cloud environments (Primary: Azure, Secondary: AWS, Google Cloud, etc.) ensuring performance, availability, and security.
  • Oversee deployment, automation, monitoring, scaling, and troubleshooting of cloud infrastructure.
  • Monitor system performance, detect issues, and implement solutions to guarantee maximum uptime and reliability.
  • Manage ITSM requests, incidents, problems, and change fulfillment within SLA requirements.
  • Lead the Incident Response (IR) team and coordinate resolution efforts.
  • Implement and maintain monitoring and automation tools for on-prem and cloud environments.
  • Oversee CDM operations, vulnerability management, and other cyber functions associated with securing federal cloud environments.
  • Support the implementation of cloud service design and transition into service operations.
  • Encourage team training on implementing SAFe Agile management concepts in on-prem/cloud service operations.
  • Optimize cloud resources for cost efficiency and performance.
  • Ensure on-prem/cloud security, abiding by industry standards and best practices.
  • Develop and uphold management processes, workflows, and operational standards.
  • Build and maintain disaster recovery plans, ensuring data integrity.
  • Provide mentorship and leadership for the system operations team.
  • Stay informed on industry trends and emerging technologies.
Manage a team responsible for:
  • Infrastructure systems, applications, and processes while ensuring timely identification and resolution of issues, including Microsoft-based servers, databases, VMware, and Linux server instances.
  • Maintaining system backups and exploring opportunities for automation and optimization.
  • Managing a complex server-based enclave, conducting vulnerability management activities, and configuring active directory.
  • Identifying and correcting hardware and software issues.
  • Providing technical support to companion workgroups for overlapping projects while fostering good inter-departmental relations.
  • Documenting and maintaining clear, concise information for incident resolution.
  • Communicating with users and providing status updates on system outages as necessary.
  • Managing information assurance vulnerability alerts (IAVAs) and system security scanning in compliance with System Security Plans.
  • Coordinating IAVA responses and system security scans, remediation, and implementation of security updates.
  • Planning and executing IT enhancements and project work.
  • Maintaining the test lab and inventory management of necessary resources.
  • Providing professional support in response to calls and emails promptly.
Basic Qualifications:
  • Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
  • 12 years of experience in on-prem and cloud operations, infrastructure management, or a related field.
  • Hands-on experience with cloud platforms (preferably Azure, acceptable with AWS or GCP) and cloud management tools.
  • Strong knowledge of cloud security best practices and regulatory compliance.
  • Proficiency in automation tools (e.g., Terraform, Ansible, CloudFormation).
  • Excellent problem-solving skills and ability to manage high-pressure situations.
  • Experience with cost control, performance monitoring, and capacity planning in a cloud environment.
  • Familiarity with Atlassian Software (Jira & Confluence).
  • Exceptional communication and interpersonal skills.
  • At least 2 years of experience with Linux server administration (RHEL and/or CentOS) in an enterprise environment.
  • Minimum seven (7) years of experience supporting users and systems in IT and/or information security environments.
  • At least 2 years of experience with network switches, routers, and firewalls from Cisco or similar vendors.
  • Experience with Windows/Linux operating systems, VMware, VSphere architecture, and VCenter.
  • Familiarity with VDI architecture and environments (e.g., Citrix).
  • A solid understanding of advanced security protocols and standards.
  • Experience documenting information for security accreditation and certification.
  • In-depth understanding of software and security architectures.
  • Commitment to best practices, including maintenance windows and change control procedures.
  • Availability to respond to administration and maintenance problems on an on-call basis.
  • Willingness to travel up to 20% per year.
Preferred Qualifications:
  • Experience with containerization technologies (Docker, Kubernetes).
  • Experience using SAFe Agile concepts.
  • Familiarity with DevOps principles and CI/CD pipelines.
  • Expertise in cost management and optimization in the cloud.
  • Relevant cloud certifications (e.g., AWS Certified Solutions Architect, Azure Solutions Architect, Google Cloud Professional Cloud Architect).
March 20, 2025

Pay Range: $112,450.00 - $203,275.00

The Leidos pay range for this job level is a guideline only and may not guarantee compensation. Additional factors considered include responsibilities, education, experience, skills, and market data.

Date Posted: 28 March 2025
Job Expired - Click here to search for similar jobs