Position: Sr Site Reliability Engineer (US Citizen)
Location: Remote within United States
Responsibilities/What You'll Do:
- Perform operational duties for our FedRAMP cloud-based products: deployments, on-call, and incident management.
- Participate in regular deployment sync calls and Operations hand-offs.
- Manage all cloud infrastructure elements: AWS GovCloud, private cloud, containers, and VMs.
- Operate and improve our monitoring systems.
- Develop cloud automation, scripts, and Infrastructure as Code.
- Write and maintain Ops documentation.
- Resolve escalations and help prevent recurrence of incidents through process, monitoring, and reliability improvements.
- Contribute to and implement DevOps best practices within the group.
Qualifications/Your Background:
- 5+ years of experience as a Site Reliability Engineer
- Experience working within High/Moderate FedRAMP authorization levels; IL5/IL6 a plus
- Experience with the monthly continuous monitoring program of scanning, evaluating, patching and reporting on environment vulnerabilities.
- Experience interfacing with FedRAMP sponsoring agencies / JAB / PMO
- Relevant experience, preferably in an Operations environment
- Strong understanding of DevOps methodologies (CI/CD, IaC)
- Comfortable working with scripting languages
- Expertise in operating and troubleshooting microservices-based distributed systems
- Deep understanding of container orchestration tools (Kubernetes preferred)
- Hands-on experience with at least one public cloud (AWS preferred)
- Solid working knowledge of Linux administration and basic network troubleshooting
- Basic knowledge of database management
- Hands-on experience with infrastructure as code and automation tools (Ansible, Terraform, Jenkins)
- Knowledge of virtualization, cloud architecture and services, and automated deployments