Apply for this Job
If you're seeking a sense of community and the ability for growth, look no further. Since 1982, we have been 100% dedicated to our people. Our approach permits greater ownership for individuals and welcomes input into decisions for a thriving workplace and happy employees. Our people are the core reason for AIS' success. As an employee owned company, we are looking for individuals that are passionate about finding innovative solutions, and excited about emerging technologies and capabilities. Introduction We are seeking a skilled Site Reliability Engineer to join our cross-functional scrum team for the DevOps - System Development Services (DO-SDS) project. The successful candidate will be responsible for ensuring the reliability, availability, and performance of the cloud infrastructure and applications. This role involves collaborating with development teams to design, build, and maintain scalable and resilient systems. What you will be doing Infrastructure Management: Design, deploy, and manage scalable, highly available, and secure cloud infrastructure using Infrastructure as Code (IaC) principles such as Terraform and ARM templates.
Monitoring and Optimization: Implement proactive measures to monitor, analyze, and optimize the cloud environment ensuring high availability and optimal resource utilization.
Incident Management: Respond to and resolve incidents related to cloud infrastructure and applications, ensuring minimal downtime and impact on users.
Automation: Develop and maintain automation scripts and tools to streamline operations and improve efficiency.
Collaboration: Work closely with development teams to ensure seamless integration of applications and services, and provide guidance on best practices for reliability and performance.
Security and Compliance: Implement and maintain security best practices, including access controls, encryption, and identity management using Azure AD and other tools.
Documentation: Maintain comprehensive documentation of cloud infrastructure, configurations, and processes to ensure knowledge sharing and continuity.
Training and Knowledge Transfer: Provide training and knowledge transfer to junior engineers and other team members on cloud infrastructure and Site Reliability Engineering practices.
Location and Clearance Requirements This is a remote position with occasional travel. The ability to obtain and maintain a Public Trust Clearance is required. Required for this Opportunity Experience: Minimum of 3 years of experience in Site Reliability Engineering or a related field.
IaC Development: Minimum of 3 years of experience developing and deploying Infrastructure as Code (IaC) using Terraform and ARM templates.
Cloud Technologies: Minimum of 3 years of experience with cloud technologies, preferably Azure.
Azure Certification: Azure Certification (e.g., Microsoft Certified Azure Administrator Associate or Azure Solutions Architect Expert).
Knowledge: Demonstrated knowledge of cloud services, including virtual machines, storage, networking, and Azure AD.
Scripting: Proficiency in scripting languages such as PowerShell, Bash, and Python.
Monitoring Tools: Hands-on experience with monitoring tools such as Prometheus, Grafana, and Azure Monitor.
Communication Skills: Exceptional verbal and written communication skills to effectively collaborate with team members and stakeholders.
Nice To Have Skills Prior experience with the Treasury is nice to have. Applied Information Sciences does not discriminate on the basis of race, national origin, religion, color, gender, sexual orientation, age, disability, protected veteran status, or any other basis. Employment decisions are based solely on qualifications, merit, and business needs.
Date Posted: 17 May 2025
Apply for this Job