Site Reliability Engineer
Rolling contract
$40-$48/hour DOE
Hybrid in the Denver Tech Center
This engineering role contributes significantly to planning, implementing, and maintaining system monitoring and observability artifacts for a complex enterprise network. Collaborates closely with developers to integrate observability, encompassing APM, NPM, SNMP monitoring, log aggregation, JVM monitoring, and network device monitoring. Contribute to revision control using Git, collaborate on network performance monitoring, automate processes through Scripting in BASH and Python, and bring expertise in WiFi monitoring and analysis. This role blends technical proficiency with collaboration, delivering impactful contributions to our systems and infrastructure.
MAJOR DUTIES AND RESPONSIBILITIES
Designs, implements, enhances and troubleshoots observability artifacts in assigned areas.
Research architecture documents and release notes to define an observability strategy for cloud services, on premises services, and WiFi hardware
Builds observability dashboard in Splunk, Datadog and Grafana
Builds observability artifacts that monitor systems performance, reliability, and daily data processing including APM (Application Performance Monitoring), NPM (Network Performance Monitoring), JVM (Java Virtual Machine) and API performance
Document release notes for all artifacts developed and deployed for stakeholders
REQUIRED QUALIFICATIONS
Basic knowledge in using ticketing and software tools to support the current operations.
Basic knowledge of network devices and basic network appliances
Strong understanding of Linux and Unix operating systems (RHEL, Ubuntu, SUSE, and Rocky Linux).
Strong technical writing skills to be used for documentation of work completed
Strong understanding AWS cloud infrastructure and deployment concepts
Familiarity with automation tools and Scripting languages (eg, BASH, Python).
Experience with revision control using Git.
Expertise with the Application Performance Monitoring suites such as Datadog, and Grafana
Understanding of SNMP monitoring for effective polling and visualization.
Experience with log aggregation tools such as Splunk or Loki.
Exposure to JVM monitoring and network device monitoring.
Ability to perform duties in a fast-paced environment and ability to learn new technology quickly
Required Education
Masters or Bachelor's Degree in Engineering or related field or related work experience
Required Related Work Experience and Number of Years
Engineering work experience 5+ yrs
Related work experience 5+ years wireless network experience