Senior SRE/Observability Engineer

Irving, Texas

United Software Group
Job Expired
Description:

Irving, TX (Day 1 Onsite)
Local candidates only.

Data Visualization
NTT Data is seeking a highly skilled Senior SRE/Observability Engineer with a deep understanding of SRE practices and observability, and extensive experience in SLO/SLI creation and management. This role will drive the adoption of SRE and observability best practices across the Enterprise estate, using Enterprise observability services to deliver tangible business value.

Responsibilities:
Design, develop, and manage observability solutions, including identifying and validating metrics, centralizing them in GEM/Prometheus, and visualizing them in Grafana dashboards.
Write and manage complex queries and alert definitions.
Bridge the gap between Operations Support teams and SRE operations.
Configure and manage monitoring, alerts, and observability using a range of tools including GEM, Splunk, Netcool, ELK, and AIM.
Maintain deep technical knowledge and operational experience with tools like AppDynamics, GEM, AIM/ELK, Splunk, Prometheus, and Grafana.
Understand and write code (Java, Python, Ansible, etc.), programs, config files, and complex queries.
Establish design patterns for monitoring and benchmarking SLOs.
Provide thought leadership and strategy in implementing and maintaining observability solutions.
Create and maintain operational process documentation for observability solutions.
Optimize the Observability Suite for application monitoring.

Requirements:

• 5+ years of Grafana experience, or equivalent

• 2+ years of Python, Java, Ansible experience

• 3+ years of AppDynamics, GEM, AIM/ELK, Splunk, Prometheus experience
Additional Details
  • Skill Category : Regular
  • ICIMS RR ID : NA
  • Client Name : Wells Fargo & Company
  • Engagement Type : T&M Competitive 4+
  • Vertical : BFSI
  • Must-Have Primary Skill : Application Management-Application Modernization-Software Development and Services
  • Primary Skill: Yrs Experience : Expert (5+ Years Experience)
  • RC- Domain : Tools & Automation/Additional Tools
  • RC- Subdomain : 128 - Development, Design, Automation & Imaging (Scripting, PowerBuilder, DISM, ImageX, USMT, Creo, CA AutoSys, AutoCAD, Crystal Reports, Puppet, CHEF, Nagios, MDT/SCCM)
  • RC- Role : Build & Release Engineer
  • RC- Experience Level : III
  • RC- Geo Tier : US-3
  • RC- Exception Required : TBD
  • COVID-19 Vaccine Required? : (No Value)
  • Client-Accepted Visa Types : (No Value)
Date Posted: 24 May 2025