Site Reliability Engineer

Irving, Texas

Vital Tech Solutions
Hybrid (3 days on-site, 2 days remote)

No C2C candidates

We are looking for a highly skilled Senior Site Reliability and Operations Engineer (SRE) with extensive experience implementing Kubernetes-based distributed caching and compute grid solutions.

Responsibilities
  • Design, develop, and optimize distributed caching and compute grid solutions on Kubernetes/OpenShift.
  • Apply an understanding of microservices and containerized workloads using Kubernetes, Docker, and Helm.
  • Implement high-throughput compute grid solutions using Apache Ignite, GridGain, Coherence, or similar technologies.
  • Optimize application performance by leveraging caching strategies, load balancing, and efficient data distribution.
Required Skills / Qualifications:
  • 7+ years of strong experience with Kubernetes (OpenShift and on-prem/cloud clusters).
  • Understanding of programming languages like Java, Go, or Python.
  • Experience with containerization and packaging tools (Docker, Helm, etc.).
  • Strong knowledge of CI/CD pipelines (Jenkins, ArgoCD, GitHub Actions).
  • Hands-on experience with observability tools (Prometheus, Grafana, Loki, Jaeger).
  • Understanding of networking, service meshes (Istio/Linkerd), and security best practices in Kubernetes.
  • Experience with multi-cluster and hybrid cloud Kubernetes deployments.
Vital Tech Solutions is an Equal Opportunity/Affirmative Action employer. We prohibit discrimination in decisions concerning recruitment, hiring, compensation, benefits, promotions, training, termination, or any other condition of employment or career development.

All applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, marital status, national origin, veteran status, disability status, or any other legally protected status.

Date Posted: 22 April 2025