Lead Site Reliability Engineer

Cambridge, Massachusetts

Massachusetts Institute of Technology
Apply for this Job
Lead Site Reliability Engineer
  • Job Number: 24909
  • Functional Area: Information Technology
  • Department: Office of Research Computing and Data
  • School Area: VP Research
  • Employment Type: Full-time (Hybrid)
  • Employment Category: Exempt
  • Visa Sponsorship Available: No
  • Schedule:
Email a Friend Save Save Apply Now

Posting Description

LEAD SITE RELIABILITY ENGINEER, Office of Research Computing and Data (ORCD), to build and advance SRE functions in collaboration with a diverse team of systems engineers; play a pivotal part in the strategic transformation of infrastructure planning, design, delivery, and operations in support of ORCD's continued growth; and build and foster cross-functional collaboration between engineering and operations teams across MIT, ensuring alignment with institutional objectives and long-term strategic initiatives.

Find the full job description here:

Job Requirements

REQUIRED: Bachelor's degree in engineering, computer science, related field or equivalent industry experience; a minimum of seven years of experience in site reliability engineering or a related field; possess a deep and broad expertise across multiple technical domains, including Linux, networking, and virtualization; ability to drive innovation in system architecture and lead transformative design initiatives from the ground up; robust analytical and structured problem-solving skills, coupled with excellent communication and inter-personal abilities; deep understanding of Linux, LDAP, virtualization & config management in a large Linux-based engineering environment. PREFERRED: 10+ years of experience in site reliability engineering; experience working within an HPC/research computing environment; ability to analyze network traffic to identify technical issues and suspicious activities. Job

4/8/2025

Date Posted: 23 April 2025
Apply for this Job