Site Reliability Engineer II

Austin, Texas

Procore Technologies
Job Expired
Job Description

What if you could use your technology skills to develop a product that impacts the way communities' hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it's also one of the world's least digitized industries. That's why we're looking for experienced Engineers to join our Cloud Platform Orchestration team to continue Procore's journey to revolutionize a historically underserved industry.

If you are a software engineer with a passion for creating an amazing experience for your fellow engineers, this is a great opportunity to join a newly formed team. As a Site Reliability Engineer on the Cloud Platform Orchestration team, you'll help build the core of Procore's cloud platform: a reliable, extensible, API-driven service orchestration platform capable of running thousands of services across the globe. We approach our work as an internal "product team", iteratively shipping new features with the goal of reducing friction and facilitating self-service for every engineer at Procore.

This position reports to the Manager of the Cloud Platform Orchestration team and can be based in our Austin, TX office.

What you'll do:

Contribute significantly to the architecture, design, and development of the container orchestration platform at the core of Procore's cloud platform

Build features that unlock global self-service capabilities for service authors and allow platform extensibility by other platform engineers

Use a collaborative approach to lead architectural design decisions that improve scalability and performance

Develop teammates by conducting code reviews, providing mentorship, pairing, and training opportunities

Serve as the subject matter expert of your domain, including tools, processes, and procedures that help guide others to create and maintain a healthy codebase

Facilitate an "open source" mindset and culture both across teams internally and outside of Procore through active participation in and contributions to the greater community

What we're looking for:

Bachelor's Degree in Computer Science, a related field, or comparable work experience

Coding experience with Go is required

2+ years of combined experience as a Software, Resiliency, or Reliability Engineer

Experience operating Kubernetes clusters at scale in Production

Contribution to open source projects (CNCF Incubating or Graduated projects preferred)

Experience working with software, platforms, and infrastructure at scale (we run thousands of hosts and have millions of users)

Strong experience documenting and driving process improvements

Growing as a technical leader on large initiatives with the ability to course-correct as needed

Technical Certifications are a plus

Experience with the following technologies is preferred:

Public Cloud (AWS, GCP, Azure)

Cloud automation tooling (e.g., Terraform, Crossplane, Ansible)

Service Mesh / Discovery Tooling (e.g., Consul, Envoy, Istio, Linkerd)

Continuous Deployment Tooling (e.g., ArgoCD, Argo Workflows, CircleCI, Spinnaker)

Date Posted: 28 April 2024