Job Title: Senior DevOps and Infrastructure Engineer
Location: Santa Clara, CA
Pay: $60-$75/HR
ONSITE, can transition to hybrid depending on performance
Interviews: 2 rounds
Benefits: Health, Dental, Vision, 401K + More. 18 days of PTO.
Duration: Renews every 6 months, likely 18 months+
Job Description:
NVIDIA is looking for a Senior DevOps and Infrastructure Engineer to work in the Cloud Infrastructure Team of IPP (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. The group works with other groups within NVIDIA, such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence, and Driverless Cars, to meet their infrastructure needs. Its cloud services run almost half a million automated jobs per day on thousands of servers, supporting the productivity of thousands of NVIDIA's software engineers worldwide. The cloud hosts a heterogeneous mix of machines and devices running various operating systems (Windows/Linux/Android) across a multitude of hardware platforms, both NVIDIA GPUs and Tegra processors. Are you passionate about distributed infrastructure and looking for sophisticated, critical issues to solve? Are you ready to build the next generation of cloud services, design creative solutions, and mine through data to uncover real problems and fix them? We are excited to onboard a fun-loving person like you.
What you'll be doing
- Support the OpenStack team's operations queue, working within our Infrastructure and Cloud Operations environments.
- Support cloud-based operations: monitoring, troubleshooting, and assisting with the triage and resolution of bugs and KPI investigations related to systems.
- Work closely with world-class SRE, PaaS, and infrastructure engineers, architects, technical product managers, and application developers to put the best strategies in place for a product launch.
- Solve sophisticated problems involving multi-site OpenStack deployments supporting NVIDIA products.
- Collaborate with multi-functional teams, including system engineering, software engineering, mechanical/thermal engineering, operations, data center teams, external vendors, and other partners to successfully deliver a reliable and robust platform from concept to prototype to deployments.
- Directly contribute to the overall quality of deployments and improve time to market for next-generation products.
What we need to see
- Bachelor's or Master's Degree in CS/Equivalent.
- 5+ years of relevant experience.
- OpenStack platform experience, including Nova, Nova Placement, Neutron, Glance, Cinder, and Ironic.
- Bare-metal deployment experience.
- Proficiency in Linux, Ansible, Python and Bash
- Physical, on-premises experience with actual machines. WE ARE NOT LOOKING FOR CONTAINERIZED VIRTUAL EXPERIENCE.
- Monitoring experience - IPMI, KVI
- Networking experience - PXE process, including host configuration (BIOS and network card), general network configurations such as VLANs, switch port settings, and cabling.
Exact compensation may vary based on several factors, including skills, experience, and education.
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.