HPC Linux Operations Engineer Chicago & London

London

Jump Trading, LLC.

Jump Trading Group is committed to world-class research. We empower exceptional talents in Mathematics, Physics, and Computer Science to seek scientific boundaries, push through them, and apply cutting-edge research to global financial markets. Our culture is unique, emphasizing constant innovation, fearlessness, creativity, intellectual honesty, and a relentless competitive streak. We believe in winning together and unlocking individual talent through collaboration and mutual respect. At Jump, research outcomes drive more than superior risk-adjusted returns; we design, develop, and deploy technologies that change our world, fund start-ups across industries, and partner with leading research organizations and universities to solve complex problems.

We are seeking an adaptable, hands-on individual who is passionate about managing Linux HPC environments at scale and eager to take on complex operational work as their primary role.

What You'll Do:
  • Provide front-line support for 24/7 Linux HPC compute, storage, and interconnects in an environment that spans RDMA fabrics, parallel filesystems, HPC batch schedulers, FUSE filesystems, internal Jump software, multi-vendor hardware, and cybersecurity requirements, and that serves users with high expectations.
  • Resolve problem reports and questions from Jump's research community, managing the full lifecycle of issues.
  • Respond promptly to alerts.
  • Participate in large-scale maintenance operations, including evenings and weekends.
  • Contribute to global infrastructure projects.
  • Develop scripts and code to diagnose, resolve, and automate tasks.
  • Collaborate across teams to develop and test code in multiple programming languages.
  • Manage vendor relationships, including travel for meetings.
  • Implement and support monitoring systems for performance and faults.
  • Develop and update documentation for systems and users.
  • Maintain tools for production environment support.
  • Provide operational support as a core responsibility.
  • Follow cybersecurity and IT policies, using only approved hardware and software.
  • Participate in an on-call rotation.
  • Perform other assigned tasks as needed.
  • Work from the office approximately 5 days a week and be available for maintenance windows on Friday evenings or Saturday mornings.

Skills You'll Need:
  • Interest in operational work as a primary role.
  • At least 2 years of professional Linux experience.
  • Experience with HPC components like parallel filesystems, batch systems, and network interconnects is a plus.
  • Proficiency in at least one programming or scripting language (e.g., Go, Python, C) with the ability to learn others quickly.
  • Strong root cause analysis skills.
  • Excellent communication skills, both verbal and written.
  • Strong collaboration skills and willingness to handle diverse tasks.
  • Ability to manage complex projects independently.
  • Sense of urgency and reliability.
  • Willingness to perform maintenance during evenings and weekends.
  • Ability to work effectively in a busy, open office environment.

Benefits:
  • Discretionary bonus eligibility
  • Medical, dental, and vision insurance
  • HSA, FSA, and Dependent Care options
  • Employer-paid group term life and AD&D insurance
  • Voluntary life and AD&D insurance
  • Paid vacation and holidays
  • Retirement plan with employer match
  • Paid parental leave
  • Wellness programs

Annual base salary range: $125,000 - $175,000 USD.

Date Posted: 17 May 2025