Senior HPC Infrastructure Engineer

4714745
  • Job type

    Permanent
  • Location

    Hampshire
  • Working Pattern

    Full-time
  • Specialism

    Infrastructure
  • Industry

    Technology & Internet Services
  • Pay

    £130,000

Senior HPC Infrastructure engineer | Fully Remote | Great benefits | £130,000

Your new company

I’ve partnered exclusively with a pioneering company that’s shaping the future of cloud infrastructure. Their innovative, high-performance, GPU-optimised platform is driving advancements in AI and HPC, while also championing sustainability for a greener, more efficient world.
This role is fully remote, with no expectation to ever be in an office. You’ll also enjoy the fantastic perk of unlimited holiday, giving you the freedom to recharge and thrive.

Your new role

This is a hands-on, fully remote role focused on designing and delivering high-performance computing (HPC) clusters. You’ll lead end-to-end architecture and deployment projects, working closely with internal teams and external suppliers to build scalable, GPU-optimised environments. From planning hardware and data centre requirements to configuring networks, storage, and compute management software, you’ll be at the heart of technical delivery. The role also involves supporting service teams with escalations, collaborating with software engineers to enhance platform capabilities, and staying up to date with the latest in HPC hardware. It’s a great opportunity for someone who thrives in project-led infrastructure work and wants to help shape cutting-edge HPC solutions.


What you'll need to succeed

  • Slurm: Proven experience managing and tuning HPC job schedulers.
  • Infiniband and RoCE: Deep knowledge of high-speed networking technologies.
  • Ansible: Proficiency in using Ansible for automation and configuration management.
  • Networking: Strong networking fundamentals, ideally with experience in complex environments.
  • Data Centre Infrastructure: Familiarity with planning and supporting power, cooling, and rack layouts.
  • Cluster Deployment: End-to-end experience deploying and scaling HPC clusters.
  • Server Architecture: Understanding of GPU-optimised server hardware and operating systems.
  • Scripting & Automation: Comfortable scripting in Bash, Python, or similar for deployment and maintenance tasks.

What you'll get in return

  • Share options.
  • Unlimited holiday policy.
  • 100% Remote working.
  • Fantastic opportunities to develop - they make a habit of promoting in-house.
  • A great team with a passion for working collaboratively.
  • Enhanced family-friendly policies.
  • A truly flexible workplace!

What you need to do now

If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.

If this job isn't quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.

Apply for this job

Talk to Jacob Clift, the specialist consultant managing this position

Located in Southampton, 3rd Floor, One Dorset Street, SouthamptonTelephone 023 82 020 113
Click here to access our Privacy Policy, which provides detailed information on how we use and protect your personal information, and your rights in relation to this.