Principal HPC Network Engineer (remote in the EU)

  • Full-time

Company Description

Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.


Mirantis serves many of the world’s leading enterprises, including Adobe, DocuSign, Liberty Mutual, PayPal, Reliance Jio, Societe Generale, Splunk, and Volkswagen. Learn more at www.mirantis.com.

Job Description

Role Overview:
We are seeking a highly skilled Senior HPC Networking Engineer to design, deploy, manage, and troubleshoot high-performance networking environments. The ideal candidate will have deep expertise in InfiniBand technologies, strong general networking knowledge, and hands-on experience with Fortinet solutions. You will play a critical role in ensuring the performance, reliability, and scalability of HPC infrastructure.

Key Responsibilities:

  • Design, deploy, and maintain high-performance network infrastructures for HPC environments, with a strong focus on InfiniBand fabrics.

  • Troubleshoot complex network issues across InfiniBand and Ethernet environments, ensuring minimal downtime and optimal performance.

  • Manage and optimize InfiniBand components, including switches, HCAs, subnet managers, and fabric configurations.

  • Perform performance tuning, monitoring, and capacity planning for HPC networking systems.

  • Implement and maintain network security using Fortinet solutions (FortiGate, FortiManager, FortiAnalyzer).

  • Diagnose and resolve issues related to routing, switching, latency, and throughput across hybrid network environments.

  • Collaborate with compute, storage, and platform teams to support HPC workloads and cluster operations.

  • Develop and maintain documentation for network architecture, configurations, and operational procedures.

  • Participate in on-call rotations and provide escalation support for critical incidents.

  • Lead or contribute to network upgrades, migrations, and new deployments.

 

Qualifications

Required:

  • 5+ years of experience in network engineering, with a focus on HPC or data center environments.

  • Strong hands-on experience with InfiniBand technologies (e.g., Mellanox/NVIDIA).

  • Solid understanding of networking fundamentals: TCP/IP, routing protocols (BGP, OSPF), VLANs, QoS, and network design.

  • Proven experience deploying and troubleshooting Fortinet solutions (FortiGate, FortiManager, VPNs, firewall policies).

  • Experience with network performance analysis and troubleshooting tools.

  • Familiarity with Linux systems and scripting for automation (e.g., Bash, Python).

  • Strong analytical and problem-solving skills.

Preferred:

  • Experience with large-scale HPC clusters or AI/ML infrastructure.

  • Knowledge of RDMA, MPI, and low-latency networking concepts.

  • Certifications such as FCSS/FCNSP (Fortinet), CCNP/CCIE, or equivalent.

  • Experience with automation and Infrastructure as Code tools (e.g., Ansible, Terraform).

Soft Skills:

  • Strong communication and collaboration skills.

  • Ability to work independently and handle complex technical challenges.

  • Detail-oriented with a proactive approach to problem-solving.

 

Additional Information

What We Offer:

  • Opportunity to work on cutting-edge HPC infrastructure.

  • Collaborative and innovative work environment.

  • Competitive salary and benefits package.

#Remote

We are a Leader for Container Management in G2 (#2 after AWS)!

By clicking the link above or any third-party link within this posting, you are leaving this site and going to a third-party website where the third-party website's terms and privacy policy apply

Privacy Notice