parttimejobs.work

HPC Network Engineering Manager - AI Infrastructure

EPAM Systems
Location 📍 Remote, Brazil
Posted 📅 June 07, 2026
Work Type ⏰ Full-time

Position Overview

We are seeking an HPC Network Engineering Manager - AI Infrastructure to guide architecture and technical direction for AI research and Kubernetes-based GPU infrastructure. You will set standards for InfiniBand/RDMA, Ethernet, Kubernetes networking, SmartNIC/DPU, and observability across large programs while mentoring senior engineers. Join us to shape reliable, scalable network platforms for massive distributed AI workloads. Apply now.

Responsibilities

  • Define and own a multi-year architectural vision and roadmap for InfiniBand/RDMA and high-speed Ethernet fabrics supporting massive GPU clusters and distributed AI/LLM workloads across the client portfolio
  • Govern the evaluation and standardization of cluster network topologies such as fat-tree, Clos, rail-optimized, and Dragonfly, and set decision frameworks aligned to scale, performance, and cost constraints
  • Establish and enforce engineering standards for host-sid...

Job Details

Employment Type ⏰ Full-time
Category 📊 Networks and systems
Work Arrangement 🏠 On-site
Location 📍 Remote, Brazil