Job Description
Job Summary
We are seeking a highly skilled InfiniBand Engineer with strong expertise in advanced networking technologies to design, deploy, and support high-performance, low-latency network infrastructures. The ideal candidate will have hands-on experience with InfiniBand fabrics, data center networking, and large-scale distributed computing environments (HPC / AI / ML clusters).
Key Responsibilities
- Design, implement, and manage large-scale InfiniBand (IB) fabrics in data center and HPC environments.
- Configure and troubleshoot InfiniBand switches and adapters (e.G., Mellanox / NVIDIA IB platforms).
- Perform fabric bring-up, subnet management (OpenSM), partitioning, and performance tuning.
- Monitor and optimize network performance, latency, throughput, and congestion control.
- Integrate InfiniBand with Ethernet-based networking environments.
- Support RDMA technologies (RoCE, iWARP) and GPUDirect ...
Ready to Apply?
Take the next step in your AI career. Submit your application to Aptly Technology Corporation today.
Submit Application