Skip to content

Performance Issues with MPIRun Due to Virtual Network Interfaces #13161

Open
@GeofferyGeng

Description

@GeofferyGeng

Thank you for taking the time to submit an issue!

Background information

What version of Open MPI are you using?

v4.1.7rc1

Describe how Open MPI was installed

installed by MLNX_OFED

Please describe the system on which you are running

  • Operating system/version: Ubuntu 22.04
  • Computer hardware: Intel(R) Xeon(R) Platinum 8480+
  • Network type: Eth and mellanox

Details of the problem

I have a server with a single network card that has virtualized over 200 network interfaces. This causes significant delays when using mpirun, as the process hangs for a long time. I used UCX debug and found that the delays are primarily occurring on the bridged network interface.

Is there a solution for this issue? Any recommendations on how to optimize or configure the network interfaces to improve the performance of mpirun? Thank you!

ucx log

[1742871925.548610] [pod-hpc-02:1702645:0]       tcp_iface.c:945  UCX  DEBUG filtered out bridge device virbr0
[1742872077.918760] [pod-hpc-02:1702645:0]       tcp_iface.c:945  UCX  DEBUG filtered out bridge device wlan

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions