You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This issue was resolved by using solutions mentioned in #3972, #3573.
After setting the following options, it works now
--mca btl_openib_cuda_async_recv false --mca btl_openib_receive_queues P,256,256:S,128,256,192,128:S,2048,1024,1008,64:S,12288,1024,1008,64:S,131072,1024,1008,64
Thank you for taking the time to submit an issue!
Background information
What version of Open MPI are you using? (e.g., v1.10.3, v2.1.0, git branch name and hash, etc.)
3.1.1 (also in 3.0.0)
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
open mpi was built using 3.1.1 tarball.
cuda: 9.2
Please describe the system on which you are running
Details of the problem
The test i ran is osu_bcast from osu-micro-benchmarks-5.4.3.tar.gz, built with cuda. The same test case works under mvpich 2.3 with cuda.
the following is the cmd line and hang stack
The text was updated successfully, but these errors were encountered: