We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Command to reproduce:
mpirun -x -np 2496 -mca btl_openib_warn_default_gid_prefix 0 --bind-to core --tag-output --timestamp-output --display-map -mca pml ucx -x UCX_NET_DEVICES=mlx5_0:1 -mca btl_openib_if_include mlx5_0:1 -mca coll_hcoll_enable 0 -x UCX_TLS=rc,sm -mca opal_pmix_base_async_modex 0 -mca mpi_add_procs_cutoff 100000 --map-by node /hpc/scrap/users/mtt/scratch/ucx_ompi/20170510_132009_18827_734706_clx-hercules-113/installs/50e4/tests/mpich_tests/mpich-mellanox.git/test/mpi/pt2pt/probe-unexp
Reproducibility ~50% with rc and rc_x, ud works fine.
Most of processes (~2461) are in MPIx_Fence, others (~35) are in mca_pml_ucx_waitall.
@alinask
The text was updated successfully, but these errors were encountered:
may be related to #1512 and #1513
Sorry, something went wrong.
UCT: fix openucx#1502, openucx#1513
eae599d
- Fix hang in MPI_Finalize with UCX_TLS=rc[_x],sm
7bc4db7
29d86d1
d2a722a
Merge pull request #1532 from evgeny-leksikov/rc_hang_fin
c8f891f
UCT: fix #1502, #1513
Merge pull request #1541 from evgeny-leksikov/v12_port
edca6c0
UCT: fix #1502, #1513 (backport to v1.2)
evgeny-leksikov
No branches or pull requests
Command to reproduce:
Reproducibility ~50% with rc and rc_x, ud works fine.
Most of processes (~2461) are in MPIx_Fence, others (~35) are in mca_pml_ucx_waitall.
@alinask
The text was updated successfully, but these errors were encountered: