You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I did not manage to reproduce it thru lots of iterations, but I had to specify additional env var -x UCX_NET_DEVICES=mlx5_3:1 to get rid of #1534 symptoms. Thought we could try to reproduce it with original command line when #1534 is fixed.
OMPI version open-mpi/ompi@917d96b (compiled without debug)
UCX version 69545a1 (default configuration)
For OSHMEM with ConnectX-4 adapter for the following cmdline:
/ompi2/msg/bin/shmemrun -np 896 --mca coll '^hcoll' --mca pml ucx --mca spml ucx --mca mtl '^r2' --mca btl self --mca mpi_add_procs_cutoff 0 --mca pmix_base_async_modex true -x UCX_TLS=dc_x -x SHMEM_SYMMETRIC_HEAP_SIZE=2470M --map-by node hello_oshmem
I'm getting this backtrace:
The text was updated successfully, but these errors were encountered: