Skip to content

v1.5.0

Compare
Choose a tag to compare
@yosefe yosefe released this 14 Feb 10:53
· 40 commits to v1.5.x since this release
4185bbd

Features:

  • New emulation mode enabling full UCX functionality (Atomic, Put, Get)
    over TCP and RDMA-CORE interconnects which don't implement full RDMA semantics
  • Non-blocking API for all one-sided operations. All blocking communication APIs marked
    as deprecated
  • New client/server connection establishment API, which allows connected handover between workers
  • Support for rdma-core direct-verbs (DEVX) and DC with mlx5 transports
  • GPU - Support for stream API and receive side pipelining
  • Malloc hooks using binary instrumentation instead of symbol override
  • Statistics for UCT tag API
  • GPU-to-Infiniband HCA affinity support based on locality/distance (PCIe)

Bugfixes:

  • Fix overflow in RC/DC flush operations
  • Update description in SPEC file and README
  • Fix RoCE source port for dc_mlx5 flow control
  • Improve ucx_info help message
  • Fix segfault in UCP, due to int truncation in count_one_bits()
  • Multiple other bugfixes (full list on github)

Tested configurations:

  • InfiniBand: MLNX_OFED 4.4-4.5, distribution inbox drivers, rdma-core
  • CUDA: gdrcopy 1.2, cuda 9.1.85
  • XPMEM: 2.6.2
  • KNEM: 1.1.2