Skip to content

Commit

Permalink
Merge pull request #4589 from brminich/topic/ucp_tag_offload_bb_cuda
Browse files Browse the repository at this point in the history
UCP/TAG: Tag offload bounce buffer optimization
  • Loading branch information
yosefe authored Dec 20, 2019
2 parents 9e26ce0 + b4cbd74 commit 9a72ae3
Showing 1 changed file with 4 additions and 1 deletion.
5 changes: 4 additions & 1 deletion src/ucp/tag/offload.c
Original file line number Diff line number Diff line change
Expand Up @@ -256,7 +256,10 @@ ucp_tag_offload_do_post(ucp_request_t *req)

mdi = context->tl_rscs[wiface->rsc_index].md_index;

if (ucs_unlikely(length >= worker->tm.offload.zcopy_thresh)) {
/* Do not use bounce buffer for receives to GPU memory to avoid
* cost of h2d transfers (i.e. cuda_copy from staging to dest memory). */
if ((length >= worker->tm.offload.zcopy_thresh) ||
!UCP_MEM_IS_ACCESSIBLE_FROM_CPU(req->recv.mem_type)) {
if (length > wiface->attr.cap.tag.recv.max_zcopy) {
/* Post maximum allowed length. If sender sends smaller message
* (which is allowed per MPI standard), max recv should fit it.
Expand Down

0 comments on commit 9a72ae3

Please sign in to comment.