Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixes for endpoint flush - v1.5.x #3064

Merged
merged 4 commits into from
Nov 28, 2018

Conversation

yosefe
Copy link
Contributor

@yosefe yosefe commented Nov 27, 2018

Picked from #3054
Fixes #3063

(#3058 was supposed to be for v1.5.x, but it was based on master by mistake)

We must not update cached tail in uct_mm_ep_flush() if there are any
pending elements. We may get new send resources but not use them, so
flush could return UCS_OK while there pending requests.

Fixes openucx#3052
- Ignore remote completions in case of forced flush (CLOSE_MODE_CANCEL)

- The UCP_EP_FLAG_FLUSH_STATE_VALID flag can't be used because it's not
  valid in release mode. Instead, use the UCP_EP_FLAG_DEST_EP flag as an
  indirect indication that we may have some operations which wait for
  software rma/amo completion.

- Fix missing initializion of flush state in case of client/server
  connection establishment with p2p lanes and *without* pre-request.
@yosefe yosefe added this to the v1.5.0 milestone Nov 27, 2018
@yosefe yosefe changed the title Topic/flush fixes v1.5.x 2 Fixes for endpoint flush - v1.5.x Nov 27, 2018
@swx-jenkins1
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ucx-pr/5654/ for details.

@mellanox-github
Copy link
Contributor

Test FAILed.
See http://hpc-master.lab.mtl.com:8080/job/hpc-ucx-pr/8329/ for details (Mellanox internal link).

@mellanox-github
Copy link
Contributor

Test FAILed.
See http://hpc-master.lab.mtl.com:8080/job/hpc-ucx-pr/8331/ for details (Mellanox internal link).

@mellanox-github
Copy link
Contributor

Test FAILed.
See http://hpc-master.lab.mtl.com:8080/job/hpc-ucx-pr/8332/ for details (Mellanox internal link).

@swx-jenkins1
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ucx-pr/5656/ for details.

@mellanox-github
Copy link
Contributor

Test PASSed.
See http://hpc-master.lab.mtl.com:8080/job/hpc-ucx-pr/8333/ for details (Mellanox internal link).

@yosefe yosefe merged commit 02078b9 into openucx:v1.5.x Nov 28, 2018
@yosefe yosefe deleted the topic/flush-fixes-v1.5.x-2 branch November 28, 2018 13:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants