RMA - standard compliance and empty groups #9540

Closed
thomasgillis opened this issue Oct 14, 2021 · 4 comments

thomasgillis commented Oct 14, 2021

My issue (which is more of a question) is twofold and originates from the small reproducer osc.cpp, which fails with Open MPI.

  1. I think it relates to issue #8677 (Regression: MPI_Win_start()/complete() with MPI_GROUP_EMPTY), which has since been fixed and merged (cf. empty.cpp in the reproducers). Do you have any idea of the release timeline for 4.1.2, which I guess will contain the fix?

  2. On some machines, the reproducer osc.cpp still fails with various MPI implementations, while it passes with others (like pt2pt). To me this raises the question of whether the code complies with the MPI Standard. Would it be possible for an OSC expert to take a look and confirm that it does? In particular, I am unsure about two points: freeing the datatype right after the MPI_Get, and a single rank being both target and origin of RMA calls at the same time.

Thank you very much for your time and your help!

devreal (Contributor) commented Oct 18, 2021

I can confirm that osc.cpp hangs with the osc/ucx component. @janjust Any chance you can take a look at it?

hppritcha (Member) commented

I have a fix in the works.

jsquyres assigned hppritcha and unassigned janjust Oct 19, 2021
jsquyres added this to the v4.1.2 milestone Oct 19, 2021
gpaulsen (Member) commented

@devreal, Howard said he's working on a fix now

hppritcha added a commit to hppritcha/ompi that referenced this issue Oct 19, 2021
related to open-mpi#9540

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
hppritcha added a commit to hppritcha/ompi that referenced this issue Oct 19, 2021
related to open-mpi#9540

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
hppritcha added a commit to hppritcha/ompi that referenced this issue Oct 25, 2021
related to open-mpi#9540

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
(cherry picked from commit e20108c)
hppritcha added a commit to hppritcha/ompi that referenced this issue Oct 26, 2021
related to open-mpi#9540

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
(cherry picked from commit e20108c)
hppritcha added a commit to hppritcha/ompi that referenced this issue Oct 26, 2021
related to open-mpi#9540

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
(cherry picked from commit e20108c)
hppritcha (Member) commented

Closed via #9556 and related PRs to the release branches.
