UCT/MM: Replaced error value with debug log message for shm_unlink #9670
LGTM besides minor comment, pls verify it fixed the issue
@shasson5 please squash
src/uct/sm/mm/posix/mm_posix.c
if (status != UCS_OK) {
    goto err_close;
}
uct_posix_unlink(md, seg->seg_id, UCS_LOG_LEVEL_DEBUG);
UCS_LOG_LEVEL_DIAG
src/uct/sm/mm/posix/mm_posix.c
@@ -586,7 +584,7 @@ uct_posix_mem_alloc(uct_md_h tl_md, size_t *length_p, void **address_p,
 err_close:
     close(fd);
     if (!(seg->seg_id & UCT_POSIX_SEG_FLAG_PROCFS)) {
-        uct_posix_unlink(md, seg->seg_id);
+        uct_posix_unlink(md, seg->seg_id, UCS_LOG_LEVEL_ERROR);
UCS_LOG_LEVEL_WARN
What
Log a message instead of returning an error when uct_posix_unlink fails in uct_posix_mem_alloc.
Why?
Prevent UCX initialization from failing on a sporadic "No such file or directory" error when trying to delete a SHM file.
This PR fixes this issue:
https://redmine.mellanox.com/issues/3602767
The same error was also seen in:
https://redmine.mellanox.com/issues/3730544
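
For readers outside the codebase, the behavior change can be illustrated with a minimal standalone sketch. This is not the UCX implementation: every `sketch_*` name and the log-level enum are hypothetical stand-ins, and the flow (unlink the name right after a successful create, log quietly there, log more loudly on the error path) is an assumption based on the diff fragments in this review. The point it shows is that a failed `shm_unlink` is reported at a caller-chosen log level and never turned into an allocation failure.

```c
#define _POSIX_C_SOURCE 200809L

#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical log levels for this sketch (not the UCS enum).
 * Lower value means higher severity. */
typedef enum {
    SKETCH_LOG_ERROR,
    SKETCH_LOG_WARN,
    SKETCH_LOG_DEBUG
} sketch_log_level_t;

/* Messages less severe than this threshold are suppressed. */
static sketch_log_level_t sketch_log_threshold = SKETCH_LOG_WARN;

/* Remove a POSIX shared memory name. A failure is only logged at the
 * level chosen by the caller. The errno value (0 on success) is returned
 * for inspection, but callers are not expected to treat it as fatal. */
static int sketch_shm_unlink(const char *name, sketch_log_level_t level)
{
    int err;

    if (shm_unlink(name) == 0) {
        return 0;
    }

    err = errno;
    if (level <= sketch_log_threshold) {
        fprintf(stderr, "shm_unlink(%s) failed: %s\n", name, strerror(err));
    }
    return err;
}

/* Create a shared memory segment and drop its name immediately.
 * Returns an open file descriptor, or -1 if creation or sizing failed. */
static int sketch_create_segment(const char *name, size_t length)
{
    int fd = shm_open(name, O_CREAT | O_EXCL | O_RDWR, 0600);

    if (fd < 0) {
        return -1;
    }

    if (ftruncate(fd, (off_t)length) != 0) {
        /* Error path: clean up and report a failed unlink loudly. */
        close(fd);
        (void)sketch_shm_unlink(name, SKETCH_LOG_WARN);
        return -1;
    }

    /* Success path: a failed unlink (for example ENOENT because the file
     * is already gone) is logged quietly and does not fail the call. */
    (void)sketch_shm_unlink(name, SKETCH_LOG_DEBUG);
    return fd;
}
```

With this shape, a sporadic "No such file or directory" from the unlink step leaves the caller with a usable descriptor. Only `shm_open` or `ftruncate` failures abort the allocation. On older glibc versions the program may need to be linked with `-lrt`.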