Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GTEST/UD: Increase UD EP timeout when running under valgrind - v1.17.x #9913

Merged

Conversation

iyastreb
Copy link
Contributor

@iyastreb iyastreb commented May 30, 2024

This is double commit of #9880, into v1.17.x branch

Fix for RM#3918537

I managed to reproduce this issue on rock machines in 100% of the cases, but only when running this test under high CPU load. This CPU load I generate using dummy 64 processes (yes > /dev/null).
I checked ud_ep timeout logic, and it seems to work correctly. So the reasonable fix would be to increase UCX_UD_TIMEOUT (from 30s to 300s) when running under valgrind. With increased timeout the issue is not reproducible anymore, even with artificial CPU load

@tvegas1 tvegas1 changed the title GTEST/UD: Increase UD EP timeout when running under valgrind GTEST/UD: Increase UD EP timeout when running under valgrind - v1.17.x May 30, 2024
@yosefe yosefe enabled auto-merge May 30, 2024 07:48
@yosefe yosefe merged commit 9d59a08 into openucx:v1.17.x Jun 1, 2024
140 checks passed
@iyastreb iyastreb deleted the gtest/ud/valgrind-increase-timeout-1.17.x branch June 1, 2024 08:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants