Skip to content
This repository has been archived by the owner on Sep 30, 2022. It is now read-only.

Add new CLI options: timeout, get-stack-traces, report-state-on-timeout #1317

Merged
merged 2 commits into from
Sep 14, 2016
Merged

Add new CLI options: timeout, get-stack-traces, report-state-on-timeout #1317

merged 2 commits into from
Sep 14, 2016

Conversation

jjhursey
Copy link
Member

Bring timeout feature introduced in open-mpi/ompi#1724 to the 2.x series. Target is 2.1.0 per the discussion on the original PR.

This required a bit of a custom patch due to architecture differences on master vs the 2.x series. I kept this as two commits to show what I changed. These should be squashed into commit once reviewed.

Three options introduced:

  • --timeout
  • --get-stack-traces
  • --report-state-on-timeout

bot:assign: @rhc54
bot:milestone:v2.1.0
bot:label:enhancement

@mellanox-github
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/2083/ for details.

@mellanox-github
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/2084/ for details.

@rhc54
Copy link

rhc54 commented Aug 16, 2016

👍

@jjhursey
Copy link
Member Author

Would you prefer to have these squashed to 1 commit, or keep them as two (first original from master and second fixes to make it work for v2.x)?

@jsquyres
Copy link
Member

@jjhursey Thanks for asking. I like the trackability of the 2 commits: one directly traceable back to master is a Good Thing.

@jjhursey
Copy link
Member Author

Ok - That sounds fine to me. So we'll leave it as two commits. I think this PR is ready to go once the RMs are ready for it.

Ralph Castain and others added 2 commits August 25, 2016 09:12
… timeout to assist with debugging Jenkins tests

If requested, obtain stacktraces for each application process and report it to stderr upon timeout

stack traces: minor improvements

- Also include the hostname and PID of the each process for which
  we're sending the stack traces (vs. just including the ORTE process
  name)
- Send a specific error message if we couldn't find "gstack" in the
  $PATH (e.g., on OS X)
- Send a sepcific error message if gstack fails to run
- Print a message that obtaining the stack traces may take a few
  seconds so that users don't wonder what's happening

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>

help-orterun.txt: minor tweaks

Trivial update: show "--timeout" (instead of "-timeout") in the help
message, just to encourage the use of double-dash options.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>

trivial: stacktrace -> stack trace

Trivial word smything.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit ebe159a)

-- Josh Hursey : Had to remove 2 conflict, which will be added back shortly
 * orte/orted/orted_submit.c
 * orte/orted/orted_submit.h
Three options introduced:
 * --timeout 2
 * --get-stack-traces
 * --report-state-on-timeout
@mellanox-github
Copy link

Test PASSed.
See http://bgate.mellanox.com/jenkins/job/gh-ompi-release-pr/2127/ for details.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants