Event Notification Extension #18

rhc54 · 2017-03-24T15:22:40Z

Signed-off-by: Ralph Castain rhc@open-mpi.org

rhc54 · 2017-03-27T17:10:35Z

@dsolt @jjhursey @abouteiller Please comment - I'd like some feedback before doing the prototype implementation

jjhursey · 2017-03-29T20:19:04Z

I think this sounds ok.

One of the points made was the need for events to be sent between threads within a process. That seems difficult to manage since we would have to track which threadid registered the request, and enforce that only that threadid receive the notification. And multiple threads could be registered for the same event so that would need to be tracked.

That seems like a lot of bookkeeping to me, but maybe I'm thinking about it wrong.

rhc54 · 2017-03-30T01:32:56Z

I agree about tracking at a thread level - probably outside where we want to go, and I should clarify the comment. My intent was only to support multiple registrations against the same code or group of codes. It would be up to the user to ensure that each thread registered a different callback function.

This was the mechanism, when combined with a new data range of "proc_local", I had envisioned for someone to "notify" a separate thread of an event such as declaration of a programming model.

abouteiller · 2017-03-30T02:14:03Z

RFC0018.md

+None of these require modification of existing PMIx APIs, nor addition of new ones. Instead, all can be supported by adding attributes to direct the behavior of the existing event registration/notification functions.
+
+#### Event registration extensions
+The primary need here is for attributes indicating desired ordering of the event handler being registered vs other handlers that have already been registered or will subsequently be registered. In both cases, it is necessary that the user provide something to identify each handler so the relative position can later be specified - this is to be done via the existing _PMIX\_EVENT\_HDLR\_NAME_ attribute. In addition, the existing _PMIX\_EVENT\_ORDER\_PREPEND_ attribute directs PMIx to prepend the handler being registered to the front of the chain, as opposed to the default append behavior.


Shouldn't the default behavior be to prepend? In a typical use case, the application will initialize libraries in a "more generic to more specific" order, meaning that the last registered handler is the one that is the most "specialized". So it most often should have precedence at trying to "fix the problem", before passing the hand to more generic handlers.

I understand that this additional PR greatly alleviates this concern, but it still seems more logical in that direction to me.

I was reflecting the current default behavior, but I am open to modifying that if desired. Let's discuss at the meeting and see what people think.

abouteiller · 2017-03-30T02:18:59Z

RFC0018.md

+
+* PMIX\_EVENT\_HDLR\_BEFORE - put this event handler immediately before the one specified in the (char*) value. The named event handler must be in the same category (single, multi, or default) as the one being registered. An error will be returned if the named event handler is not found.
+
+* PMIX\_EVENT\_HDLR\_AFTER  - put this event handler immediately after the one specified in the (char*) value. The named event handler must be in the same category (single, multi, or default) as the one being registered. An error will be returned if the named event handler is not found.


I do not remember if registering twice an handler is already illegal or not. If it is not, it certainly should become now, as otherwise the before/after relation is ill defined.

The before/after relation is only defined for the specific registration request that contains it. So if you register another handler for the same code, then it must include its own relational directives. I will clarify, however, that you cannot register more than one handler with the same string identifier.

abouteiller · 2017-03-30T02:28:21Z

RFC0018.md

+#### Event notification extensions
+Two changes are proposed in this area, although only one actually impacts the standards header file by adding the following attribute:
+
+* PMIX\_EVENT\_NO\_TERMINATION - directs that all subsequent steps in the event handler chain must not call for termination of the application. Any other operations are permitted.


I think the logic is reverse here:
lets consider a chain A B C D(default)

We call A(status, attr={})
if A can handle the issue reported by status it changes the attribute to NO_TERMINATION.

We then call B(status, attr={NO_TERMINATION})
B cannot handle the condition (it indicates a fatal error of some sort that cannot be corrected), with these definitions, B cannot reset to terminate.

I propose the opposite logic:

A(status, attr={DEFAULT_TERMINATION})
A is happy, so it sets NO_TERMINATION

B(status, attr={NO_TERMINATION})
B is unhappy, so it sets TERMINATION

C(status, attr={WANT_TERMINATION})
C is happy or unhappy, doesn't matter, WANT_TERMINATION cannot be overridden

D is default, sees WANT_TERMINATION, terminates.

Now, if all handlers are happy, we'd reach
D(status, NO_TERMINATION)

So basically what you are proposing is that we allow each handler to modify the collected array of results, as opposed to only adding to them? We would definitely have to define which attributes can be overwritten, but implementation would be simple and not require an API change, if that's what people want to do. Again, worth bringing up in the meeting.

rhc54 · 2017-03-30T17:35:19Z

@abouteiller @jjhursey I have updated the RFC to reflect your comments - please review

jjhursey

This looks good. I still think we need to be careful about wording around delivering an event to a specific thread. That requires tracking of thread ids and making sure we deliver the event in the context of the same thread. I'd like to make that the caller's problem to solve, not PMIx. I think the wording in here is appropriately flexible.

Signed-off-by: Ralph Castain <rhc@open-mpi.org>

Approved Signed-off-by: Ralph Castain <rhc@open-mpi.org>

Return an error if someone asks to register an event handler before/after another handler that is not yet known. Signed-off-by: Ralph Castain <rhc@open-mpi.org>

rhc54 added IN PROGRESS ATTRIBUTES labels Mar 24, 2017

rhc54 added this to the v2.0 milestone Mar 24, 2017

rhc54 mentioned this pull request Mar 29, 2017

Coordination Across Programming Models (OpenMP/MPI) #17

Merged

abouteiller requested changes Mar 30, 2017

View reviewed changes

abouteiller approved these changes Mar 30, 2017

View reviewed changes

jjhursey approved these changes Mar 30, 2017

View reviewed changes

rhc54 pushed a commit to rhc54/openpmix that referenced this pull request Apr 2, 2017

Implement the event notification extension RFC (pmix/RFCs#18)

c442ba8

Signed-off-by: Ralph Castain <rhc@open-mpi.org>

rhc54 mentioned this pull request Apr 2, 2017

Implement the event notification extension RFC openpmix/openpmix#344

Merged

rhc54 added SUBMITTED and removed IN PROGRESS labels Apr 2, 2017

rhc54 pushed a commit to rhc54/openpmix that referenced this pull request Apr 3, 2017

Implement the event notification extension RFC (pmix/RFCs#18)

88875f0

Signed-off-by: Ralph Castain <rhc@open-mpi.org>

Extend the event notification system to meet evolving needs

814bfb9

Approved Signed-off-by: Ralph Castain <rhc@open-mpi.org>

rhc54 merged commit 89fbe44 into pmix:master Apr 6, 2017

rhc54 deleted the rfc/eventext branch April 6, 2017 20:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Event Notification Extension #18

Event Notification Extension #18

rhc54 commented Mar 24, 2017

rhc54 commented Mar 27, 2017

jjhursey commented Mar 29, 2017

rhc54 commented Mar 30, 2017

abouteiller Mar 30, 2017

rhc54 Mar 30, 2017

abouteiller Mar 30, 2017

rhc54 Mar 30, 2017

abouteiller Mar 30, 2017

rhc54 Mar 30, 2017

rhc54 commented Mar 30, 2017

jjhursey left a comment


		* PMIX\_EVENT\_HDLR\_BEFORE - put this event handler immediately before the one specified in the (char*) value. The named event handler must be in the same category (single, multi, or default) as the one being registered. An error will be returned if the named event handler is not found.

		* PMIX\_EVENT\_HDLR\_AFTER - put this event handler immediately after the one specified in the (char*) value. The named event handler must be in the same category (single, multi, or default) as the one being registered. An error will be returned if the named event handler is not found.

Event Notification Extension #18

Event Notification Extension #18

Conversation

rhc54 commented Mar 24, 2017

rhc54 commented Mar 27, 2017

jjhursey commented Mar 29, 2017

rhc54 commented Mar 30, 2017

abouteiller Mar 30, 2017

Choose a reason for hiding this comment

rhc54 Mar 30, 2017

Choose a reason for hiding this comment

abouteiller Mar 30, 2017

Choose a reason for hiding this comment

rhc54 Mar 30, 2017

Choose a reason for hiding this comment

abouteiller Mar 30, 2017

Choose a reason for hiding this comment

rhc54 Mar 30, 2017

Choose a reason for hiding this comment

rhc54 commented Mar 30, 2017

jjhursey left a comment

Choose a reason for hiding this comment