Skip to content

WeeklyTelcon_20191210

Geoffrey Paulsen edited this page Dec 16, 2019 · 1 revision

Open MPI Weekly Telecon


  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees (on Web-ex)

  • Geoffrey Paulsen (IBM)
  • William Zhang (AWS)
  • Austen Lauria (IBM)
  • Brian Barrett (AWS)
  • Josh Hursey (IBM)
  • Brendan Cunningham (Intel)
  • Michael Heinz (Intel)
  • Todd Kordenbrock (Sandia)

not there today (I keep this for easy cut-n-paste for future notes)

  • Noah Evans (Sandia)
  • Akshay Venkatesh (NVIDIA)
  • Edgar Gabriel (UH)
  • Harumi Kuno (HPE)
  • Howard Pritchard (LANL)
  • Matthew Dosanjh (Sandia)
  • Thomas Naughton (ORNL)
  • Artem Polyakov (Mellanox)
  • Jeff Squyres (Cisco)
  • George Bosilca (UTK)
  • David Bernhold (ORNL)
  • Brandon Yates (Intel)
  • Charles Shereda (LLNL)
  • Erik Zeiske
  • Joshua Ladd (Mellanox)
  • Mark Allen (IBM)
  • Matias Cabral (Intel)
  • Nathan Hjelm (Google)
  • Ralph Castain (Intel)
  • Xin Zhao (Mellanox)
  • mohan (AWS)

Release Branches

Review v3.0.x Milestones v3.0.4

Review v3.1.x Milestones v3.1.4

  • 3.0.5 and 3.1.5 have shipped
  • Planning for no new fixes on 3.x, unless super critical
  • BUT, looks like something was messed up with 3.1.5, not sure about 3.0.x branch
    • Brian will read up on the issue and see if we need to release to address.
    • May be just an issue with Fedora / RHEL 7.8 that we don't see it on earlier RHEL.

Review v4.0.x Milestones v4.0.3

  • v4.0.3 in the works.
    • Schedule: End of january.
    • There's a problem in Open MPI v4.0.2, that packagers will hit in UCX 1.7
      • PR 1752 may drive an earlier release in case if UCX will be released sooner.
  • PR 7116
    • Ensure no backwards compat issues?
    • Howard will send email to ARM.
  • PR 7149 - Geoff go look at.

Do we want a v4.1.x release?

  • A few new enhancements desirable.
  • Added a Target v4.1.x label
    • Many new enhancements / features would be useful
    • 7151 - This is indeed a performance enhancement.
    • 7173
    • Should look into amount of work back-porting features to a release branch.
    • It would be a major thing. But always say we don't take features into release branch thats out there.
      • people continue to open PRs with features.
    • Two issues:
      • One - we've really stalled out v5.0.0
      • Two - are performance features really an issue to pull in?
        • PR 7151 - seems to be boarderline bugfix / feature / risky
  • PR 7151 - enhancement -

v5.0.0

  • Schedule: April 2020?
    • Wiki - go look at items, and we should discuss a bit in weekly calls.
    • Some items:
      • MPI1 removed stuff.

New Business

Issue 7220

Target labels on PRs. just one for branch going into

PPRTE discussion at super computing:

  • Probably should get down to supporting only one runtime.
  • Josh, Ralph, Jeff, Brian , and Tom
  • Met one day to talk about PRRTE / ORTE and what to do.
  • PRRTE probably makes the most sense
    • git submodules much better than subversion external modules.
    • Being part of the OMPI package is limiting.
    • Boxes in the Runtime to prevent ORTE from taking off on it's own.
    • Not a huge operation.
      • PMIx would be a first class citizen
      • Still bundle PRRTE in tarballs, so could launch over ssh.
      • Have to add additional Nightly testing to catch issues.
    • Talked about not being a bash script.
    • Ralph said he had most of this working on a branch.
  • PRRTE only has external hwloc, pmix, and libevent.
    • If you pull this in, will need to build PRRTE with the internal versions of
    • May accelerate need to kill off internals in Open-MPI to simplify things.
  • Release tarballs;
    • Still drop these into tarball for conveience?
    • Should discuss, perhaps a version of the tarball that has everything?
  • Possibly do a survey again, to just have everything external?
  • PRRTE Testing
    • Can develop some PMIx Unit test(s) for PMIx library and for Resource managers
      • To mimic the way that Open MPI uses PMIx.
      • PMIx acceptance tests in Open MPI project
    • Currently don't have much Runtime tests.
      • Mapping, binding, output filename, etc.
      • Use these tests to

PRRTE with/without Open-MPI was discussed at PMIx BOF

  • Questions, and discussion. Interested.

Face to face

  • It's official! Portland Oregon, Feb 17, 2020.
    • Safe to begin booking travel now.
  • Please register on Wiki page, since Jeff has to register you.
  • Date looks good. Feb 17th right before MPI Forum
    • 2pm monday, and maybe most of Tuesday
    • Cisco has a portland facility and is happy to host.
    • about 20-30 min drive from MPI Forum, will probably need a car.

Infrastrastructure

Review Master Master Pull Requests

CI status

  • IBM's PGI test has NEVER worked. Is it a real issue or local to IBM.
    • Austen is looking into
  • Absoft 32bit fortran failures.

Depdendancies

PMIx Update

ORTE/PRRTE

  • No discussion this week.

MTT


Back to 2019 WeeklyTelcon-2019

Clone this wiki locally