
WeeklyTelcon_20210406

Geoffrey Paulsen edited this page Apr 7, 2021 · 1 revision

Open MPI Weekly Telecon

  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees (on Web-ex)

  • Austen Lauria (IBM)
  • Brendan Cunningham (Cornelis Networks)
  • Brian Barrett (AWS)
  • Christoph Niethammer (HLRS)
  • David Bernhold (ORNL)
  • Edgar Gabriel (UH)
  • Geoffrey Paulsen (IBM)
  • George Bosilca (UTK)
  • Harumi Kuno (HPE)
  • Hessam Mirsadeghi (UCX/nVidia)
  • Jeff Squyres (Cisco)
  • Joseph Schuchart
  • Josh Hursey (IBM)
  • Marisa Roman (Cornelis Networks)
  • Matthew Dosanjh (Sandia)
  • Michael Heinz (Cornelis Networks)
  • Naughton III, Thomas (ORNL)
  • Raghu Raja (AWS)
  • William Zhang (AWS)

Not there today (I keep this for easy cut-n-paste for future notes)

  • Akshay Venkatesh (NVIDIA)
  • Artem Polyakov (nVidia/Mellanox)
  • Aurelien Bouteiller (UTK)
  • Brandon Yates (Intel)
  • Charles Shereda (LLNL)
  • Erik Zeiske
  • Geoffroy Vallee (ARM)
  • Howard Pritchard (LANL)
  • Joshua Ladd (nVidia/Mellanox)
  • Mark Allen (IBM)
  • Matias Cabral (Intel)
  • Nathan Hjelm (Google)
  • Noah Evans (Sandia)
  • Ralph Castain (Intel)
  • Scott Breyer (Sandia?)
  • Shintaro Iwasaki
  • Todd Kordenbrock (Sandia)
  • Tomislav Janjusic
  • Xin Zhao (nVidia/Mellanox)

New Items

  • master /
  • AWS can install CUDA and prevent build breakage.
  • And ask nVidia/Mellanox to add this to their MTT.

v4.1

  • MPICH datatype stuff that came in from IBM

Reformatting master

  • Sessions branch is pretty big, but Howard wants to wait until v5.0.0 has been released for a while.
    • So the plan was to hold off on the rest of the formatting until sessions is rebased, and then format master.
    • Howard is having a few more issues on sessions, so he is okay with us reformatting.
  • Reformatted opal, but still more reformatting to be done on master/v5.0.x
  • But reformatting v4.0.x and v4.1.x seems too risky.
  • clang-format instructions are in the format file.
  • Also ran clang-tidy, but we don't have directions for that yet.
  • Requires clang-format v11 or later (the clang-format version can differ from the clang compiler version).
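
A minimal sketch of how a contributor might check the clang-format version requirement noted above before reformatting. The function names and the MIN_MAJOR constant are hypothetical and not part of the Open MPI tree; the only external behavior assumed is that `clang-format --version` prints a string containing "version <major>.<minor>".

```python
import re
import shutil
import subprocess

# Minimum clang-format major version taken from the notes above.
MIN_MAJOR = 11


def parse_major_version(version_output):
    """Extract the major version from `clang-format --version` output.

    Returns None if no version number can be found.
    """
    match = re.search(r"version\s+(\d+)\.(\d+)", version_output)
    return int(match.group(1)) if match else None


def clang_format_is_new_enough(executable="clang-format"):
    """Return True if the given clang-format reports at least MIN_MAJOR."""
    path = shutil.which(executable)
    if path is None:
        # Not installed, or not on PATH.
        return False
    result = subprocess.run(
        [path, "--version"], capture_output=True, text=True, check=False
    )
    major = parse_major_version(result.stdout)
    return major is not None and major >= MIN_MAJOR


if __name__ == "__main__":
    if clang_format_is_new_enough():
        print("clang-format OK")
    else:
        print("clang-format missing or older than v11")
```

Because the check looks only at the clang-format binary, it works regardless of which clang compiler version (if any) is installed.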

32bit? Do we want to continue to support this?

  • Require a C11 compiler to support 32bit platforms.
  • Debian is the only Linux distro that supports 32bit.
    • Can be done in PMIx or PRRTE if desirable.
    • 32bit atomics stay, because we still support 32bit datatypes.
    • 32bit only architectures are removed.

4.0.x

  • blocking on UCX issues (see New topics above)
    • George will get to it soon.
  • Too many Open Issues (50)
    • Geoff and Howard will continue to go over v4.0.x issues, and try to close or address many of them.
      • May need to label some as wont_fix, and then close

v4.1.x

  • Same boat, waiting for George's datatype fix.

PRRTE v2.0

  • Any blockers please bring to Josh's attention.

Some outstanding work for the way that OMPI calls PRRTE configure.

  • Also some changes with libcurl, especially since this breaks the OMPI build.
    • PMIx can interface with REST interfaces (used by libcurl)
    • JSON
  • Trying to do an RC this or next week.
  • Read the Docs/Sphinx - Come up with a plan with Jeff.
  • OMPI docs and manpages, but there is a persistent problem that mpirun is really prrterun.
  • Docs might not be ready, but perhaps a cheat-sheet.
  • Packagers need some advice, and need a README, a few more weeks at minimum.
  • We could roll it as an rc, or as an alpha.
  • NOT ready for anything close to an actual release candidate.

MTT

  • Jeff will discuss upgrading gcc with Absoft (need a C11 compiler for 32bit support)

Open-MPI v5.0

  • PMIx and PRRTE are close to a release candidate.
    • This week (first full week of April)
  • What do we do with the mpirun manpage?
    • Didn't want OMPI requiring Sphinx, but if PRRTE and PMIx in same tar
  • Ralph almost has singleton comm spawn working
    • Single node without the mpirun process

Video Presentation BOF

  • ECP talk went well.
  • Some opinions about reStructuredText vs Markdown.
    • We're sticking with reStructuredText.

Longer Term discussions

Doc update

  • PR 8329 - convert README, HACKING, and possibly manpages to reStructuredText.
    • Uses https://www.sphinx-doc.org/en/master/ (Python tool, can pip install)
    • Intent is for this to go into v5.0
      • mpirun / prrterun - we had quite a bit of detail in ORTE, but are updating as much as possible.
    • Ralph has asked about this for PMIx/PRRTE since this is turning out to work
  • No update - 3/16
    • Could be independent of PMIx and PRRTE.
    • PMIx and PRRTE want to follow suit, and not require both pandoc and Sphinx.