
WeeklyTelcon_20200303

Geoffrey Paulsen edited this page Mar 4, 2020 · 1 revision

Open MPI Weekly Telecon

  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees (on Web-ex)

  • Geoffrey Paulsen (IBM)
  • Howard Pritchard (LANL)
  • Jeff Squyres (Cisco)
  • Artem Polyakov (Mellanox)
  • Edgar Gabriel (UH)
  • Austen Lauria (IBM)
  • Harumi Kuno (HPE)
  • Joseph Schuchart
  • Josh Hursey (IBM)
  • Joshua Ladd (Mellanox)
  • Michael Heinz (Intel)
  • Noah Evans (Sandia)
  • Ralph Castain (Intel)
  • sbreyer
  • Todd Kordenbrock (Sandia)
  • William Zhang (AWS)

not there today (I keep this for easy cut-n-paste for future notes)

  • Brendan Cunningham (Intel)
  • Akshay Venkatesh (NVIDIA)
  • Thomas Naughton (ORNL)
  • Scott Breyer (Sandia?)
  • Nathan Hjelm (Google)
  • Charles Shereda (LLNL)
  • David Bernhold (ORNL)
  • George Bosilca (UTK)
  • Matthew Dosanjh (Sandia)
  • Brandon Yates (Intel)
  • Erik Zeiske
  • Mark Allen (IBM)
  • Matias Cabral (Intel)
  • Xin Zhao (Mellanox)
  • mohan (AWS)

Old Business

  • MTT -

    • If you change your MTT setup to start PRRTE at the beginning of the session and then just use prun, you can see times cut in half or more.
    • This is good, but we also need to test the mpirun wrapper.
    • Cisco is converting half of its MPI installs to use prrte/prun.
  • PRRTE Transition:

  • OMPI master submodule pointers are set up to track PMIx and PRRTE master.

    • Hopefully long term, master can track release branches.
    • But still ensure there's some regression tracking of master/master/master.
    • But once things settle down, might not want everyone's masters tracking each other.
    • But if we don't have master/master/master, then new features that span repos will be challenging.

Release Branches

Review v3.0.x Milestones v3.0.6

Review v3.1.x Milestones v3.1.6

  • Michael is interested in a schedule.
    • He will discuss and propose a date

Review v4.0.x Milestones v4.0.3

  • v4.0.3 in the works.
    • Config file needs to be committed to
    • Merging in PRs
    • Seeing some instability on github right now.

v5.0.0

  • Schedule:
    • Feature Freeze: End of April
    • Release: End of June
  • Austen took an initial stab at issues and is starting a Google Sheet of v5.0 features.
  • Issue - 7475 Blocker for Amazon.
    • When marking items as blockers, please add a rationale.
    • This functionality is in v4.0, and if you don't specify a PML you'd get this modex.
    • Apparently the check wasn't working in 3.0 or 4.0
  • May need doc updates for command line changes, both PRRTE and MCA changes.
    • Want to get some feedback from the community.
    • Huge list of features

master

  • Biggest thing on master is prrte.
    • Issues are being found and fixed.
  • Cisco MTT is failing due to -np.

Face to face

  • Maybe mid-late summer. No discussion

Infrastructure

  • Scale testing: PRs have to opt into it.

Review Master Pull Requests

CI status


Dependencies

PMIx Update

  • CI testing only checks that it builds and that it ran, but doesn't test HOW it ran.
    • Environment setup can be a bit different.
    • For example, no permissions in /tmp: a test might pass on one machine and fail on another that lacks /tmp permissions.
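
As an illustration of the /tmp point above, a CI job could run a small pre-flight check before the tests so that an environment difference is reported as such rather than as a test failure. The following is only a sketch, not something discussed on the call: the function name `check_tmp_usable` is hypothetical, and it only probes whether a file can be created in the temporary directory.

```python
import os
import tempfile


def check_tmp_usable(tmp_dir=None):
    """Return (ok, reason) describing whether files can be created in tmp_dir.

    Hypothetical helper for illustration; tmp_dir defaults to the
    platform's temporary directory (usually /tmp on Linux).
    """
    tmp_dir = tmp_dir or tempfile.gettempdir()
    if not os.path.isdir(tmp_dir):
        return False, f"{tmp_dir} is not a directory"
    try:
        # Actually create (and automatically remove) a file instead of
        # relying on permission bits alone.
        with tempfile.NamedTemporaryFile(dir=tmp_dir):
            pass
    except OSError as exc:
        return False, f"cannot create files in {tmp_dir}: {exc}"
    return True, f"{tmp_dir} is writable"


if __name__ == "__main__":
    ok, reason = check_tmp_usable()
    print(("OK: " if ok else "FAIL: ") + reason)
```

Running this at the start of a CI session would make a "no permissions in /tmp" machine show up explicitly in the log.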

ORTE/PRRTE

MTT


Back to 2019 WeeklyTelcon-2019
