
shared memory segments not cleaned up by vader btl after program aborted #6322

Closed
leofang opened this issue Jan 29, 2019 · 8 comments


leofang commented Jan 29, 2019

Background information

What version of Open MPI are you using?

v3.1.2, v3.1.3

Describe how Open MPI was installed

We have an internally managed Conda environment and we build our own Conda packages, including openmpi. (I think it was built from the tarball downloaded from the Open MPI website.) Then, it was installed using conda install openmpi.

Please describe the system on which you are running

  • Operating system/version: Debian GNU/Linux 8 (jessie)
  • Computer hardware: Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20GHz + 512 GB RAM + 4 Nvidia V100 GPUs
  • Network type: tcp/ip

Details of the problem

If one spawns a few MPI processes, lets them do some work, and then terminates them abnormally (Ctrl-C and the like), shared memory segments belonging to the vader component are left behind in /dev/shm/, not unlinked by Open MPI during the cleanup phase:

leofang@xf03id-srv5:~$ mpirun -n 4 python do_very_long_calculation_in_mpi.py
...output not important...
^C
leofang@xf03id-srv5:~$ ls -lt /dev/shm | more
total 28464
-rw------- 1 leofang leofang 4194312 Jan 25 23:06 vader_segment.xf03id-srv5.73dc0001.2
-rw------- 1 leofang leofang 4194312 Jan 25 23:06 vader_segment.xf03id-srv5.73dc0001.3
-rw------- 1 leofang leofang 4194312 Jan 25 23:06 vader_segment.xf03id-srv5.73dc0001.1
-rw------- 1 leofang leofang 4194312 Jan 25 23:06 vader_segment.xf03id-srv5.73dc0001.0
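
In the meantime we remove the stale segments by hand. Below is a rough sketch of that workaround (my own helper script, not part of Open MPI); it assumes the vader_segment.* naming shown above and that no MPI job is currently running on the node, since live jobs use the same files:

import glob
import os

# Remove leftover vader segments owned by the current user in /dev/shm.
# WARNING: only run this when no MPI job is active on the node.
for path in glob.glob("/dev/shm/vader_segment.*"):
    try:
        if os.stat(path).st_uid == os.getuid():
            print("removing", path)
            os.remove(path)
    except FileNotFoundError:
        pass  # already gone, e.g. removed by another process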

I know that we didn't have this issue with v3.1.1, and I'll test v3.1.3 later (UPDATE: v3.1.3 also has this problem). For now I just need to know whether this is a known bug in v3.1.2 so that I can avoid that version in our Conda settings. Thanks!

UPDATE: a minimal working example in Python is provided below

import time
import mpi4py # I used v3.0.0
mpi4py.rc.initialize = False # avoid auto initialization
from mpi4py import MPI

MPI.Init_thread() # manually initialize
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()
print("I'm rank", rank, "from a pool of size", size, ", begin sleeping...")
time.sleep(30)
print("rank", rank, "awakes!")
MPI.Finalize()

and terminate it as described above during the 30-second sleep. Note that if -n 1 is used, no residual segment is left in /dev/shm even after an abnormal abort. I double-checked that in the 3.1.x series this problem only happens with v3.1.2 and v3.1.3.

UPDATE 2: an equivalent MWE in C

#include <mpi.h>
#include <stdio.h>
#include <unistd.h>

int main(int argc, char *argv[])
{
  int size, rank;

  MPI_Init(&argc, &argv);

  MPI_Comm_size(MPI_COMM_WORLD, &size);
  MPI_Comm_rank(MPI_COMM_WORLD, &rank);

  printf("I am process %d of %d. Start sleeping...\n", rank, size);
  sleep(30);
  printf("rank %d awakes!\n", rank);

  MPI_Finalize();
  return 0;
}

jsquyres commented Feb 6, 2019

When you say spawn, do you mean MPI_Comm_spawn, or do you just mean mpirun?


leofang commented Feb 6, 2019

I meant mpirun, thanks for asking!


leofang commented Feb 15, 2019

It's odd. I just tested v3.1.3 and this also happens...

leofang changed the title from "shared memory segments not cleaned up by vader btl after program aborted in v3.1.2" to "shared memory segments not cleaned up by vader btl after program aborted" on Feb 15, 2019

leofang commented Feb 16, 2019

Hello all, I've updated the post to include a short MWE to reproduce the issue, which I now believe is a bug. Please take a look. Thanks.


leofang commented Feb 16, 2019

Update: an MWE written in C also reproduces the error.


leofang commented Apr 15, 2019

Just pinging to see if anyone can reproduce this. Thanks.

jsquyres commented

A fix for vader (shared memory) cleanup just recently went in on the v4.0.x branch (but didn't make v4.0.1). Can you test any recent nightly snapshot on the v4.0.x branch and see if the problem has been resolved for you?

https://www.open-mpi.org/nightly/v4.0.x/
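
(Side note: a quick way to confirm which Open MPI build is actually being picked up when testing the snapshot is to print the library version string; a small sketch using mpi4py, assuming it is rebuilt/linked against the nightly:)

from mpi4py import MPI

# MPI_Get_library_version: for Open MPI this string includes the release,
# e.g. "Open MPI v4.0.x ...", so it shows which build is loaded.
print(MPI.Get_library_version())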


leofang commented May 5, 2019

@jsquyres Thanks for your reply and sorry for the long silence. I just tested both of the latest released versions, v3.1.4 and v4.0.1. I think the fix has landed in v4.0.1 but not in v3.1 yet. Am I right that #6550 is the fix for this issue? Will it be backported to v3.1 at some point?

Thanks, and please feel free to close this issue.
