Fix 1571 (CMake variable & detection of MPI CUDA awareness) #1602

mhoemmen · 2017-08-14T03:28:14Z

@trilinos/tpetra Hi all! I wrote a PR that does the following:

Defines a CMake variable, Tpetra_ASSUME_CUDA_AWARE_MPI, for whether Tpetra may assume that the MPI implementation is CUDA aware.
Attempts to detect the default value of this variable (this currently only works with OpenMPI; it currently safely assumes OFF for other MPI implementations).
Enforces CUDA_VERSION >= 7.5.

I took particular care to avoid breaking the cross-compilation case. The PR explicitly checks CMAKE_CROSSCOMPILING; if ON, Tpetra does not attempt to run executables. This is relevant because I know users who do cross-compilation with CUDA builds right now.

I welcome feedback! My only concern is that the PR forces Tpetra to decide whether MPI is CUDA aware at configure time. Some MPI implementations, like MVAPICH (see http://mvapich.cse.ohio-state.edu/userguide/gdr/2.2/ ), let users control this at run time, by setting an environment variable. This means that Tpetra may also need run-time environment variable control. However, we can always add that feature later. The CMake option is still useful, because it could determine the environment variable's default value. Thus, I think the PR is fine as it stands.

@trilinos/tpetra If building with CUDA, explicitly enforce that CUDA_VERSION be at least 7.5. See #1278 for details. Also, state CUDA_VERSION requirement explicitly in Tpetra's release notes (packages/tpetra/ReleaseNotes.txt).

@trilinos/tpetra Add a CMake option Tpetra_ASSUME_CUDA_AWARE_MPI, with associated macro TPETRA_ASSUME_CUDA_AWARE_MPI defined in TpetraCore_config.h. If the CMake option is ON, Tpetra may assume that the MPI implementation it uses is CUDA aware. See #1571 for discussion, and #1088 for an application. The option currently defaults to OFF. #1571 requires that we actually have a test for this option, at least for OpenMPI, so we haven't finished the issue yet.

@trilinos/tpetra Tpetra's CMake logic now attempts to detect whether the MPI implementation is CUDA aware. If automatic detection does not succeed, Tpetra just makes the safe assumption that MPI is not CUDA aware. Currently, automatic detection requires OpenMPI. If not using OpenMPI, Tpetra conservatively assumes lack of CUDA awareness. It would be wise for us to extend detection to support other MPI implementations, but for now, this covers a common use case for Trilinos testing. Automatic detection depends on running an executable. This is relevant for cross compilation, so I have added two measures to protect against misleading results in that case: 1. If CMAKE_CROSSCOMPILING is ON, Tpetra skips detection and prints a configure-time message telling users that they may set Tpetra_ASSUME_CUDA_AWARE_MPI explicitly. 2. If users set Tpetra_ASSUME_CUDA_AWARE_MPI explicitly, Tpetra skips detection and assumes the user's value as the default.

ibaned

I've thrown in a few questions & suggestions for improvement. I'm not going to block this PR on anything, but replies are appreciated.

ibaned · 2017-08-14T13:49:44Z