Hi all,
I'm having a Cuda error when running LB simulations on the ubuntu 18.04
LTS with Espresso dev, Cuda 10.1 and OpenMPI 2.1.1. This error will be
gone every time when I reboot the computer but will still appear
randomly later. The script is working fine on another machine. I'm not
sure if it's a hardware or software issue now, Does anyone has any
suggestions on this issue?
CUDA error: unknown error
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMMUNICATOR 4
with errorcode 1.
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
Cheers
Le