OPENMPI detected
AcceleratorCudaInit[0]: ========================
AcceleratorCudaInit[0]: Device Number : 0
AcceleratorCudaInit[0]: ========================
AcceleratorCudaInit[0]: Device identifier: Tesla V100-SXM2-16GB
AcceleratorCudaInit[0]: totalGlobalMem: 16911433728
AcceleratorCudaInit[0]: managedMemory: 1
AcceleratorCudaInit[0]: isMultiGpuBoard: 0
AcceleratorCudaInit[0]: warpSize: 32
AcceleratorCudaInit[0]: pciBusID: 4
AcceleratorCudaInit[0]: pciDeviceID: 0
AcceleratorCudaInit[0]: maxGridSize (2147483647,65535,65535)
AcceleratorCudaInit: using default device
AcceleratorCudaInit: assume user either uses
AcceleratorCudaInit: a) IBM jsrun, or
AcceleratorCudaInit: b) invokes through a wrapping script to set CUDA_VISIBLE_DEVICES, UCX_NET_DEVICES, and numa binding
AcceleratorCudaInit: Configure options --enable-setdevice=no
local rank 0 device 0 bus id: 0004:04:00.0
AcceleratorCudaInit: ================================================
SharedMemoryMpi: World communicator of size 24
SharedMemoryMpi: Node communicator of size 1
local rank 3 device 0 bus id: 0004:04:00.0
local rank 2 device 0 bus id: 0004:04:00.0
local rank 1 device 0 bus id: 0004:04:00.0
0SharedMemoryMpi: SharedMemoryMPI.cc acceleratorAllocDevice 1073741824bytes at 0x200080000000 - 2000bfffffff for comms buffers
Setting up IPC
local rank 5 device 0 bus id: 0004:04:00.0
__|__|__|__|__|__|__|__|__|__|__|__|__|__|__
__|__|__|__|__|__|__|__|__|__|__|__|__|__|__
__|_ |  |  |  |  |  |  |  |  |  |  |  | _|__
__|_                                    _|__
__|_   GGGG    RRRR    III    DDDD      _|__
__|_  G        R   R    I     D   D     _|__
__|_  G        R   R    I     D    D    _|__
__|_  G  GG    RRRR     I     D    D    _|__
__|_  G   G    R  R     I     D   D     _|__
__|_   GGGG    R   R   III    DDDD      _|__
__|_                                    _|__
__|__|__|__|__|__|__|__|__|__|__|__|__|__|__
__|__|__|__|__|__|__|__|__|__|__|__|__|__|__
  |  |  |  |  |  |  |  |  |  |  |  |  |  |
Copyright (C) 2015 Peter Boyle, Azusa Yamaguchi, Guido Cossu, Antonin Portelli and other authors
This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
local rank 4 device 0 bus id: 0004:04:00.0
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
Current Grid git commit hash=1713de35c0dc339564661dd7df8a72583f889e91: (HEAD -> feature/dirichlet) uncommited changes
Grid : Message : ================================================
Grid : Message : MPI is initialised and logging filters activated
Grid : Message : ================================================
Grid : Message : Requested 1073741824 byte stencil comms buffers
Grid : Message : MemoryManager::Init() setting up
Grid : Message : MemoryManager::Init() cache pool for recent allocations: SMALL 8 LARGE 2
Grid : Message : MemoryManager::Init() Unified memory space
Grid : Message : MemoryManager::Init() Using cudaMallocManaged
Grid : Message : 0.139000 s : ++++++++++++++++++++++++++++++++++++++++++++++++
Grid : Message : 0.151000 s : Testing with full communication
Grid : Message : 0.158000 s : ++++++++++++++++++++++++++++++++++++++++++++++++
Grid : Message : 0.165000 s : Grid Layout
Grid : Message : 0.171000 s : Global lattice size : 64 64 64 96
Grid : Message : 0.181000 s : OpenMP threads : 6
Grid : Message : 0.189000 s : MPI tasks : 2 2 2 3
Grid : Message : 0.177717 s : Initialising 4d RNG
Grid : Message : 0.342461 s : Intialising parallel RNG with unique string 'The 4D RNG'
Grid : Message : 0.342483 s : Seed SHA256: 49db4542db694e3b1a74bf2592a8c1b83bfebbe18401693c2609a4c3af1
Grid : Message : 0.370454 s : Initialising 5d RNG
Grid : Message : 3.174160 s : Intialising parallel RNG with unique string 'The 5D RNG'
Grid : Message : 3.174420 s : Seed SHA256: b6316f2fac44ce14111f93e0296389330b077bfd0a7b359f781c58589f8a
Grid : Message : 22.119339 s : Drawing gauge field
Grid : Message : 38.113060 s : Random gauge initialised
Grid : Message : 38.113320 s : Applying BCs for Dirichlet Block5 [0 0 0 0 0]
Grid : Message : 38.113470 s : Applying BCs for Dirichlet Block4 [0 0 0 0]
Grid : Message : 43.906786 s : Setting up Cshift based reference