1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-10 07:55:35 +00:00
Commit Graph

5193 Commits

Author SHA1 Message Date
Peter Boyle
96ac56cace Double precision variants for summation accuracy 2019-08-14 13:08:01 +01:00
Peter Boyle
2b037e3daa Update todo list 2019-08-14 13:07:26 +01:00
Peter Boyle
2d2de7aede Freeze the seed 2019-08-14 13:07:11 +01:00
Peter Boyle
ce97638bac Think the reduction is now sorted and cleaned up 2019-08-11 11:09:01 +01:00
Peter Boyle
53e3ab4131 Fix force term 2019-08-11 11:06:13 +01:00
Peter Boyle
9cd33a7b9c Printing improvement 2019-07-31 08:01:24 +01:00
Peter Boyle
639dc1ab21 GPU reduction fix and also exit backtrace option 2019-07-31 01:23:23 +01:00
Peter Boyle
9117f61109 GPU friendly 2019-07-31 01:22:54 +01:00
Peter Boyle
9dad7a0094 Reproducible reduction and axpy_norm offload from Gianluca.
Hopefully get CG running entirely on GPU
2019-07-30 00:14:12 +01:00
Peter Boyle
1282e1067f Do the force term on the accelerator too. Needed particularly because comms buffers
are device memory.
2019-07-29 22:58:35 +01:00
Peter Boyle
275c1c920f More info dump on error from CUDA 2019-07-26 12:18:53 +01:00
Peter Boyle
fe700a183a Getting HMC to run 2019-07-26 12:18:29 +01:00
Peter Boyle
34108296cd Merge branch 'develop' into feature/gpu-port
Conflicts:
	Grid/simd/Grid_avx512.h
2019-07-20 17:05:35 +01:00
Peter Boyle
76c704b84b Intrinsics for CLANG are now fixed in v6 2019-07-20 16:52:24 +01:00
Peter Boyle
ce255ec359 Relocate to fix build failure for comms none 2019-07-20 16:37:03 +01:00
Peter Boyle
1c096626cb Hypercube defaults to on if HPE detected, but override to off possible 2019-07-20 16:06:16 +01:00
Peter Boyle
ce8b247426 Compiles 2019-07-20 15:16:02 +01:00
Peter Boyle
80481f81be Constructor typo 2019-07-20 09:58:24 +01:00
Peter Boyle
d85dcc72df Multinode fix 2019-07-20 07:13:28 +01:00
Peter Boyle
3fedcd6d52 Compiles 2019-07-20 07:12:44 +01:00
Peter Boyle
25ba4c5f80 Merge branch 'develop' into feature/gpu-port
Conflicts:
	HMC/Mobius2p1fEOFA.cc
	tests/forces/Test_rect_force.cc
2019-07-19 11:01:55 +01:00
Peter Boyle
671bcbcccb Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-07-19 10:48:22 +01:00
Peter Boyle
ff325376cb Fix single precision deriv test fail 2019-07-19 10:47:44 +01:00
Peter Boyle
9e926e3fc5 Build fix in develop 2019-07-19 10:01:52 +01:00
Peter Boyle
775eaee199 Fix for suspected Intel 2018.1 compiler bug under O3 2019-07-19 07:57:34 +01:00
Peter Boyle
0fd2827d5d Fix fail in single 2019-07-19 05:28:26 +01:00
Peter Boyle
bdd79f9ef8 TODO update 2019-07-18 22:04:28 +01:00
Peter Boyle
0695f8cec2 Single precision compile fix. Soon deprecate single precision 2019-07-18 22:02:31 +01:00
Peter Boyle
9fa705c5a0 comma fix 2019-07-18 21:38:11 +01:00
Peter Boyle
331f5a53dc New header 2019-07-18 14:51:09 +01:00
Peter Boyle
a23dc295ac Remove compiler errors and warnings 2019-07-18 14:47:02 +01:00
Peter Boyle
08904f830e Merge develop 2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b Merge branch 'develop' into feature/gpu-port 2019-07-16 11:55:17 +01:00
Peter Boyle
7c11525d1a Local stencil for complex wilson loops etc 2019-07-14 14:05:09 +01:00
Peter Boyle
42c1dbb1d1 General local stencil first cut for Patrick force term 2019-07-14 14:04:28 +01:00
Peter Boyle
6179acfda0 Put back a call that was required 2019-07-14 13:59:54 +01:00
Peter Boyle
07601ac1f5 Replace instantiation of Gparity 2019-07-12 17:18:12 +01:00
Peter Boyle
705a8098b2 Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
Conflicts:
	Grid/stencil/Stencil.h
2019-07-12 17:14:11 +01:00
Peter Boyle
a29b43d755 Stencil comms cleaner 2019-07-12 17:12:25 +01:00
Peter Boyle
368c8369ce Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port 2019-07-12 17:11:29 +01:00
Peter Boyle
c0d89a2dbb TODO updates 2019-07-12 17:11:15 +01:00
Peter Boyle
78ebd93281 Cuda 9.1 happy 2019-07-12 17:11:00 +01:00
Peter Boyle
3d58daf70f Safety check 2019-07-12 17:10:35 +01:00
Peter Boyle
bd155ca5c0 Overlap comms with comput now supported 2019-07-12 09:09:40 +01:00
Peter Boyle
91e2cf9b40 All axes can be used for comms now 2019-07-12 09:08:26 +01:00
Peter Boyle
3cc9947731 Better welcome printing 2019-07-12 06:47:51 +01:00
Peter Boyle
f15eeb0283 localise scope of variables declared in macro 2019-07-12 06:47:01 +01:00
Peter Boyle
0996ba9396 Pretty messaging 2019-07-12 06:45:31 +01:00
Peter Boyle
966a203dcb Interactions with GPU compilation 2019-07-11 03:16:17 +01:00
Peter Boyle
44170cc15f Initialise CUDA device prior to entering MPI.
This may or may not interact with Summit which configures MPI - CUDA mapping with jsrun.
TBD
Cases of OpenMPI and MVAPICH are covered, and default to cudaSetDevice(0) otherwise
2019-07-11 03:14:23 +01:00