Peter Boyle
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of CUDA and other code variations floating around all over the place.
Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
f592ec8baa
Hadrons: contractor performance fix
2018-11-16 20:59:49 +00:00
8b007b5c24
Hadrons: remove the use of OpenMP reductions
2018-11-16 20:00:29 +00:00
88d9922e4f
Hadrons: fast A2A matrix contraction kernels
2018-11-06 19:49:09 +00:00
1651111d18
Hadrons: final, portable form of the contractor benchmark
2018-11-05 21:29:13 +00:00
fb7d021b9d
Hadrons: moving Hadrons to root directory, build system improvements
2018-08-28 15:00:40 +01:00