Peter Boyle
639dc1ab21
GPU reduction fix and also exit backtrace option
2019-07-31 01:23:23 +01:00
Peter Boyle
9117f61109
GPU friendly
2019-07-31 01:22:54 +01:00
Peter Boyle
9dad7a0094
Reproducible reduction and axpy_norm offload from Gianluca.
...
Hopefully get CG running entirely on GPU
2019-07-30 00:14:12 +01:00
Peter Boyle
1282e1067f
Do the force term on the accelerator too. Needed particularly because comms buffers
...
are device memory.
2019-07-29 22:58:35 +01:00
Peter Boyle
275c1c920f
More info dump on error from CUDA
2019-07-26 12:18:53 +01:00
Peter Boyle
fe700a183a
Getting HMC to run
2019-07-26 12:18:29 +01:00
Peter Boyle
34108296cd
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/simd/Grid_avx512.h
2019-07-20 17:05:35 +01:00
Peter Boyle
76c704b84b
Intrinsics for CLANG are now fixed in v6
2019-07-20 16:52:24 +01:00
Peter Boyle
ce255ec359
Relocate to fix build failure for comms none
2019-07-20 16:37:03 +01:00
Peter Boyle
1c096626cb
Hypercube defaults to on if HPE detected, but override to off possible
2019-07-20 16:06:16 +01:00
Peter Boyle
ce8b247426
Compiles
2019-07-20 15:16:02 +01:00
Peter Boyle
80481f81be
Constructor typo
2019-07-20 09:58:24 +01:00
Peter Boyle
d85dcc72df
Multinode fix
2019-07-20 07:13:28 +01:00
Peter Boyle
3fedcd6d52
Compiles
2019-07-20 07:12:44 +01:00
Peter Boyle
25ba4c5f80
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
HMC/Mobius2p1fEOFA.cc
tests/forces/Test_rect_force.cc
2019-07-19 11:01:55 +01:00
Peter Boyle
671bcbcccb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-07-19 10:48:22 +01:00
Peter Boyle
ff325376cb
Fix single precision deriv test fail
2019-07-19 10:47:44 +01:00
Peter Boyle
9e926e3fc5
Build fix in develop
2019-07-19 10:01:52 +01:00
Peter Boyle
775eaee199
Fix for suspected Intel 2018.1 compiler bug under O3
2019-07-19 07:57:34 +01:00
Peter Boyle
0fd2827d5d
Fix fail in single
2019-07-19 05:28:26 +01:00
Peter Boyle
bdd79f9ef8
TODO update
2019-07-18 22:04:28 +01:00
Peter Boyle
0695f8cec2
Single precision compile fix. Soon deprecate single precision
2019-07-18 22:02:31 +01:00
Peter Boyle
9fa705c5a0
comma fix
2019-07-18 21:38:11 +01:00
Peter Boyle
331f5a53dc
New header
2019-07-18 14:51:09 +01:00
Peter Boyle
a23dc295ac
Remove compiler errors and warnings
2019-07-18 14:47:02 +01:00
Peter Boyle
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
Peter Boyle
7c11525d1a
Local stencil for complex wilson loops etc
2019-07-14 14:05:09 +01:00
Peter Boyle
42c1dbb1d1
General local stencil first cut for Patrick force term
2019-07-14 14:04:28 +01:00
Peter Boyle
6179acfda0
Put back a call that was required
2019-07-14 13:59:54 +01:00
Peter Boyle
07601ac1f5
Replace instantiation of Gparity
2019-07-12 17:18:12 +01:00
Peter Boyle
705a8098b2
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
...
Conflicts:
Grid/stencil/Stencil.h
2019-07-12 17:14:11 +01:00
Peter Boyle
a29b43d755
Stencil comms cleaner
2019-07-12 17:12:25 +01:00
Peter Boyle
368c8369ce
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
2019-07-12 17:11:29 +01:00
Peter Boyle
c0d89a2dbb
TODO updates
2019-07-12 17:11:15 +01:00
Peter Boyle
78ebd93281
Cuda 9.1 happy
2019-07-12 17:11:00 +01:00
Peter Boyle
3d58daf70f
Safety check
2019-07-12 17:10:35 +01:00
Peter Boyle
bd155ca5c0
Overlap comms with comput now supported
2019-07-12 09:09:40 +01:00
Peter Boyle
91e2cf9b40
All axes can be used for comms now
2019-07-12 09:08:26 +01:00
Peter Boyle
3cc9947731
Better welcome printing
2019-07-12 06:47:51 +01:00
Peter Boyle
f15eeb0283
localise scope of variables declared in macro
2019-07-12 06:47:01 +01:00
Peter Boyle
0996ba9396
Pretty messaging
2019-07-12 06:45:31 +01:00
Peter Boyle
966a203dcb
Interactions with GPU compilation
2019-07-11 03:16:17 +01:00
Peter Boyle
44170cc15f
Initialise CUDA device prior to entering MPI.
...
This may or may not interact with Summit which configures MPI - CUDA mapping with jsrun.
TBD
Cases of OpenMPI and MVAPICH are covered, and default to cudaSetDevice(0) otherwise
2019-07-11 03:14:23 +01:00
c3d0c176ab
cleaning up Kl2 contraction
2019-07-05 16:29:46 +01:00
0a71f8bb10
Merge pull request #222 from guelpers/feature/kl2QEDseq
...
EMLepton: Multiple source-sink separations at once
2019-07-05 16:22:34 +01:00
3a31ba2ea2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QEDseq
2019-07-03 14:37:56 +01:00
eac6337466
Hadrons: EMLepton: multiple source-sink separations at once
2019-07-03 14:36:34 +01:00
ab7537e002
Merge pull request #221 from fionnoh/bugfix/A2ALoop
...
Bugfix for A2ALoop module
2019-07-03 14:13:51 +01:00
Peter Boyle
6e3c3214a3
Offload loops
2019-07-02 17:25:40 +01:00