Peter Boyle
368c8369ce
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
2019-07-12 17:11:29 +01:00
Peter Boyle
c0d89a2dbb
TODO updates
2019-07-12 17:11:15 +01:00
Peter Boyle
78ebd93281
Cuda 9.1 happy
2019-07-12 17:11:00 +01:00
Peter Boyle
3d58daf70f
Safety check
2019-07-12 17:10:35 +01:00
Peter Boyle
bd155ca5c0
Overlap comms with comput now supported
2019-07-12 09:09:40 +01:00
Peter Boyle
91e2cf9b40
All axes can be used for comms now
2019-07-12 09:08:26 +01:00
Peter Boyle
3cc9947731
Better welcome printing
2019-07-12 06:47:51 +01:00
Peter Boyle
f15eeb0283
localise scope of variables declared in macro
2019-07-12 06:47:01 +01:00
Peter Boyle
0996ba9396
Pretty messaging
2019-07-12 06:45:31 +01:00
Michael Marshall
12afb0395f
Debugging transposeSpin - seems just not to be implemented for Lattice<x>
2019-07-11 17:42:26 +01:00
Felix Erben
ec4aa978ab
why cant I spinTranspose
2019-07-11 14:01:41 +01:00
Peter Boyle
966a203dcb
Interactions with GPU compilation
2019-07-11 03:16:17 +01:00
Peter Boyle
44170cc15f
Initialise CUDA device prior to entering MPI.
...
This may or may not interact with Summit which configures MPI - CUDA mapping with jsrun.
TBD
Cases of OpenMPI and MVAPICH are covered, and default to cudaSetDevice(0) otherwise
2019-07-11 03:14:23 +01:00
Michael Marshall
7bc4a06f3f
This is probably what you want ...
2019-07-10 12:29:33 +01:00
Michael Marshall
cd659525e1
You probably want to add this to the build. And you may need to do a bootstrap
2019-07-10 12:08:37 +01:00
Felix Erben
dc2240d2d8
why does sliceSum in Nucleon.hpp not work
2019-07-10 11:34:16 +01:00
Felix Erben
98cf20cf06
continued work on baryons
2019-07-09 17:42:36 +01:00
Felix Erben
cc3346073e
continued work on baryons
2019-07-09 17:30:32 +01:00
Felix Erben
3848da7c50
added nucleon module (non-distillation)
2019-07-08 17:43:14 +01:00
c3d0c176ab
cleaning up Kl2 contraction
2019-07-05 16:29:46 +01:00
0a71f8bb10
Merge pull request #222 from guelpers/feature/kl2QEDseq
...
EMLepton: Multiple source-sink separations at once
2019-07-05 16:22:34 +01:00
Felix Erben
b7d0cf6751
buxfix in diquark sum / baryons
2019-07-04 22:06:37 +01:00
3a31ba2ea2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QEDseq
2019-07-03 14:37:56 +01:00
eac6337466
Hadrons: EMLepton: multiple source-sink separations at once
2019-07-03 14:36:34 +01:00
ab7537e002
Merge pull request #221 from fionnoh/bugfix/A2ALoop
...
Bugfix for A2ALoop module
2019-07-03 14:13:51 +01:00
Felix Erben
2c1a077369
continued on baryons
2019-07-02 17:55:28 +01:00
Peter Boyle
6e3c3214a3
Offload loops
2019-07-02 17:25:40 +01:00
Peter Boyle
d6ffadb33b
Coalesced write
2019-07-02 17:25:13 +01:00
Michael Marshall
ae3abbe53d
Added the ability for Perambulator module to save unsmeared sinks through the addition of two optional parameters:
...
UnsmearedSinkFileName: If present, specifies the filename to write to
UnsmearedSinkMultiFile: defaults to true to write each sink vector to a different file, but can be set to 0 for a single file
2019-07-01 17:28:27 +01:00
Felix Erben
5fc0188205
started saving sinks
2019-07-01 14:51:59 +01:00
Peter Boyle
4c3225412b
Drop 5dVEC
2019-07-01 07:31:26 +01:00
Peter Boyle
b8f7bfbb26
Dont stream as poor perf in some cases
2019-07-01 07:30:25 +01:00
Peter Boyle
7b7c470917
Accelerator loop
2019-07-01 07:29:51 +01:00
Peter Boyle
532e226b22
cuda 9.1 fixes
2019-07-01 07:29:22 +01:00
Peter Boyle
6a13731818
Move GPU cuda call earlier
2019-07-01 07:28:41 +01:00
fionnoh
67690df3bd
Changes nedded to have a current insertion on every second time slice - avoids unnecessary contractions
2019-06-28 15:18:28 +08:00
fionnoh
1059189abf
Bugfix for A2ALoop module
2019-06-27 13:49:55 +08:00
fionnoh
ce29b18dc9
New modules for loading in MFs as diskvectors and producing propagaotrs from 4 quark contractions
2019-06-27 13:46:06 +08:00
fionnoh
421a0a8a36
Changes to A2Autils, A2AMatirx and DiskVector code that is needed for Hadrons 4 quark contraction module
2019-06-27 13:45:20 +08:00
fionnoh
ac530636ca
A2Aloop bugfix
2019-06-27 13:44:47 +08:00
Michael Marshall
2d940a598c
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:37:50 +01:00
Michael Marshall
c28c5fc61b
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:31:41 +01:00
Michael Marshall
015340d60c
Elided superfluous copy on write
2019-06-19 09:37:03 +01:00
Peter Boyle
1cd4ee0706
Thrust used on GPU builds
2019-06-18 12:50:35 +01:00
Peter Boyle
b8f71b6777
Fix NVCC warning unused variable
2019-06-17 13:58:45 +01:00
Peter Boyle
703dc20377
Compile tests fix
2019-06-16 13:59:29 +01:00
Peter Boyle
d976e5c514
Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug.
2019-06-16 12:05:11 +01:00
Peter Boyle
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
Peter Boyle
f710d7bd45
TODO list update
2019-06-15 12:54:27 +01:00
Peter Boyle
cb336aa8f8
Thread loop constructs changing a little
2019-06-15 12:54:11 +01:00