Felix Erben
ec4aa978ab
why cant I spinTranspose
2019-07-11 14:01:41 +01:00
Peter Boyle
966a203dcb
Interactions with GPU compilation
2019-07-11 03:16:17 +01:00
Peter Boyle
44170cc15f
Initialise CUDA device prior to entering MPI.
...
This may or may not interact with Summit which configures MPI - CUDA mapping with jsrun.
TBD
Cases of OpenMPI and MVAPICH are covered, and default to cudaSetDevice(0) otherwise
2019-07-11 03:14:23 +01:00
Michael Marshall
7bc4a06f3f
This is probably what you want ...
2019-07-10 12:29:33 +01:00
Michael Marshall
cd659525e1
You probably want to add this to the build. And you may need to do a bootstrap
2019-07-10 12:08:37 +01:00
Felix Erben
dc2240d2d8
why does sliceSum in Nucleon.hpp not work
2019-07-10 11:34:16 +01:00
Felix Erben
98cf20cf06
continued work on baryons
2019-07-09 17:42:36 +01:00
Felix Erben
cc3346073e
continued work on baryons
2019-07-09 17:30:32 +01:00
Felix Erben
3848da7c50
added nucleon module (non-distillation)
2019-07-08 17:43:14 +01:00
c3d0c176ab
cleaning up Kl2 contraction
2019-07-05 16:29:46 +01:00
0a71f8bb10
Merge pull request #222 from guelpers/feature/kl2QEDseq
...
EMLepton: Multiple source-sink separations at once
2019-07-05 16:22:34 +01:00
Felix Erben
b7d0cf6751
buxfix in diquark sum / baryons
2019-07-04 22:06:37 +01:00
3a31ba2ea2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QEDseq
2019-07-03 14:37:56 +01:00
eac6337466
Hadrons: EMLepton: multiple source-sink separations at once
2019-07-03 14:36:34 +01:00
ab7537e002
Merge pull request #221 from fionnoh/bugfix/A2ALoop
...
Bugfix for A2ALoop module
2019-07-03 14:13:51 +01:00
Felix Erben
2c1a077369
continued on baryons
2019-07-02 17:55:28 +01:00
Peter Boyle
6e3c3214a3
Offload loops
2019-07-02 17:25:40 +01:00
Peter Boyle
d6ffadb33b
Coalesced write
2019-07-02 17:25:13 +01:00
Michael Marshall
ae3abbe53d
Added the ability for Perambulator module to save unsmeared sinks through the addition of two optional parameters:
...
UnsmearedSinkFileName: If present, specifies the filename to write to
UnsmearedSinkMultiFile: defaults to true to write each sink vector to a different file, but can be set to 0 for a single file
2019-07-01 17:28:27 +01:00
Felix Erben
5fc0188205
started saving sinks
2019-07-01 14:51:59 +01:00
Peter Boyle
4c3225412b
Drop 5dVEC
2019-07-01 07:31:26 +01:00
Peter Boyle
b8f7bfbb26
Dont stream as poor perf in some cases
2019-07-01 07:30:25 +01:00
Peter Boyle
7b7c470917
Accelerator loop
2019-07-01 07:29:51 +01:00
Peter Boyle
532e226b22
cuda 9.1 fixes
2019-07-01 07:29:22 +01:00
Peter Boyle
6a13731818
Move GPU cuda call earlier
2019-07-01 07:28:41 +01:00
fionnoh
1059189abf
Bugfix for A2ALoop module
2019-06-27 13:49:55 +08:00
Michael Marshall
2d940a598c
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:37:50 +01:00
Michael Marshall
c28c5fc61b
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:31:41 +01:00
Michael Marshall
015340d60c
Elided superfluous copy on write
2019-06-19 09:37:03 +01:00
Peter Boyle
1cd4ee0706
Thrust used on GPU builds
2019-06-18 12:50:35 +01:00
Peter Boyle
b8f71b6777
Fix NVCC warning unused variable
2019-06-17 13:58:45 +01:00
Peter Boyle
703dc20377
Compile tests fix
2019-06-16 13:59:29 +01:00
Peter Boyle
d976e5c514
Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug.
2019-06-16 12:05:11 +01:00
Peter Boyle
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
Peter Boyle
f710d7bd45
TODO list update
2019-06-15 12:54:27 +01:00
Peter Boyle
cb336aa8f8
Thread loop constructs changing a little
2019-06-15 12:54:11 +01:00
Peter Boyle
462900b48d
Modified entire test directory to suit new GPU constructs for looping
2019-06-15 12:53:27 +01:00
Peter Boyle
0561c2edeb
Benchmarks modified for new GPU constructs
2019-06-15 12:52:56 +01:00
Peter Boyle
0184719216
Change to predicate type
2019-06-15 12:52:26 +01:00
Peter Boyle
24202dbc51
Thread loop construct change
2019-06-15 12:52:07 +01:00
Peter Boyle
d763c303c5
Clean acceleerator barrier
2019-06-15 12:51:45 +01:00
Peter Boyle
8e394d3bf9
New loop construct
2019-06-15 12:51:15 +01:00
Peter Boyle
b881d5489b
Move SchurDiagTwoKappa to Algorithms
2019-06-15 12:50:45 +01:00
Peter Boyle
82306913a8
Move Schur operator into correct place
2019-06-15 12:49:22 +01:00
Peter Boyle
49f90cc7eb
use pragma once
2019-06-15 12:45:22 +01:00
Peter Boyle
b77af0210b
Thread loop. Probably deprecate this impl
2019-06-15 12:44:56 +01:00
Peter Boyle
5254ede2d8
New loops. Revisit as accelerator loop in future audit
2019-06-15 12:44:29 +01:00
Peter Boyle
16e5d7945e
Hard to make 5D vec work with GPU code
2019-06-15 12:43:43 +01:00
Peter Boyle
decc99ca76
Accelerator version
2019-06-15 12:43:00 +01:00
Peter Boyle
464cd65931
Still to test this fully
2019-06-15 12:35:14 +01:00