Peter Boyle
10d16ab76c
Remove explicit instantiation from here
2019-06-08 13:40:32 +01:00
Peter Boyle
1f997fa484
Instantiate via explicit .cc files for parallel make.
2019-06-08 13:39:51 +01:00
Peter Boyle
0ee6e77cbc
Compiles GPU and CPU, still gives good performance on CPU
2019-06-05 13:28:16 +01:00
Peter Boyle
18d3cde29a
Compile on GPU works
2019-06-05 00:14:58 +01:00
Peter Boyle
7323099966
Instantiation fix
2019-06-05 00:14:38 +01:00
Peter Boyle
6379651cdd
Generic or GPU ready for benchmark test on GPU
2019-06-05 00:13:52 +01:00
Peter Boyle
ba4fd756b9
Fix signature, but deprecating this loop style
2019-06-05 00:12:36 +01:00
Peter Boyle
d185fc1ebf
clean up instantiation
2019-06-05 00:11:52 +01:00
Peter Boyle
96b36d8367
Instantiation clean up
2019-06-05 00:11:27 +01:00
Peter Boyle
899f8b5065
Instantiation clean up 5d vec removal
2019-06-05 00:11:05 +01:00
Peter Boyle
c8d0483fe9
Remove 5d vectorisation
2019-06-05 00:10:37 +01:00
Peter Boyle
0f214e5f76
Clean up instantiation
2019-06-05 00:10:13 +01:00
Peter Boyle
9636324069
GPU happy code
2019-06-05 00:08:54 +01:00
Peter Boyle
8a5489d9e6
Move the loop into a central kernel call.
2019-06-05 00:08:13 +01:00
Peter Boyle
b47f73c222
GPU happy
2019-06-04 21:30:39 +01:00
Peter Boyle
5720ced0fd
Simplifying
2019-06-04 21:30:08 +01:00
Peter Boyle
2c87b56b53
Making GPU happier
2019-06-04 21:29:44 +01:00
Peter Boyle
dbad48d802
Remove Ls vectorised DWF
2019-06-04 21:27:40 +01:00
Peter Boyle
4557a1365a
Remove Ls vectorised DWF
2019-06-04 20:59:59 +01:00
Peter Boyle
16e9b87d98
Remove Ls vectorised DWF as unused and hard to maintain
2019-06-04 20:59:01 +01:00
Peter Boyle
685eea3d0f
Small cosmetic
2019-06-04 20:58:14 +01:00
Peter Boyle
65b48831fb
Simplify code
2019-06-04 20:56:30 +01:00
Peter Boyle
57396fc595
Simplify code
2019-06-04 20:56:23 +01:00
Peter Boyle
a2e199df50
Simplifying Cayley cases.
2019-06-04 20:54:52 +01:00
Peter Boyle
45b15d10d3
GPU happy changes
2019-06-04 20:49:16 +01:00
Peter Boyle
33d6bbe32b
GPU must use accelerator vectors
2019-06-04 20:48:52 +01:00
Peter Boyle
ade4a126da
Getting closer on the GPU port, but will start deleting 5th dim vectorised variants for code maintainability
2019-06-04 11:53:44 +01:00
Peter Boyle
7b59ab5bd7
Compiling after reorganisation
2019-06-03 15:46:26 +01:00
Peter Boyle
fcd8cfe257
Gparity in
2019-06-03 15:45:09 +01:00
Peter Boyle
b4b53812cb
Move implementation to specific implementation headers
2019-06-03 15:43:01 +01:00
Peter Boyle
085cac583f
Implementation in header
2019-06-03 15:42:36 +01:00
Peter Boyle
25e3b8640c
Move to header
2019-06-03 15:42:05 +01:00
Michael Marshall
c81d3d422d
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:25:05 +01:00
Michael Marshall
54edb9906e
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:20:46 +01:00
Peter Boyle
44bbec50b0
Making GPU compile happy
2019-06-03 14:57:04 +01:00
Peter Boyle
ec68b67d5d
Attempt at unified GPU and CPU kernel
2019-06-03 14:55:51 +01:00
Peter Boyle
778450e0c8
Move to implementation subdir
2019-06-03 14:53:56 +01:00
Peter Boyle
567aa5f366
Move to implementation subdir
2019-06-03 14:53:33 +01:00
Peter Boyle
2ab7e2b175
Force instantiation in .cc files.
Eventually move into multiple files
2019-06-03 14:52:59 +01:00
Peter Boyle
6f61be044d
Don't instantiate in header
2019-06-03 14:52:01 +01:00
Peter Boyle
269e00509e
Don't instantiate in header
2019-06-03 14:51:24 +01:00
Peter Boyle
a5e90b0ddc
Making the kernels more GPU happy
2019-06-03 14:50:54 +01:00
Peter Boyle
5622faf226
pragma once ifdef guard
2019-06-03 14:50:26 +01:00
Michael Marshall
eb737daeb5
Merge branch 'develop' into feature/distil
* develop: (34 commits)
Hadrons: EMLepton: Wall source
Revert "cleaning up Kl2 contraction"
cleaning up Kl2 contraction
possibility to save/load schedules directly from the application parameters
moving VERSION file to the empty ChangeLog one, this creates compilation problems with #include <version> in recent versions of LLVM and case-insensitive FS (typically macOS)
Added precision tuning to Hadrons parameterfile writing
Kl2 QED cleanup
Added ZFIMPL to SeqGamma
Added ZFIMPL to SeqConserved module
F1 ensemble running with ~96% acceptance etc.
Make detection of HPE 8600 automatic
Added variables that were missing from wall source setup
Exposed a coulomb/landau enum to the gauge fixing module
Coulomb gauge added as an option
More logging, timing, and 4d/5d logic for eigpack gauge transforms
Added gauge transform option to eigpack IO
Hadrons: Lepton Propagator for kl2, sign swap for antiperiodic boundary
A2A Lepton-Meson Field contraction
Verbose
Iteration range fix
...
2019-05-31 18:20:43 +01:00
9a34edcf9f
Kl2 QED cleanup
2019-05-23 13:43:22 +01:00
e675c6a48c
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-23 12:41:54 +01:00
Peter Boyle
918e673078
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-05-22 09:57:02 +01:00
Peter Boyle
44b53c3ba2
F1 ensemble running with ~96% acceptance etc.
2019-05-22 09:56:26 +01:00
ae5ad986e2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-19 14:35:46 +01:00
Peter Boyle
ee6f96d85c
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
2019-05-18 19:06:20 +01:00