Peter Boyle
57396fc595
Simplify code
2019-06-04 20:56:23 +01:00
Peter Boyle
a2e199df50
Simplifying Cayley cases.
2019-06-04 20:54:52 +01:00
Peter Boyle
c2625a127e
Non blocking loop. Want to change the naming here.
2019-06-04 20:52:59 +01:00
Peter Boyle
8794d35c78
GPU
2019-06-04 20:52:27 +01:00
Peter Boyle
24bff6dbe6
Minor improvements
2019-06-04 20:51:48 +01:00
Peter Boyle
45b15d10d3
GPU happy changes
2019-06-04 20:49:16 +01:00
Peter Boyle
33d6bbe32b
GPU must use accelerator vectors
2019-06-04 20:48:52 +01:00
Peter Boyle
7a1569bd46
Annoying, cannot rely on equivalence of Grid ComplexD and Eigen Complex type on GPU.
Solve with ComplexD typecasts but there must be a better way
2019-06-04 20:47:49 +01:00
Peter Boyle
6e2e904a0e
NVCC compiles happy. Start to develop strategy for writing generic code for GPU kernels and CPU kernels.
2019-06-04 20:46:35 +01:00
Peter Boyle
d92a17f359
Suppress NVCC warnings in pugixml with pragma
2019-06-04 20:45:53 +01:00
Peter Boyle
ade4a126da
Getting closer on the GPU port, but will start deleting 5th dim vectorised variants for code maintainability
2019-06-04 11:53:44 +01:00
Peter Boyle
7b59ab5bd7
Compiling after reorganisation
2019-06-03 15:46:26 +01:00
Peter Boyle
fcd8cfe257
Gparity in
2019-06-03 15:45:09 +01:00
Peter Boyle
b4b53812cb
Move implementation to specific implementation headers
2019-06-03 15:43:01 +01:00
Peter Boyle
085cac583f
Implementation in header
2019-06-03 15:42:36 +01:00
Peter Boyle
25e3b8640c
Move to header
2019-06-03 15:42:05 +01:00
Michael Marshall
c81d3d422d
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:25:05 +01:00
Michael Marshall
54edb9906e
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:20:46 +01:00
Peter Boyle
44bbec50b0
Making GPU compile happy
2019-06-03 14:57:04 +01:00
Peter Boyle
ec68b67d5d
Attempt at unified GPU and CPU kernel
2019-06-03 14:55:51 +01:00
Peter Boyle
778450e0c8
Move to implementation subdir
2019-06-03 14:53:56 +01:00
Peter Boyle
567aa5f366
Move to implementation subdir
2019-06-03 14:53:33 +01:00
Peter Boyle
2ab7e2b175
Force instantiation in .cc files.
Eventually move into multiple files
2019-06-03 14:52:59 +01:00
Peter Boyle
6f61be044d
Don't instantiate in header
2019-06-03 14:52:01 +01:00
Peter Boyle
269e00509e
Don't instantiate in header
2019-06-03 14:51:24 +01:00
Peter Boyle
a5e90b0ddc
Making the kernels more GPU happy
2019-06-03 14:50:54 +01:00
Peter Boyle
5622faf226
pragma once ifdef guard
2019-06-03 14:50:26 +01:00
Peter Boyle
82ecd520c7
Macos happy fix under nvcc
2019-06-03 14:48:50 +01:00
Michael Marshall
eb737daeb5
Merge branch 'develop' into feature/distil
* develop: (34 commits)
Hadrons: EMLepton: Wall source
Revert "cleaning up Kl2 contraction"
cleaning up Kl2 contraction
possibility to save/load schedules directly from the application parameters
moving VERSION file to the empty ChangeLog one, this create compilation problems with #include <version> in recent versions of LLVM and case-insensitive FS (typically macOS)
Added precision tuning to Hadrons parameterfile writing
Kl2 QED cleanup
Added ZFIMPL to SeqGamma
Added ZFIMPL to SeqConserved module
F1 ensemble running with 96%~ acceptance etc..
Make detection of HPE 8600 automatic
Added variables that were missing from wall source setup
Exposed a coulomb/landau enum to the gauge fixing module
Coulomb gauge added as an option
More logging, timing, and 4d/5d logic for eigpack gauge transforms
Added gauge transform option to eigpack IO
Hadrons: Lepton Propagator for kl2, sign swap for antiperiodic boundary
A2A Lepton-Meson Field contraction
Verbose
Iteration range fix
...
2019-05-31 18:20:43 +01:00
Peter Boyle
ffde81f22a
Nsimd() and coalesced support
2019-05-25 12:44:07 +01:00
Peter Boyle
d8098f1ecd
coalesced support
2019-05-25 12:43:31 +01:00
Peter Boyle
aca788cf4f
Move coalesced read into tensors
2019-05-25 12:43:00 +01:00
9a34edcf9f
Kl2 QED cleanup
2019-05-23 13:43:22 +01:00
e675c6a48c
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-23 12:41:54 +01:00
Peter Boyle
918e673078
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-05-22 09:57:02 +01:00
Peter Boyle
44b53c3ba2
F1 ensemble running with 96%~ acceptance etc..
2019-05-22 09:56:26 +01:00
Peter Boyle
2095c12eac
Make detection of HPE 8600 automatic
2019-05-22 09:54:21 +01:00
ae5ad986e2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-19 14:35:46 +01:00
Peter Boyle
ee6f96d85c
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
2019-05-18 19:06:20 +01:00
Peter Boyle
9fe68857a9
Runs multiGPU with coalesced access on tesseract
2019-05-18 17:42:41 +01:00
Peter Boyle
37336c9e0c
Allow compress to be either vector or scalar types
2019-05-18 17:41:13 +01:00
Peter Boyle
6c4da3bbc7
Stencil now runs with coalesced accesses
2019-05-18 17:40:35 +01:00
Peter Boyle
a584b16c4a
Adding a non-blocking kernel launch
2019-05-18 17:39:54 +01:00
Peter Boyle
48b1c806ed
Coulomb gauge added as an option
2019-05-17 17:36:32 +01:00
Felix Erben
435653490e
fixed contraction issue
2019-05-17 10:50:15 +01:00
Felix Erben
acd5a01b65
some work on baryons
2019-05-16 15:11:50 +01:00
gfilaci
1a82533d22
fix inner product with thrust reduction
2019-05-14 15:35:54 +01:00
Michael Marshall
ec7d96ce3b
Merge branch 'develop' into feature/distil
* develop:
Hadron WeakEye and A2ALoop bug fixes, and WWVVContraction bug fix
DiskVector: fix of memory bug triggering segfault when the cache is accessed following a certain pattern
MFermion::GaugeProp fix for 4d fields
2019-05-14 13:10:40 +01:00
gfilaci
955cc7790f
MooeeInvDag offloaded to GPU
2019-05-13 14:25:29 +01:00
gfilaci
1179123ac2
MooeeInv offloaded to GPU
2019-05-13 12:37:12 +01:00