Peter Boyle
|
6e2e904a0e
|
NVCC compiles happy. Start to develop strategy for writing generic
code for GPU kernels and CPU kernels.
|
2019-06-04 20:46:35 +01:00 |
|
Peter Boyle
|
d92a17f359
|
Suppress NVCC warnings in pugixml with pragma
|
2019-06-04 20:45:53 +01:00 |
|
Peter Boyle
|
47c063f984
|
Remove Ls Vec cases from benchmarks
|
2019-06-04 20:45:35 +01:00 |
|
Peter Boyle
|
7e27a5213a
|
Tests builds clean.
|
2019-06-04 20:45:20 +01:00 |
|
Peter Boyle
|
ade4a126da
|
Getting closer on the GPU port, but will start deleting 5th dim vectorised variants
for code maintainability
|
2019-06-04 11:53:44 +01:00 |
|
Peter Boyle
|
7b59ab5bd7
|
Compiling after reorganisation
|
2019-06-03 15:46:26 +01:00 |
|
Peter Boyle
|
fcd8cfe257
|
Gparity in
|
2019-06-03 15:45:09 +01:00 |
|
Peter Boyle
|
b4b53812cb
|
Move implementation to specific implementation headers
|
2019-06-03 15:43:01 +01:00 |
|
Peter Boyle
|
085cac583f
|
Implementation in header
|
2019-06-03 15:42:36 +01:00 |
|
Peter Boyle
|
25e3b8640c
|
Move to header
|
2019-06-03 15:42:05 +01:00 |
|
Peter Boyle
|
44bbec50b0
|
Making GPU compile happy
|
2019-06-03 14:57:04 +01:00 |
|
Peter Boyle
|
ec68b67d5d
|
Attempt at unified GPU and CPU kernel
|
2019-06-03 14:55:51 +01:00 |
|
Peter Boyle
|
778450e0c8
|
Move to implementation subdir
|
2019-06-03 14:53:56 +01:00 |
|
Peter Boyle
|
567aa5f366
|
Move to implementation subdir
|
2019-06-03 14:53:33 +01:00 |
|
Peter Boyle
|
2ab7e2b175
|
Force instantiation in .cc files.
Eventually move into multiple files
|
2019-06-03 14:52:59 +01:00 |
|
Peter Boyle
|
6f61be044d
|
Dont instantiate in header
|
2019-06-03 14:52:01 +01:00 |
|
Peter Boyle
|
269e00509e
|
Don't instantiate in header
|
2019-06-03 14:51:24 +01:00 |
|
Peter Boyle
|
a5e90b0ddc
|
Making the kernels more GPU happy
|
2019-06-03 14:50:54 +01:00 |
|
Peter Boyle
|
5622faf226
|
pragma once ifdef guard
|
2019-06-03 14:50:26 +01:00 |
|
Peter Boyle
|
82ecd520c7
|
Macos happy fix under nvcc
|
2019-06-03 14:48:50 +01:00 |
|
|
b120ef1fe4
|
Merge pull request #217 from guelpers/feature/EMlepwall
Hadrons: EMLepton: Wall source
|
2019-05-30 11:13:27 +02:00 |
|
|
166feb6483
|
Hadrons: EMLepton: Wall source
|
2019-05-30 10:07:08 +01:00 |
|
|
e35e8da111
|
Revert "cleaning up Kl2 contraction"
This reverts commit f244fed6ab .
|
2019-05-29 11:23:17 +02:00 |
|
Peter Boyle
|
ffde81f22a
|
Nsimd() and coalesced support
|
2019-05-25 12:44:07 +01:00 |
|
Peter Boyle
|
d8098f1ecd
|
coalesced support
|
2019-05-25 12:43:31 +01:00 |
|
Peter Boyle
|
aca788cf4f
|
Move coalesced read into tensors
|
2019-05-25 12:43:00 +01:00 |
|
|
f244fed6ab
|
cleaning up Kl2 contraction
|
2019-05-24 13:08:35 +01:00 |
|
|
9b3701ae27
|
posibility to save/load schedules directly from the application parameters
|
2019-05-24 13:08:20 +01:00 |
|
|
4ac27340b9
|
moving VERSION file to the empty ChangeLog one, this create compilation problems with #include <version> in recent versions of LLVM and case-insensitive FS (typically macOS)
|
2019-05-24 13:05:17 +01:00 |
|
|
c7c0a1065f
|
Merge pull request #214 from guelpers/feature/kl2QEDseq
Kl2 contraction with sequential propagators
|
2019-05-23 20:31:41 +01:00 |
|
|
80947130f9
|
Merge pull request #215 from fionnoh/develop
Added precision tuning to Hadrons parameterfile writing
|
2019-05-23 18:44:58 +01:00 |
|
fionnoh
|
0aee73ea6b
|
Added precision tuning to Hadrons parameterfile writing
|
2019-05-23 18:43:25 +01:00 |
|
|
0290ee1f6d
|
Merge pull request #213 from fionnoh/develop
Added ZFIMPL to SeqConserved module
|
2019-05-23 13:46:02 +01:00 |
|
|
9a34edcf9f
|
Kl2 QED cleanup
|
2019-05-23 13:43:22 +01:00 |
|
fionnoh
|
246f10001e
|
Added ZFIMPL to SeqGamma
|
2019-05-23 12:42:40 +01:00 |
|
|
e675c6a48c
|
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
|
2019-05-23 12:41:54 +01:00 |
|
fionnoh
|
a66d110b88
|
Added ZFIMPL to SeqConserved module
|
2019-05-23 11:49:54 +01:00 |
|
Peter Boyle
|
918e673078
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2019-05-22 09:57:02 +01:00 |
|
Peter Boyle
|
44b53c3ba2
|
F1 ensemble running with 96%~ acceptance etc..
|
2019-05-22 09:56:26 +01:00 |
|
Peter Boyle
|
2095c12eac
|
Make detection of HPE 8600 automatic
|
2019-05-22 09:54:21 +01:00 |
|
Peter Boyle
|
a0e9f3b0a0
|
Plan for GPU port
|
2019-05-20 09:46:19 +01:00 |
|
|
ae5ad986e2
|
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
|
2019-05-19 14:35:46 +01:00 |
|
Peter Boyle
|
a9342c6ae5
|
Udpdate TODO afer gianluc marge
|
2019-05-18 22:58:25 +01:00 |
|
Peter Boyle
|
ee6f96d85c
|
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
|
2019-05-18 19:06:20 +01:00 |
|
Peter Boyle
|
77ca45ff49
|
Merge pull request #211 from fionnoh/develop
Enum for gaugefix and bug fix for wall source
|
2019-05-18 18:57:52 +01:00 |
|
Peter Boyle
|
4e9df9e93c
|
GPU patches
|
2019-05-18 17:43:11 +01:00 |
|
Peter Boyle
|
9fe68857a9
|
Runs multiGPU with coalesced access on tesseract
|
2019-05-18 17:42:41 +01:00 |
|
Peter Boyle
|
37336c9e0c
|
Allow compress to be either vector or scalar types
|
2019-05-18 17:41:13 +01:00 |
|
Peter Boyle
|
6c4da3bbc7
|
Stencil now runs with coalesced accesses
|
2019-05-18 17:40:35 +01:00 |
|
Peter Boyle
|
a584b16c4a
|
Adding a non-blocking kernel launch
|
2019-05-18 17:39:54 +01:00 |
|