Peter Boyle
|
4557a1365a
|
Remove Ls vectorised DWF
|
2019-06-04 20:59:59 +01:00 |
|
Peter Boyle
|
16e9b87d98
|
Remove Ls vectorised DWF as unused and hard to maintain
|
2019-06-04 20:59:01 +01:00 |
|
Peter Boyle
|
685eea3d0f
|
Small cosmetic
|
2019-06-04 20:58:14 +01:00 |
|
Peter Boyle
|
65b48831fb
|
Simplify code
|
2019-06-04 20:56:30 +01:00 |
|
Peter Boyle
|
57396fc595
|
Simplify code
|
2019-06-04 20:56:23 +01:00 |
|
Peter Boyle
|
a2e199df50
|
Simplifying Cayley cases.
|
2019-06-04 20:54:52 +01:00 |
|
Peter Boyle
|
ade4a126da
|
Getting closer on the GPU port, but will start deleting 5th dim vectorised variants
for code maintainability
|
2019-06-04 11:53:44 +01:00 |
|
Peter Boyle
|
7b59ab5bd7
|
Compiling after reorganisation
|
2019-06-03 15:46:26 +01:00 |
|
Peter Boyle
|
fcd8cfe257
|
Gparity in
|
2019-06-03 15:45:09 +01:00 |
|
Peter Boyle
|
b4b53812cb
|
Move implementation to specific implementation headers
|
2019-06-03 15:43:01 +01:00 |
|
Peter Boyle
|
085cac583f
|
Implementation in header
|
2019-06-03 15:42:36 +01:00 |
|
Peter Boyle
|
25e3b8640c
|
Move to header
|
2019-06-03 15:42:05 +01:00 |
|
Peter Boyle
|
44bbec50b0
|
Making GPU compile happy
|
2019-06-03 14:57:04 +01:00 |
|
Peter Boyle
|
ec68b67d5d
|
Attempt at unified GPU and CPU kernel
|
2019-06-03 14:55:51 +01:00 |
|
Peter Boyle
|
778450e0c8
|
Move to implementation subdir
|
2019-06-03 14:53:56 +01:00 |
|
Peter Boyle
|
567aa5f366
|
Move to implementation subdir
|
2019-06-03 14:53:33 +01:00 |
|
Peter Boyle
|
2ab7e2b175
|
Force instantiation in .cc files.
Eventually move into multiple files
|
2019-06-03 14:52:59 +01:00 |
|
Peter Boyle
|
6f61be044d
|
Dont instantiate in header
|
2019-06-03 14:52:01 +01:00 |
|
Peter Boyle
|
269e00509e
|
Don't instantiate in header
|
2019-06-03 14:51:24 +01:00 |
|
Peter Boyle
|
a5e90b0ddc
|
Making the kernels more GPU happy
|
2019-06-03 14:50:54 +01:00 |
|
Peter Boyle
|
5622faf226
|
pragma once ifdef guard
|
2019-06-03 14:50:26 +01:00 |
|
Peter Boyle
|
ee6f96d85c
|
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
|
2019-05-18 19:06:20 +01:00 |
|
Peter Boyle
|
9fe68857a9
|
Runs multiGPU with coalesced access on tesseract
|
2019-05-18 17:42:41 +01:00 |
|
gfilaci
|
955cc7790f
|
MooeeInvDag offloaded to GPU
|
2019-05-13 14:25:29 +01:00 |
|
gfilaci
|
1179123ac2
|
MooeeInv offloaded to GPU
|
2019-05-13 12:37:12 +01:00 |
|
gfilaci
|
22e35c9ddd
|
M5Ddag offloaded to GPU
|
2019-05-10 12:23:39 +01:00 |
|
gfilaci
|
698b45e163
|
remove unused typedef
|
2019-05-09 11:19:39 +01:00 |
|
gfilaci
|
f1744b3f01
|
M5D offloaded to GPU
|
2019-05-09 11:17:55 +01:00 |
|
gfilaci
|
2b3c22f03d
|
bandwidth dependent on grid default precision
|
2019-05-08 12:01:11 +01:00 |
|
gfilaci
|
8423a05940
|
duplicate CayleyFermion5D for gpu
|
2019-05-08 11:51:37 +01:00 |
|
gfilaci
|
b23305dbe2
|
fix M5D flop count
|
2019-05-02 11:08:21 +01:00 |
|
gfilaci
|
d3b5c02e2d
|
measure M5D bandwidth and fix M5D flop count
|
2019-05-02 11:02:39 +01:00 |
|
gfilaci
|
44e0360b97
|
replace std::vector with Vector
|
2019-05-02 10:55:36 +01:00 |
|
Peter Boyle
|
3cad67e569
|
Compile on tesseract
|
2019-04-28 07:38:09 +01:00 |
|
Peter Boyle
|
a4d9200293
|
Fixing AVX 512 instantiation error. Need to move to extern templates urgently.
|
2019-01-02 00:27:07 +00:00 |
|
Peter Boyle
|
38852737e4
|
No compile fix on clang
|
2019-01-01 15:55:13 +00:00 |
|
Peter Boyle
|
4bf86ae60a
|
NVCC clean up
|
2019-01-01 15:02:50 +00:00 |
|
Peter Boyle
|
07ee87ff5a
|
GPU happy. Still need to prevent hand kernels being callable under NVCC
|
2019-01-01 15:00:33 +00:00 |
|
Peter Boyle
|
35ed1defac
|
Passes make check now single and double compile
|
2018-12-19 11:09:32 +00:00 |
|
Peter Boyle
|
422764757d
|
Updates in tests to make all of Grid compile
|
2018-12-14 16:55:54 +00:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
Peter Boyle
|
f3f24b3017
|
Optional Twisted BC's added, in "DoubleStore" for WilsonImpl.
Untested but doesn't affect answers when twists are all zero. The zero is the default behaviour
for ImplParams.
|
2018-11-08 12:55:25 +00:00 |
|
Peter Boyle
|
6f421c7a6f
|
Block solver in the SchurRedBlack plus timing report cleaner
|
2018-11-07 12:26:56 +00:00 |
|
Peter Boyle
|
24c07694bc
|
Mixed precision now supported in MADWF
|
2018-10-14 00:22:52 +01:00 |
|
Peter Boyle
|
f0229025e2
|
MADWF working across a range of actions
|
2018-10-13 19:55:03 +01:00 |
|
Peter Boyle
|
49f25e08e8
|
PauliVillars based 4D -> 5D reconstruction with Fourier Accelerated PV inverse
by Christoph. Differs from the one by Rudy in BFM since it vectorises the twisted
4D solves in pairs.
|
2018-10-11 12:35:32 +01:00 |
|
Peter Boyle
|
b1c4e96382
|
Updates to actions etc..
|
2018-09-24 22:10:30 +01:00 |
|
|
fb7d021b9d
|
Hadrons: moving Hadrons to root directory, build system improvements
|
2018-08-28 15:00:40 +01:00 |
|