paboyle
|
38e2a32ac9
|
Single SIMD lane operations for CUDA
|
2018-06-27 21:28:06 +01:00 |
|
paboyle
|
efa84ca50a
|
Keep Cuda 9.1 happy
|
2018-06-27 21:27:32 +01:00 |
|
paboyle
|
5e96d6d04c
|
Keep CUDA happy
|
2018-06-27 21:27:11 +01:00 |
|
paboyle
|
df30bdc599
|
CUDA happy
|
2018-06-27 21:26:49 +01:00 |
|
paboyle
|
7f45222924
|
Diagnostics on memory alloc fail
|
2018-06-27 21:26:20 +01:00 |
|
paboyle
|
dd891f5e3b
|
Use NVCC to suppress device Eigen
|
2018-06-27 21:25:17 +01:00 |
|
|
d2c42e6f42
|
Hadrons: scaled DWF action
|
2018-06-26 14:59:33 +01:00 |
|
Daniel Richtmann
|
2881b3e8e5
|
WilsonMG: Remove unnecessary static assertions
|
2018-06-26 14:42:30 +02:00 |
|
|
049cc518f4
|
Hadrons: introduction message 2
|
2018-06-25 19:08:39 +01:00 |
|
|
2e1c66897f
|
Hadrons: introduction message
|
2018-06-25 19:08:22 +01:00 |
|
|
adcef36189
|
Hadrons: Möbius DWF action
|
2018-06-25 15:58:35 +01:00 |
|
fionnoh
|
2f121c41c9
|
Commiting reation of meson field code before a merge with the upstream branch feature/hadrons
|
2018-06-25 12:20:46 +01:00 |
|
|
e0ed7e300f
|
Hadrons: spurious Dminus removed
|
2018-06-22 16:33:43 +02:00 |
|
|
485207901b
|
Merge branch 'develop' into feature/hadrons
|
2018-06-22 16:15:32 +02:00 |
|
|
c760f0a4c3
|
Hadrons: remove make_5D/4D functions and FreeProp fix
|
2018-06-22 16:12:46 +02:00 |
|
|
c84eeedec3
|
Hadrons: GaugeProp module for z-Wilson actions
|
2018-06-22 15:53:22 +02:00 |
|
fionnoh
|
1ac3526f33
|
Small changes to the A2A header and module
|
2018-06-22 12:29:42 +01:00 |
|
fionnoh
|
0de090ee74
|
Temporarily added in the contraction code that produced the working 2-pt function. This is commited for reference only and will be removed in the next push.
|
2018-06-22 12:28:41 +01:00 |
|
|
91405de3f7
|
Hadrons: new solver exposing fermion matrix and generic source/solve import/export
|
2018-06-22 12:14:37 +02:00 |
|
fionnoh
|
8fccda301a
|
Fixed a bug where the guess was always subtracted after the solve and included appropriate weights for the sources in the one case we're looking at now. More work needs to be done to make the 5d/4d source logic less brittle.
|
2018-06-21 16:36:59 +01:00 |
|
fionnoh
|
7a0abfac89
|
Restructured the class that computes and returns the A2A vectors.
|
2018-06-21 16:36:06 +01:00 |
|
fionnoh
|
ae37fda699
|
A more elegant way to subtract guesses from solve and a bool check before verifying residual
|
2018-06-20 16:07:40 +01:00 |
|
fionnoh
|
b5fc5e2030
|
All to all module update that hit a promising milestone. Commiting for a reference for future changes.
|
2018-06-20 10:59:07 +01:00 |
|
Daniel Richtmann
|
cc5d025ea4
|
WilsonMG: Adapt staggered GMRES/MR tests to "new" constructor
|
2018-06-18 16:20:20 +02:00 |
|
paboyle
|
6c97a6a071
|
Coalescing version of the kernel
|
2018-06-13 20:52:29 +01:00 |
|
paboyle
|
73bb2d5128
|
Ugly hack to speed up compile on GPU; we don't use the hand kernels on GPU anyway so why compile
|
2018-06-13 20:35:28 +01:00 |
|
paboyle
|
b710fec6ea
|
Gpu code first version of specialised kernel
|
2018-06-13 20:34:39 +01:00 |
|
paboyle
|
b2a8cd60f5
|
Doubled gauge field is useful
|
2018-06-13 20:27:47 +01:00 |
|
paboyle
|
867ee364ab
|
Explicit instantiation hooks
|
2018-06-13 20:27:12 +01:00 |
|
paboyle
|
25becc9324
|
GPU tweaks for benchmarking; really necessary?
|
2018-06-13 20:26:07 +01:00 |
|
paboyle
|
94d1ae4c82
|
Some prep work for GPU shared memory. Need to be careful, as will try GPU direct
RDMA and inter-GPU memory sharing on SUmmit later
|
2018-06-13 20:24:06 +01:00 |
|
paboyle
|
2075b177ef
|
CUDA_ARCH more carefule treatment
|
2018-06-13 20:22:34 +01:00 |
|
paboyle
|
847c761ccc
|
Move sfw IEEE fp16 into central location
|
2018-06-13 20:22:01 +01:00 |
|
paboyle
|
8287ed8383
|
New GPU vector targets
|
2018-06-13 20:21:35 +01:00 |
|
paboyle
|
e6be7416f4
|
Use managed memory
|
2018-06-13 20:14:00 +01:00 |
|
paboyle
|
26863b6d95
|
User Managed memory
|
2018-06-13 20:13:42 +01:00 |
|
paboyle
|
ebd730bd54
|
Adding 2D loops
|
2018-06-13 20:13:01 +01:00 |
|
paboyle
|
066be31a3b
|
Optional GPU target SIMD types; work in progress and trying experiments
|
2018-06-13 20:07:55 +01:00 |
|
paboyle
|
7a4c142955
|
Add GPU specific simd targets
|
2018-06-13 19:55:30 +01:00 |
|
Daniel Richtmann
|
ddcb53bce2
|
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
|
2018-06-13 09:50:37 +02:00 |
|
Daniel Richtmann
|
d1c80e1d46
|
WilsonMG: Correct years in copyright line
|
2018-06-13 09:44:09 +02:00 |
|
Daniel Richtmann
|
c73cc7d354
|
WilsonMG: Add tests with MG preconditioner running single precision, outer solver running in double
|
2018-06-12 16:10:48 +02:00 |
|
Daniel Richtmann
|
49fdc324a0
|
WilsonMG: Make MG correctness checks abort on failing tests
|
2018-06-12 16:10:48 +02:00 |
|
Daniel Richtmann
|
f32714a2d1
|
WilsonMG: Make running MG correctness checks optional via commandline
|
2018-06-12 16:10:48 +02:00 |
|
Daniel Richtmann
|
73a955be20
|
WilsonMG: Move tests for Wilson & WilsonClover into separate files
|
2018-06-12 16:10:48 +02:00 |
|
Daniel Richtmann
|
66b7a0f871
|
WilsonMG: Move multigrid class to separate file
|
2018-06-12 16:10:48 +02:00 |
|
Daniel Richtmann
|
2ab9d4bc56
|
WilsonMG: Fix random behavior in GMRES
From time to time I saw random since the basis vectors were not initialized
properly.
|
2018-06-12 15:01:31 +02:00 |
|
Daniel Richtmann
|
4f41cd114d
|
WilsonMG: Add a mixed precision version of FGMRES
This version does everything in double prec but accepts a preconditioner working
in single precision.
|
2018-06-12 15:01:31 +02:00 |
|
Daniel Richtmann
|
11c4f5e32c
|
WilsonMG: Provide command line switch for reading in input xml + move default params to constructor of MultiGridParams
|
2018-06-12 15:01:31 +02:00 |
|
Daniel Richtmann
|
e9b9550298
|
WilsonMG: Fix incompatibility with single prec MG in construction of simd layout on coarser grids
|
2018-06-12 15:01:31 +02:00 |
|