Peter Boyle
|
2095c12eac
|
Make detection of HPE 8600 automatic
|
2019-05-22 09:54:21 +01:00 |
|
Peter Boyle
|
a0e9f3b0a0
|
Plan for GPU port
|
2019-05-20 09:46:19 +01:00 |
|
|
ae5ad986e2
|
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
|
2019-05-19 14:35:46 +01:00 |
|
Peter Boyle
|
a9342c6ae5
|
Udpdate TODO afer gianluc marge
|
2019-05-18 22:58:25 +01:00 |
|
Peter Boyle
|
ee6f96d85c
|
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
|
2019-05-18 19:06:20 +01:00 |
|
Peter Boyle
|
77ca45ff49
|
Merge pull request #211 from fionnoh/develop
Enum for gaugefix and bug fix for wall source
|
2019-05-18 18:57:52 +01:00 |
|
Peter Boyle
|
4e9df9e93c
|
GPU patches
|
2019-05-18 17:43:11 +01:00 |
|
Peter Boyle
|
9fe68857a9
|
Runs multiGPU with coalesced access on tesseract
|
2019-05-18 17:42:41 +01:00 |
|
Peter Boyle
|
37336c9e0c
|
Allow compress to be either vector or scalar types
|
2019-05-18 17:41:13 +01:00 |
|
Peter Boyle
|
6c4da3bbc7
|
Stencil now runs with coalesced accesses
|
2019-05-18 17:40:35 +01:00 |
|
Peter Boyle
|
a584b16c4a
|
Adding a non-blocking kernel launch
|
2019-05-18 17:39:54 +01:00 |
|
fionnoh
|
dbd7f3f0fc
|
Added variables that were missing from wall source setup
|
2019-05-17 19:10:09 +01:00 |
|
fionnoh
|
d14512ee03
|
Exposed a coulomb/landau enum to the gauge fixing module
|
2019-05-17 19:01:52 +01:00 |
|
Peter Boyle
|
48b1c806ed
|
Coulomb gauge added as an option
|
2019-05-17 17:36:32 +01:00 |
|
|
0a8b6724ef
|
Merge pull request #209 from fionnoh/develop
Added gauge transform option to eigpack IO
|
2019-05-15 18:09:44 +02:00 |
|
fionnoh
|
ce102ac550
|
More logging, timing, and 4d/5d logic for eigpack gauge transforms
|
2019-05-15 14:31:25 +01:00 |
|
fionnoh
|
94accec311
|
Added gauge transform option to eigpack IO
|
2019-05-15 13:35:47 +01:00 |
|
gfilaci
|
1a82533d22
|
fix inner product with thrust reduction
|
2019-05-14 15:35:54 +01:00 |
|
gfilaci
|
e3c56fd9b3
|
CayleyZeroCounters before benchmark loop
|
2019-05-13 15:52:00 +01:00 |
|
gfilaci
|
955cc7790f
|
MooeeInvDag offloaded to GPU
|
2019-05-13 14:25:29 +01:00 |
|
gfilaci
|
1179123ac2
|
MooeeInv offloaded to GPU
|
2019-05-13 12:37:12 +01:00 |
|
|
d8512b03f8
|
Merge pull request #195 from nils-asmussen/fix_GaugeProp_4d
MFermion::GaugeProp fix for 4d fields
|
2019-05-12 21:31:18 +02:00 |
|
|
d90cf9d022
|
Merge pull request #207 from fionnoh/develop
Weak Hamiltonian and contraction bug fixes
|
2019-05-12 21:30:20 +02:00 |
|
|
79e930ba12
|
Hadrons: Lepton Propagator for kl2, sign swap for antiperiodic boundary
|
2019-05-10 12:46:18 +01:00 |
|
gfilaci
|
22e35c9ddd
|
M5Ddag offloaded to GPU
|
2019-05-10 12:23:39 +01:00 |
|
gfilaci
|
698b45e163
|
remove unused typedef
|
2019-05-09 11:19:39 +01:00 |
|
gfilaci
|
f1744b3f01
|
M5D offloaded to GPU
|
2019-05-09 11:17:55 +01:00 |
|
gfilaci
|
2b3c22f03d
|
bandwidth dependent on grid default precision
|
2019-05-08 12:01:11 +01:00 |
|
gfilaci
|
8423a05940
|
duplicate CayleyFermion5D for gpu
|
2019-05-08 11:51:37 +01:00 |
|
fionnoh
|
2acd8ece65
|
Hadron WeakEye and A2ALoop bug fixes, and WWVVContraction bug fix
|
2019-05-08 10:57:36 +01:00 |
|
fionnoh
|
b638509c61
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2019-05-08 10:51:04 +01:00 |
|
|
edeb590818
|
DiskVector: fix of memory bug triggering segfault when the cache is accessed following a certain pattern
|
2019-05-03 17:09:47 +01:00 |
|
gfilaci
|
d9438627d9
|
M5D benchmark without vector copy overhead
|
2019-05-02 11:10:57 +01:00 |
|
gfilaci
|
b23305dbe2
|
fix M5D flop count
|
2019-05-02 11:08:21 +01:00 |
|
gfilaci
|
d3b5c02e2d
|
measure M5D bandwidth and fix M5D flop count
|
2019-05-02 11:02:39 +01:00 |
|
gfilaci
|
8b6541fb60
|
Fix gpu MultRealPart and MaddRealPart bug
|
2019-05-02 10:58:17 +01:00 |
|
gfilaci
|
6da9aa9971
|
replace std::vector with Vector in benchmark
|
2019-05-02 10:56:22 +01:00 |
|
gfilaci
|
44e0360b97
|
replace std::vector with Vector
|
2019-05-02 10:55:36 +01:00 |
|
gfilaci
|
9003c4a07c
|
allocator copy constructor (to be fixed)
|
2019-05-02 10:53:37 +01:00 |
|
gfilaci
|
b52fa38f8c
|
seed initialisation of RNG5
|
2019-05-02 10:36:09 +01:00 |
|
gfilaci
|
3f1c4d8789
|
fix comment hash
|
2019-05-02 10:24:36 +01:00 |
|
|
4f0631615f
|
A2A Lepton-Meson Field contraction
|
2019-04-30 12:04:59 +01:00 |
|
|
c2cd0e15d7
|
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
Conflicts:
Grid/qcd/action/fermion/DomainWallFermion.h
Grid/qcd/action/fermion/FermionOperator.h
|
2019-04-29 12:07:20 +01:00 |
|
Peter Boyle
|
60330e05a3
|
NVCC wacky compiler options frozen. Possibly Cuda 9.2 specific
|
2019-04-28 07:39:33 +01:00 |
|
Peter Boyle
|
f9b8c0cccf
|
Vector changes for UVM
|
2019-04-28 07:38:57 +01:00 |
|
Peter Boyle
|
3cad67e569
|
Compile on tesseract
|
2019-04-28 07:38:09 +01:00 |
|
Peter Boyle
|
170ba4e619
|
Ensure different MPI ranks use different GPUs. The mapping works on Tesseract.
|
2019-04-28 07:32:30 +01:00 |
|
Peter Boyle
|
204a090497
|
Inner product is not working on GPU. Why?
|
2019-04-28 07:31:56 +01:00 |
|
Peter Boyle
|
3c717c47ef
|
GPU no compile on Wilson Multigrid fixed
|
2019-04-28 07:31:19 +01:00 |
|
fionnoh
|
df41de4cb6
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2019-04-24 12:02:50 +01:00 |
|