fionnoh
|
8fccda301a
|
Fixed a bug where the guess was always subtracted after the solve and included appropriate weights for the sources in the one case we're looking at now. More work needs to be done to make the 5d/4d source logic less brittle.
|
2018-06-21 16:36:59 +01:00 |
|
fionnoh
|
7a0abfac89
|
Restructured the class that computes and returns the A2A vectors.
|
2018-06-21 16:36:06 +01:00 |
|
fionnoh
|
ae37fda699
|
A more elegant way to subtract guesses from solve and a bool check before verifying residual
|
2018-06-20 16:07:40 +01:00 |
|
fionnoh
|
b5fc5e2030
|
All to all module update that hit a promising milestone. Commiting for a reference for future changes.
|
2018-06-20 10:59:07 +01:00 |
|
paboyle
|
6c97a6a071
|
Coalescing version of the kernel
|
2018-06-13 20:52:29 +01:00 |
|
paboyle
|
73bb2d5128
|
Ugly hack to speed up compile on GPU; we don't use the hand kernels on GPU anyway so why compile
|
2018-06-13 20:35:28 +01:00 |
|
paboyle
|
b710fec6ea
|
Gpu code first version of specialised kernel
|
2018-06-13 20:34:39 +01:00 |
|
paboyle
|
b2a8cd60f5
|
Doubled gauge field is useful
|
2018-06-13 20:27:47 +01:00 |
|
paboyle
|
867ee364ab
|
Explicit instantiation hooks
|
2018-06-13 20:27:12 +01:00 |
|
paboyle
|
25becc9324
|
GPU tweaks for benchmarking; really necessary?
|
2018-06-13 20:26:07 +01:00 |
|
paboyle
|
94d1ae4c82
|
Some prep work for GPU shared memory. Need to be careful, as will try GPU direct
RDMA and inter-GPU memory sharing on SUmmit later
|
2018-06-13 20:24:06 +01:00 |
|
paboyle
|
2075b177ef
|
CUDA_ARCH more carefule treatment
|
2018-06-13 20:22:34 +01:00 |
|
paboyle
|
847c761ccc
|
Move sfw IEEE fp16 into central location
|
2018-06-13 20:22:01 +01:00 |
|
paboyle
|
8287ed8383
|
New GPU vector targets
|
2018-06-13 20:21:35 +01:00 |
|
paboyle
|
e6be7416f4
|
Use managed memory
|
2018-06-13 20:14:00 +01:00 |
|
paboyle
|
26863b6d95
|
User Managed memory
|
2018-06-13 20:13:42 +01:00 |
|
paboyle
|
ebd730bd54
|
Adding 2D loops
|
2018-06-13 20:13:01 +01:00 |
|
paboyle
|
066be31a3b
|
Optional GPU target SIMD types; work in progress and trying experiments
|
2018-06-13 20:07:55 +01:00 |
|
paboyle
|
7a4c142955
|
Add GPU specific simd targets
|
2018-06-13 19:55:30 +01:00 |
|
|
8db0ef9736
|
Merge pull request #168 from jch1g10/feature/qed-fvol
Feature/qed fvol
|
2018-06-08 20:09:06 +02:00 |
|
Guido Cossu
|
95d4b46446
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2018-06-08 11:30:29 +01:00 |
|
paboyle
|
5dfd216a34
|
Better thread safety
|
2018-06-04 21:08:44 +01:00 |
|
paboyle
|
c2e8d0aa88
|
Solve g++ problem on the lanczos test
|
2018-06-04 18:34:15 +01:00 |
|
James Harrison
|
0fe5aeffbb
|
Merge branch 'feature/hadrons' into feature/qed-fvol
|
2018-06-04 16:59:43 +01:00 |
|
James Harrison
|
7fbc469046
|
Merge branch 'develop' into feature/hadrons
|
2018-06-04 16:58:30 +01:00 |
|
paboyle
|
bf96a4bdbf
|
Merge branch 'master' into develop
|
2018-06-04 14:03:11 +01:00 |
|
paboyle
|
84685c9bc3
|
Overflow fix
|
2018-06-04 13:42:07 +01:00 |
|
fionnoh
|
a8d4156997
|
Added a Hadrons module that computes the all-to-all v and w vectors
|
2018-05-31 17:18:58 +01:00 |
|
fionnoh
|
c18074869b
|
Changes to Hadrons SchurRB solver to allow for a subtract_guess boolean to be passed
|
2018-05-31 17:17:16 +01:00 |
|
fionnoh
|
f4c6d39238
|
CHanges made to SchurRB solvers to allow for the subtraction of a guess after solve
|
2018-05-31 17:16:20 +01:00 |
|
|
200d35b38a
|
Merge branch 'develop' into feature/hadrons
|
2018-05-28 11:52:47 +02:00 |
|
|
eb52e84d09
|
Merge branch 'feature/hadrons' of github.com:paboyle/Grid into feature/hadrons
|
2018-05-28 11:50:27 +02:00 |
|
|
72abc34764
|
Merge pull request #166 from guelpers/feature/hadrons
Feature/hadrons
|
2018-05-28 11:49:46 +02:00 |
|
|
e3164d4c7b
|
Hadrons: env function to get volume in double
|
2018-05-28 11:39:17 +02:00 |
|
James Harrison
|
f5db386c55
|
Change MODULE_REGISTER_NS -> MODULE_REGISTER in UnitEM, ScalarVP and VPCounterTerms
|
2018-05-22 16:16:21 +01:00 |
|
James Harrison
|
294ee70a7a
|
Merge branch 'feature/hadrons' into feature/qed-fvol
# Conflicts:
# extras/Hadrons/modules.inc
# lib/qcd/action/gauge/Photon.h
|
2018-05-21 18:02:41 +01:00 |
|
Azusa Yamaguchi
|
013ea4e8d1
|
Merge branch 'feature/staggered-comms-compute' into develop
|
2018-05-21 13:11:56 +01:00 |
|
Azusa Yamaguchi
|
7fbbb31a50
|
Merge branch 'develop' into feature/staggered-comms-compute
Conflicts:
lib/qcd/action/fermion/ImprovedStaggeredFermion.cc
|
2018-05-21 13:07:29 +01:00 |
|
Azusa Yamaguchi
|
0e127b1fc7
|
New file single prec test
|
2018-05-21 12:57:13 +01:00 |
|
Azusa Yamaguchi
|
68c028b0a6
|
Comment
|
2018-05-21 12:54:25 +01:00 |
|
|
255d4992e1
|
Hadrons: stochastic scalar SU(N) free field fix
|
2018-05-18 20:49:55 +01:00 |
|
|
a0d399e5ce
|
Hadrons: yet other attempts at EMT NPR
|
2018-05-18 20:49:26 +01:00 |
|
|
fd3b2e945a
|
Hadrons: don't right result with empty stem
|
2018-05-18 20:48:24 +01:00 |
|
|
b999984501
|
Merge branch 'develop' into feature/hadrons
|
2018-05-15 13:53:57 +01:00 |
|
Guido Cossu
|
7836cc2d74
|
No checksum output on log for scidac
|
2018-05-15 10:10:08 +01:00 |
|
Peter Boyle
|
eb7d34a4cc
|
GPU version
|
2018-05-14 19:41:47 -04:00 |
|
Peter Boyle
|
aab27a655a
|
Start of GPU kernels
|
2018-05-14 19:41:17 -04:00 |
|
Peter Boyle
|
93280bae85
|
Gpu option
|
2018-05-14 19:40:58 -04:00 |
|
Peter Boyle
|
c5f93abcd7
|
GPU clean up
|
2018-05-14 19:40:33 -04:00 |
|
Peter Boyle
|
d5deef782d
|
Useful debug comments
|
2018-05-14 19:39:52 -04:00 |
|