Christoph Lehner
|
019ffe17d4
|
Allow for GPU vector width beyond 64
|
2021-02-02 11:32:23 +01:00 |
|
|
bc496dd844
|
change back benchmark_ITT
|
2021-01-28 14:29:56 +00:00 |
|
|
a673b6a54d
|
prettify
|
2021-01-28 14:15:09 +00:00 |
|
|
1bf2e4d187
|
Merge branch 'develop' into gpu/baryons
|
2021-01-27 21:17:37 +00:00 |
|
Peter Boyle
|
96dd7a8fbd
|
Flop cout matches DiRAC-ITT-2020
|
2021-01-27 21:14:52 +00:00 |
|
|
7905afa9f5
|
revert changes
|
2021-01-27 21:14:52 +00:00 |
|
|
712bb40650
|
merge develop
|
2021-01-27 21:14:52 +00:00 |
|
|
81d88d9f4d
|
fixes
|
2021-01-27 21:09:51 +00:00 |
|
Michael Marshall
|
77063418da
|
Fix issue for GPU by ensuring accelerator_inline version of convertType is available for Grid::complex<T>. This removes many warnings in Hadrons
Simplify the SFINAE syntax and correct convertType for iScalar
|
2021-01-25 15:09:36 +00:00 |
|
Michael Marshall
|
2983b6fdf6
|
Optional (superficial) changes to make comparison with Hadrons WardIdentity module easier: use Schur solver; example of Hadrons random gauge init; logging updates; only solve reverse propagator if provided
|
2021-01-23 12:41:48 +00:00 |
|
Peter Boyle
|
69f1f04f74
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-01-21 21:39:59 -05:00 |
|
Peter Boyle
|
11a5fd09d6
|
Hot config
|
2021-01-21 21:39:41 -05:00 |
|
Peter Boyle
|
ff1fa98808
|
Fix for GPU conserveed current
|
2021-01-21 21:38:23 -05:00 |
|
|
df16202865
|
weird bug in 2pt function...
|
2021-01-19 19:25:27 +00:00 |
|
|
3ff7c2c02a
|
Merge branch 'develop' into gpu/baryons
|
2021-01-19 12:34:13 +00:00 |
|
|
fc6d07897f
|
revert changes
|
2021-01-19 12:32:48 +00:00 |
|
|
f9c8e5c8ef
|
Merge branch 'develop' of github.com:paboyle/Grid into develop
|
2021-01-19 12:30:29 +00:00 |
|
|
8bfa0e74f8
|
final version, tested on CPU and GPU
|
2021-01-19 12:27:57 +00:00 |
|
|
9b73a937e7
|
bugfix
|
2021-01-18 18:57:05 +00:00 |
|
Peter Boyle
|
b0339bc5a4
|
Merge branch 'feature/conjugate-bc-dirs' into develop
|
2021-01-15 09:28:39 -05:00 |
|
Peter Boyle
|
3c23a947cc
|
Fixed test for very much non-unit det
|
2021-01-15 09:16:02 -05:00 |
|
Peter Boyle
|
56111bb823
|
Merge branch 'develop' into feature/conjugate-bc-dirs
|
2021-01-14 21:01:22 -05:00 |
|
Peter Boyle
|
99445673f6
|
Gparity fix, and plaquette IO
|
2021-01-14 21:00:36 -05:00 |
|
Peter Boyle
|
97a59643f7
|
Red black coarse space
|
2021-01-14 20:49:13 -05:00 |
|
Peter Boyle
|
579595f547
|
Red black on coarse space
|
2021-01-14 20:48:35 -05:00 |
|
Peter Boyle
|
281ac5fc12
|
Red black support on coars
|
2021-01-14 20:48:08 -05:00 |
|
Peter Boyle
|
d8fa903b02
|
G5 on coarse spaces
|
2021-01-14 20:47:28 -05:00 |
|
Peter Boyle
|
eaff0f3aeb
|
Gamma5 on coaree spaces
|
2021-01-14 20:46:58 -05:00 |
|
Peter Boyle
|
e8e20c01b2
|
Coarsened vector test
|
2021-01-14 20:46:21 -05:00 |
|
Peter Boyle
|
a4afc3ea2a
|
Red black coarse space
|
2021-01-14 20:44:16 -05:00 |
|
|
fa12b9a329
|
bugfix
|
2021-01-13 10:04:17 +00:00 |
|
|
45fc7ded3a
|
test for sum
|
2021-01-12 09:10:37 +00:00 |
|
|
74de2d9742
|
whitespace changes
|
2021-01-08 18:28:36 +00:00 |
|
|
e759367d42
|
tested and working
|
2021-01-08 18:04:50 +00:00 |
|
Christoph Lehner
|
299d0de066
|
Merge pull request #21 from paboyle/develop
Sync
|
2020-12-22 20:59:15 +01:00 |
|
Peter Boyle
|
3fe75bc7cb
|
Merge pull request #329 from nmeyer-ur/feature/a64fx-3
Revised dslash/dwf kernels for A64FX
|
2020-12-20 08:17:15 -05:00 |
|
Nils Meyer
|
45d49d8648
|
clean up
|
2020-12-19 03:35:18 +01:00 |
|
Nils Meyer
|
6013183361
|
removed Asm impls
|
2020-12-19 03:25:01 +01:00 |
|
Nils Meyer
|
4b882e8056
|
fixed lost bracket
|
2020-12-19 03:09:20 +01:00 |
|
Nils Meyer
|
3f9ae6e7e7
|
Merge branch 'develop' into feature/a64fx-3
|
2020-12-19 02:37:11 +01:00 |
|
Nils Meyer
|
909acd55cd
|
vnum variant for prefetches
|
2020-12-19 02:00:22 +01:00 |
|
Nils Meyer
|
4dd9e39e0d
|
up to +36% performance gain for dslash/dwf on QPACE 4 using GCC 10.1.1
|
2020-12-19 00:54:31 +01:00 |
|
Christoph Lehner
|
b4c1317ab4
|
Merge pull request #22 from DanielRichtmann/feature/clover-access-specifier
Clover access specifier
|
2020-12-18 16:20:19 +01:00 |
|
|
f36d6f3923
|
compiles on GPU. 3pt still wrong!!!!
|
2020-12-17 17:04:08 +00:00 |
|
Peter Boyle
|
7adb253e25
|
Merge pull request #328 from mmphys/feature/mrespatch
Enable existing conserved current code for CUDA
|
2020-12-17 11:10:29 -05:00 |
|
|
808f1e0e8c
|
merge develop
|
2020-12-15 16:33:29 +00:00 |
|
Michael Marshall
|
873519e960
|
Enable existing conserved current code for CUDA (compiles OK for CUDA 10.1). Add option to Test_cayley_mres to load a configuration
|
2020-12-14 16:06:10 +00:00 |
|
Peter Boyle
|
9aec4a3c26
|
SYCL
|
2020-12-10 02:11:17 -08:00 |
|
Daniel Richtmann
|
c438118fd7
|
Change access specifier of clover fields in order to allow deriving classes to access these
|
2020-12-08 14:42:11 +01:00 |
|
Peter Boyle
|
70510d151b
|
Merge pull request #327 from paboyle/feature/gparity_twist_GPU
Feature/gparity twist gpu
|
2020-12-07 12:02:20 -05:00 |
|