Peter Boyle
|
d954595922
|
SYCL optimised hand-unrolled kernels and const functor patches.
|
2021-02-09 11:39:39 -05:00 |
|
Peter Boyle
|
1ac13ec3a7
|
Merge pull request #338 from paboyle/bugfix/maxnorm2
Fixed compile issues with maxLocalNorm2 for non-scalar lattices
|
2021-02-08 12:08:11 -05:00 |
|
Christopher Kelly
|
55de69a569
|
Fixed compile issues with maxLocalNorm2 for non-scalar lattices
maxLocalNorm2 test now reuses the random field
|
2021-02-08 12:03:16 -05:00 |
|
Peter Boyle
|
eda9ab487b
|
MADWF 5d source option for hadrons - look at Grid of source
Abort on GPU error
|
2021-02-08 10:47:22 -05:00 |
|
Peter Boyle
|
cd99edcc5f
|
maxLocalNorm2()
|
2021-02-04 18:25:49 -05:00 |
|
Peter Boyle
|
69f1f04f74
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-01-21 21:39:59 -05:00 |
|
Peter Boyle
|
11a5fd09d6
|
Hot config
|
2021-01-21 21:39:41 -05:00 |
|
Peter Boyle
|
ff1fa98808
|
Fix for GPU conserved current
|
2021-01-21 21:38:23 -05:00 |
|
Peter Boyle
|
b0339bc5a4
|
Merge branch 'feature/conjugate-bc-dirs' into develop
|
2021-01-15 09:28:39 -05:00 |
|
Peter Boyle
|
3c23a947cc
|
Fixed test for very much non-unit det
|
2021-01-15 09:16:02 -05:00 |
|
Peter Boyle
|
56111bb823
|
Merge branch 'develop' into feature/conjugate-bc-dirs
|
2021-01-14 21:01:22 -05:00 |
|
Peter Boyle
|
99445673f6
|
Gparity fix, and plaquette IO
|
2021-01-14 21:00:36 -05:00 |
|
Peter Boyle
|
97a59643f7
|
Red black coarse space
|
2021-01-14 20:49:13 -05:00 |
|
Peter Boyle
|
579595f547
|
Red black on coarse space
|
2021-01-14 20:48:35 -05:00 |
|
Peter Boyle
|
281ac5fc12
|
Red black support on coarse
|
2021-01-14 20:48:08 -05:00 |
|
Peter Boyle
|
d8fa903b02
|
G5 on coarse spaces
|
2021-01-14 20:47:28 -05:00 |
|
Peter Boyle
|
eaff0f3aeb
|
Gamma5 on coarse spaces
|
2021-01-14 20:46:58 -05:00 |
|
Peter Boyle
|
e8e20c01b2
|
Coarsened vector test
|
2021-01-14 20:46:21 -05:00 |
|
Peter Boyle
|
a4afc3ea2a
|
Red black coarse space
|
2021-01-14 20:44:16 -05:00 |
|
Peter Boyle
|
3fe75bc7cb
|
Merge pull request #329 from nmeyer-ur/feature/a64fx-3
Revised dslash/dwf kernels for A64FX
|
2020-12-20 08:17:15 -05:00 |
|
Nils Meyer
|
45d49d8648
|
clean up
|
2020-12-19 03:35:18 +01:00 |
|
Nils Meyer
|
6013183361
|
removed Asm impls
|
2020-12-19 03:25:01 +01:00 |
|
Nils Meyer
|
4b882e8056
|
fixed lost bracket
|
2020-12-19 03:09:20 +01:00 |
|
Nils Meyer
|
3f9ae6e7e7
|
Merge branch 'develop' into feature/a64fx-3
|
2020-12-19 02:37:11 +01:00 |
|
Nils Meyer
|
909acd55cd
|
vnum variant for prefetches
|
2020-12-19 02:00:22 +01:00 |
|
Nils Meyer
|
4dd9e39e0d
|
up to +36% performance gain for dslash/dwf on QPACE 4 using GCC 10.1.1
|
2020-12-19 00:54:31 +01:00 |
|
Peter Boyle
|
7adb253e25
|
Merge pull request #328 from mmphys/feature/mrespatch
Enable existing conserved current code for CUDA
|
2020-12-17 11:10:29 -05:00 |
|
Michael Marshall
|
873519e960
|
Enable existing conserved current code for CUDA (compiles OK for CUDA 10.1). Add option to Test_cayley_mres to load a configuration
|
2020-12-14 16:06:10 +00:00 |
|
Peter Boyle
|
9aec4a3c26
|
SYCL
|
2020-12-10 02:11:17 -08:00 |
|
Peter Boyle
|
70510d151b
|
Merge pull request #327 from paboyle/feature/gparity_twist_GPU
Feature/gparity twist gpu
|
2020-12-07 12:02:20 -05:00 |
|
Christopher Kelly
|
9e7bacb5a4
|
Merge branch 'develop' into feature/gparity_twist_GPU
|
2020-12-07 11:55:39 -05:00 |
|
Christopher Kelly
|
2ef1fa66a8
|
Improved performance of G-parity kernel for GPUs by simplifying multLink implementation
|
2020-12-07 11:53:35 -05:00 |
|
Peter Boyle
|
cf76741ec6
|
Intel DPCPP Gold happy now (compiles all, runs Benchmark_dwf_fp32)
|
2020-12-03 03:47:11 -08:00 |
|
Peter Boyle
|
497e7c1c40
|
Duplicate code
|
2020-12-02 17:55:30 -08:00 |
|
Peter Boyle
|
888eacd3b8
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-11-24 21:46:33 -05:00 |
|
Peter Boyle
|
321f0f51b5
|
Project to SU(N)
|
2020-11-24 21:46:10 -05:00 |
|
Peter Boyle
|
30ad9578a2
|
Merge branch 'lehner-feature/gpt' into develop
|
2020-11-24 06:10:24 -05:00 |
|
Peter Boyle
|
9dce101586
|
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into lehner-feature/gpt
|
2020-11-24 06:10:16 -05:00 |
|
Peter Boyle
|
97e264d0ff
|
Christoph's changes
|
2020-11-23 15:46:11 +00:00 |
|
Peter Boyle
|
683a5e5bf5
|
Stencil use host vector for integer table on enable-shared=no and mirror it on device
|
2020-11-23 15:39:51 +00:00 |
|
Peter Boyle
|
d4861a362c
|
Stencil use non-UVM memory for look up table on enable-shared=no
|
2020-11-23 15:38:49 +00:00 |
|
Peter Boyle
|
5ff3eae027
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-11-20 13:14:44 -05:00 |
|
Peter Boyle
|
147dc15d26
|
Update
|
2020-11-20 13:13:59 -05:00 |
|
Christoph Lehner
|
c61ea72949
|
Merge pull request #19 from paboyle/develop
Sync
|
2020-11-20 17:31:13 +01:00 |
|
Peter Boyle
|
86e8b9fe38
|
ALLOC_ALIGN removed
|
2020-11-20 17:07:16 +01:00 |
|
Peter Boyle
|
612e468889
|
Configurable ALLOC_ALIGN and ALLOC_CACHE
|
2020-11-20 16:48:28 +01:00 |
|
Christoph Lehner
|
4ea8d128c2
|
Merge pull request #18 from paboyle/develop
Sync
|
2020-11-20 15:36:50 +01:00 |
|
Peter Boyle
|
e49b7f2f88
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-11-19 19:24:41 +01:00 |
|
Peter Boyle
|
aace3d47b9
|
partial work in progress
|
2020-11-19 19:24:14 +01:00 |
|
Peter Boyle
|
d5049949a4
|
Starting to fix reunitarise
|
2020-11-19 19:23:41 +01:00 |
|