Peter Boyle
|
8e81a811d0
|
Merge branch 'feature/hdcr' into develop
|
2020-04-10 11:14:49 -04:00 |
|
Peter Boyle
|
aa13118127
|
Missing conjugate already fixed in develop
|
2020-04-10 11:11:24 -04:00 |
|
Peter Boyle
|
6cdb09c884
|
Faster copy region
|
2020-04-10 11:10:52 -04:00 |
|
Peter Boyle
|
a65bc64f10
|
Accelerator peek poke
|
2020-04-10 11:09:59 -04:00 |
|
Peter Boyle
|
11dec4883c
|
Don't throw assert
|
2020-04-10 11:09:11 -04:00 |
|
Peter Boyle
|
afa458c812
|
Extra solvers
|
2020-04-10 11:08:19 -04:00 |
|
Peter Boyle
|
dc50190b8f
|
Faster GPU basis rotation
May need to later include Regensburg optimised CPU variant
|
2020-04-10 11:06:04 -04:00 |
|
nmeyer-ur
|
160f78c1e4
|
changed debug output to variable direct 3
|
2020-04-10 12:23:07 +02:00 |
|
nmeyer-ur
|
7e4e1bbbc2
|
changed debug output to variable direct 2
|
2020-04-10 12:22:04 +02:00 |
|
nmeyer-ur
|
e699b7e9f9
|
changed debug output to variable direct
|
2020-04-10 12:18:30 +02:00 |
|
nmeyer-ur
|
a28bc0de90
|
debug register address test in WilsonHand
|
2020-04-10 12:07:45 +02:00 |
|
nmeyer-ur
|
14d0fe4d6c
|
added predication in WilsonHand
|
2020-04-10 12:04:00 +02:00 |
|
nmeyer-ur
|
0ad2e0815c
|
debug output in WilsonHand
|
2020-04-10 11:56:29 +02:00 |
|
nils meyer
|
1c8ca05e16
|
Merge branch 'feature/a64fx-2' of https://github.com/nmeyer-ur/Grid into feature/a64fx-2
|
2020-04-09 23:32:19 +02:00 |
|
nils meyer
|
dc9c8340bb
|
switched to DSLASHINTRIN for A64FX Dslash intrinsics
|
2020-04-09 23:30:23 +02:00 |
|
nils meyer
|
19eef97503
|
specialized A64FX Dslash kernels
|
2020-04-09 23:25:25 +02:00 |
|
nmeyer-ur
|
635246ce50
|
corrected typo
|
2020-04-09 21:42:50 +02:00 |
|
nils meyer
|
5cdbb7e71e
|
fixed A64FX Dslash; compiles, but does not specialize -> assertion
|
2020-04-09 21:23:39 +02:00 |
|
nmeyer-ur
|
8123590a1b
|
changes
|
2020-04-09 16:45:47 +02:00 |
|
nmeyer-ur
|
86c9c4da8b
|
changes
|
2020-04-09 16:40:06 +02:00 |
|
nmeyer-ur
|
cd1efee866
|
changes
|
2020-04-09 16:35:13 +02:00 |
|
nmeyer-ur
|
bd310932f7
|
changes
|
2020-04-09 16:32:31 +02:00 |
|
nmeyer-ur
|
304762e7ac
|
changes
|
2020-04-09 16:26:01 +02:00 |
|
nmeyer-ur
|
d79ab03a6c
|
changes
|
2020-04-09 16:19:25 +02:00 |
|
nmeyer-ur
|
d5708e0eb2
|
more changes
|
2020-04-09 15:43:34 +02:00 |
|
nmeyer-ur
|
123f6b7a61
|
more changes
|
2020-04-09 15:17:19 +02:00 |
|
nmeyer-ur
|
2b6457dd9a
|
added xp/xm recon accum
|
2020-04-09 15:13:19 +02:00 |
|
nmeyer-ur
|
b367cbd422
|
defined ADD_RESULT
|
2020-04-09 15:08:45 +02:00 |
|
nmeyer-ur
|
e252c1aca3
|
addressing
|
2020-04-09 15:03:12 +02:00 |
|
nmeyer-ur
|
b140c6a4f9
|
addressing
|
2020-04-09 15:01:15 +02:00 |
|
nmeyer-ur
|
326de36467
|
revised sU addressing scheme
|
2020-04-09 14:44:25 +02:00 |
|
nmeyer-ur
|
9f224a1647
|
fixed typo in single
|
2020-04-09 14:30:21 +02:00 |
|
nmeyer-ur
|
bb46ba9b5f
|
fixed array size in single
|
2020-04-09 14:28:45 +02:00 |
|
nmeyer-ur
|
dd5a22b36b
|
revised declarations
|
2020-04-09 14:21:27 +02:00 |
|
nmeyer-ur
|
1ea85b9972
|
Disabled build message
|
2020-04-09 13:47:21 +02:00 |
|
nmeyer-ur
|
8fb63f1c25
|
added A64FX Wilson kernels single precision
|
2020-04-09 13:41:04 +02:00 |
|
nmeyer-ur
|
77fa586f6c
|
introduced A64FX Wilson kernels
|
2020-04-09 13:30:06 +02:00 |
|
Christoph Lehner
|
96e8e44fd4
|
Merge pull request #2 from DanielRichtmann/feature/fused-innerproduct-norm2
Fused innerProduct + norm2 on first argument operation
|
2020-04-06 13:16:58 +02:00 |
|
Daniel Richtmann
|
5fc8a273e7
|
Fused innerProduct + norm2 on first argument operation
|
2020-04-06 11:52:29 +02:00 |
|
|
d671a63e78
|
Update README.md
|
2020-04-03 19:52:15 +01:00 |
|
nmeyer-ur
|
15238e8d5e
|
reduce acle works, clean up
|
2020-04-03 20:40:44 +02:00 |
|
nmeyer-ur
|
b27e31957a
|
reduce acle revised
|
2020-04-03 19:46:15 +02:00 |
|
nmeyer-ur
|
46927771e3
|
reduce acle still needs overhaul
|
2020-04-03 19:30:48 +02:00 |
|
nmeyer-ur
|
d8cea77707
|
define simd width in header
|
2020-04-03 19:22:25 +02:00 |
|
nmeyer-ur
|
5f8a76d490
|
clean up, reduction in acle
|
2020-04-03 19:18:24 +02:00 |
|
nmeyer-ur
|
28d49a3b60
|
build problem resolved
|
2020-04-03 16:52:48 +02:00 |
|
nmeyer-ur
|
b4c624ece6
|
added A64FX support
|
2020-04-03 15:43:23 +02:00 |
|
|
2c22db841a
|
Added momentum scaling to scalar HMC theories in order to follow UKQCD/CPS conventions
|
2020-04-02 17:38:47 +01:00 |
|
Christoph Lehner
|
856d168e41
|
global sum over vectors of uint64_t
|
2020-03-29 07:56:05 -04:00 |
|
|
6235c7ba98
|
IPP path fix in configure
|
2020-03-27 17:23:29 +00:00 |
|