nils meyer
6504a098cc
999 GiB/s Wilson; 694 GiB/s DW (DP)
2020-04-15 15:06:52 +02:00
nils meyer
79a385faca
disabled armclang hotfix cause armclang 20.0 performance gets a little
2020-04-15 11:46:55 +02:00
nils meyer
c12a67030a
980 GiB/s Wilson; 680 GiB/s DW (DP)
2020-04-15 10:55:06 +02:00
nils meyer
581392f2f2
now with pf, best results so far using intrinsics+pf
2020-04-12 22:06:14 +02:00
nils meyer
113f277b6a
enable dslash asm using -DA64FXASM, additionally -DDSLASHINTRIN for intrinsics impl
2020-04-11 04:55:01 +02:00
nils meyer
974586bedc
Dslash finally works; cleaned up; uses MOVPRFX in assembly
2020-04-10 22:26:40 +02:00
nmeyer-ur
160f78c1e4
changed debug output to variable direct 3
2020-04-10 12:23:07 +02:00
nmeyer-ur
7e4e1bbbc2
changed debug output to variable direct 2
2020-04-10 12:22:04 +02:00
nmeyer-ur
e699b7e9f9
changed debug output to variable direct
2020-04-10 12:18:30 +02:00
nmeyer-ur
a28bc0de90
debug register address test in WilsonHand
2020-04-10 12:07:45 +02:00
nmeyer-ur
14d0fe4d6c
added predication in WilsonHand
2020-04-10 12:04:00 +02:00
nmeyer-ur
0ad2e0815c
debug output in WilsonHand
2020-04-10 11:56:29 +02:00
nils meyer
1c8ca05e16
Merge branch 'feature/a64fx-2' of https://github.com/nmeyer-ur/Grid into feature/a64fx-2
2020-04-09 23:32:19 +02:00
nils meyer
dc9c8340bb
switched to DSLASHINTRIN for A64FX Dslash intrinsics
2020-04-09 23:30:23 +02:00
nils meyer
19eef97503
specialized A64FX Dslash kernels
2020-04-09 23:25:25 +02:00
nmeyer-ur
635246ce50
corrected typo
2020-04-09 21:42:50 +02:00
nils meyer
5cdbb7e71e
fixed A64FX Dslash; compiles, but does not specialize -> assertion
2020-04-09 21:23:39 +02:00
nmeyer-ur
8123590a1b
changes
2020-04-09 16:45:47 +02:00
nmeyer-ur
86c9c4da8b
changes
2020-04-09 16:40:06 +02:00
nmeyer-ur
cd1efee866
changes
2020-04-09 16:35:13 +02:00
nmeyer-ur
bd310932f7
changes
2020-04-09 16:32:31 +02:00
nmeyer-ur
304762e7ac
changes
2020-04-09 16:26:01 +02:00
nmeyer-ur
d79ab03a6c
changes
2020-04-09 16:19:25 +02:00
nmeyer-ur
d5708e0eb2
more changes
2020-04-09 15:43:34 +02:00
nmeyer-ur
123f6b7a61
more changes
2020-04-09 15:17:19 +02:00
nmeyer-ur
2b6457dd9a
added xp/xm recon accum
2020-04-09 15:13:19 +02:00
nmeyer-ur
b367cbd422
defined ADD_RESULT
2020-04-09 15:08:45 +02:00
nmeyer-ur
e252c1aca3
addressing
2020-04-09 15:03:12 +02:00
nmeyer-ur
b140c6a4f9
addressing
2020-04-09 15:01:15 +02:00
nmeyer-ur
326de36467
revised sU addressing scheme
2020-04-09 14:44:25 +02:00
nmeyer-ur
9f224a1647
fixed typo in single
2020-04-09 14:30:21 +02:00
nmeyer-ur
bb46ba9b5f
fixed array size in single
2020-04-09 14:28:45 +02:00
nmeyer-ur
dd5a22b36b
revised declarations
2020-04-09 14:21:27 +02:00
nmeyer-ur
1ea85b9972
Disabled build message
2020-04-09 13:47:21 +02:00
nmeyer-ur
8fb63f1c25
added A64FX Wilson kernels single precision
2020-04-09 13:41:04 +02:00
nmeyer-ur
77fa586f6c
introduced A64FX Wilson kernels
2020-04-09 13:30:06 +02:00
nmeyer-ur
15238e8d5e
reduce acle works, clean up
2020-04-03 20:40:44 +02:00
nmeyer-ur
b27e31957a
reduce acle revised
2020-04-03 19:46:15 +02:00
nmeyer-ur
46927771e3
reduce acle still needs overhaul
2020-04-03 19:30:48 +02:00
nmeyer-ur
d8cea77707
define simd width in header
2020-04-03 19:22:25 +02:00
nmeyer-ur
5f8a76d490
clean up, reduction in acle
2020-04-03 19:18:24 +02:00
nmeyer-ur
28d49a3b60
build problem resolved
2020-04-03 16:52:48 +02:00
nmeyer-ur
b4c624ece6
added A64FX support
2020-04-03 15:43:23 +02:00
05ebc458e2
Merge pull request #260 from mmphys/feature/distil
Distillation: save eigenvalues of the Laplacian for all timeslices
2020-03-13 14:00:21 +00:00
Michael Marshall
3753508957
Making change 1) as simple as possible 2) as much like MSink/Point.hpp as possible
2020-03-12 13:47:51 +00:00
Michael Marshall
c1677fccf6
Merge branch 'develop' into feature/distil
* develop:
bugfix ZPerambulator
registered module supporting ZMobius action
changed to push_back according to request
Added Hadrons_Error in case blockSize is set too large
bugfix in perambulator module
# Conflicts:
# Hadrons/Modules/MDistil/Perambulator.hpp
2020-03-12 12:45:18 +00:00
35e8e31749
Merge pull request #272 from mmphys/feature/ZPeramb
bugfix ZPerambulator
2020-03-12 12:28:04 +00:00
34813e9b04
Merge branch 'develop' into feature/ZPeramb
2020-03-12 12:27:56 +00:00
Felix Erben
373cf61abb
bugfix ZPerambulator
2020-03-12 11:44:43 +00:00
4e8fbc4b49
Merge pull request #271 from mmphys/feature/ZDistil
registered module supporting ZMobius action
2020-03-12 10:54:07 +00:00