mirror of https://github.com/paboyle/Grid.git synced 2024-11-15 10:15:36 +00:00
Commit Graph

6744 Commits

Author SHA1 Message Date
Peter Boyle
aa13118127 Missing conjugate already fixed in develop 2020-04-10 11:11:24 -04:00
Peter Boyle
6cdb09c884 Faster copy region 2020-04-10 11:10:52 -04:00
Peter Boyle
a65bc64f10 Accelerator peek poke 2020-04-10 11:09:59 -04:00
Peter Boyle
11dec4883c Don't throw assert 2020-04-10 11:09:11 -04:00
Peter Boyle
afa458c812 Extra solvers 2020-04-10 11:08:19 -04:00
Peter Boyle
dc50190b8f Faster GPU basis rotation
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
nmeyer-ur
160f78c1e4 changed debug output to variable direct 3 2020-04-10 12:23:07 +02:00
nmeyer-ur
7e4e1bbbc2 changed debug output to variable direct 2 2020-04-10 12:22:04 +02:00
nmeyer-ur
e699b7e9f9 changed debug output to variable direct 2020-04-10 12:18:30 +02:00
nmeyer-ur
a28bc0de90 debug register address test in WilsonHand 2020-04-10 12:07:45 +02:00
nmeyer-ur
14d0fe4d6c added predication in WilsonHand 2020-04-10 12:04:00 +02:00
nmeyer-ur
0ad2e0815c debug output in WilsonHand 2020-04-10 11:56:29 +02:00
nils meyer
1c8ca05e16 Merge branch 'feature/a64fx-2' of https://github.com/nmeyer-ur/Grid into feature/a64fx-2 2020-04-09 23:32:19 +02:00
nils meyer
dc9c8340bb switched to DSLASHINTRIN for A64FX Dslash intrinsics 2020-04-09 23:30:23 +02:00
nils meyer
19eef97503 specialized A64FX Dslash kernels 2020-04-09 23:25:25 +02:00
nmeyer-ur
635246ce50 corrected typo 2020-04-09 21:42:50 +02:00
nils meyer
5cdbb7e71e fixed A64FX Dslash; compiles, but does not specialize -> assertion 2020-04-09 21:23:39 +02:00
nmeyer-ur
8123590a1b changes 2020-04-09 16:45:47 +02:00
nmeyer-ur
86c9c4da8b changes 2020-04-09 16:40:06 +02:00
nmeyer-ur
cd1efee866 changes 2020-04-09 16:35:13 +02:00
nmeyer-ur
bd310932f7 changes 2020-04-09 16:32:31 +02:00
nmeyer-ur
304762e7ac changes 2020-04-09 16:26:01 +02:00
nmeyer-ur
d79ab03a6c changes 2020-04-09 16:19:25 +02:00
nmeyer-ur
d5708e0eb2 more changes 2020-04-09 15:43:34 +02:00
nmeyer-ur
123f6b7a61 more changes 2020-04-09 15:17:19 +02:00
nmeyer-ur
2b6457dd9a added xp/xm recon accum 2020-04-09 15:13:19 +02:00
nmeyer-ur
b367cbd422 defined ADD_RESULT 2020-04-09 15:08:45 +02:00
nmeyer-ur
e252c1aca3 addressing 2020-04-09 15:03:12 +02:00
nmeyer-ur
b140c6a4f9 addressing 2020-04-09 15:01:15 +02:00
nmeyer-ur
326de36467 revised sU addressing scheme 2020-04-09 14:44:25 +02:00
nmeyer-ur
9f224a1647 fixed typo in single 2020-04-09 14:30:21 +02:00
nmeyer-ur
bb46ba9b5f fixed array size in single 2020-04-09 14:28:45 +02:00
nmeyer-ur
dd5a22b36b revised declarations 2020-04-09 14:21:27 +02:00
nmeyer-ur
1ea85b9972 Disabled build message 2020-04-09 13:47:21 +02:00
nmeyer-ur
8fb63f1c25 added A64FX Wilson kernels single precision 2020-04-09 13:41:04 +02:00
nmeyer-ur
77fa586f6c introduced A64FX Wilson kernels 2020-04-09 13:30:06 +02:00
Christoph Lehner
96e8e44fd4 Merge pull request #2 from DanielRichtmann/feature/fused-innerproduct-norm2
Fused innerProduct + norm2 on first argument operation
2020-04-06 13:16:58 +02:00
Daniel Richtmann
5fc8a273e7 Fused innerProduct + norm2 on first argument operation 2020-04-06 11:52:29 +02:00
d671a63e78 Update README.md 2020-04-03 19:52:15 +01:00
nmeyer-ur
15238e8d5e reduce acle works, clean up 2020-04-03 20:40:44 +02:00
nmeyer-ur
b27e31957a reduce acle revised 2020-04-03 19:46:15 +02:00
nmeyer-ur
46927771e3 reduce acle still needs overhaul 2020-04-03 19:30:48 +02:00
nmeyer-ur
d8cea77707 define simd width in header 2020-04-03 19:22:25 +02:00
nmeyer-ur
5f8a76d490 clean up, reduction in acle 2020-04-03 19:18:24 +02:00
nmeyer-ur
28d49a3b60 build problem resolved 2020-04-03 16:52:48 +02:00
nmeyer-ur
b4c624ece6 added A64FX support 2020-04-03 15:43:23 +02:00
2c22db841a Added momentum scaling to scalar HMC theories in order to follow UKQCD/CPS conventions 2020-04-02 17:38:47 +01:00
Christoph Lehner
856d168e41 global sum over vectors of uint64_t 2020-03-29 07:56:05 -04:00
6235c7ba98 IPP path fix in configure 2020-03-27 17:23:29 +00:00
7e13724882 removing Hadrons 2020-03-27 12:03:32 +00:00