1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-05-04 17:34:13 +01:00
Commit Graph

3616 Commits

Author SHA1 Message Date
paboyle ec9939c1ba Test for faster implementation of meson field inner loop
This should be possible to cache block at outer levels, global sum across nodes not performed
and deferred to caller to block them all into a big all reduce.
Nc=3 and Fermion is hard coded in an ugly way. We might think about benchmarking whether
a product without the conjugate should be made available by Grid.

It is not clear whether the explicit unroll, or the performing of conjugate on left once
was the real source of the speed up.

Gives 70-80 GF/s on my laptop (single) half that double, and 70GB/s to cache.

This is competitive with dslash and a reasonable stopping point for the optimisation. If necessary we can revisit.
2018-07-10 12:38:51 +01:00
Guido Cossu 95d4b46446 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-06-08 11:30:29 +01:00
paboyle 5dfd216a34 Better thread safety 2018-06-04 21:08:44 +01:00
paboyle c2e8d0aa88 Solve g++ problem on the lanczos test 2018-06-04 18:34:15 +01:00
paboyle bf96a4bdbf Merge branch 'master' into develop 2018-06-04 14:03:11 +01:00
paboyle 84685c9bc3 Overflow fix 2018-06-04 13:42:07 +01:00
Azusa Yamaguchi 013ea4e8d1 Merge branch 'feature/staggered-comms-compute' into develop 2018-05-21 13:11:56 +01:00
Azusa Yamaguchi 7fbbb31a50 Merge branch 'develop' into feature/staggered-comms-compute
Conflicts:
	lib/qcd/action/fermion/ImprovedStaggeredFermion.cc
2018-05-21 13:07:29 +01:00
Azusa Yamaguchi 0e127b1fc7 New file single prec test 2018-05-21 12:57:13 +01:00
Azusa Yamaguchi 68c028b0a6 Comment 2018-05-21 12:54:25 +01:00
Guido Cossu 7836cc2d74 No checksum output on log for scidac 2018-05-15 10:10:08 +01:00
portelli a61e0df54b Travis fix for Lime 2018-05-14 19:56:12 +01:00
portelli f871fb0c6d check file is opened correctly in the Lime reader 2018-05-11 18:06:28 +01:00
Guido Cossu 25d1cadd3b Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-05-07 18:55:09 +01:00
Guido Cossu c24d53bbd1 Further debug of RNG I/O 2018-05-07 18:55:05 +01:00
portelli 3c7a4106ed Trap for deadly empty comm thread option 2018-05-07 17:26:39 +01:00
paboyle 6eed167f0c Merge branch 'release/0.8.1' 2018-05-04 17:34:11 +01:00
paboyle 4ad0df6fde Bump volume for Gerardo 0.8.1 2018-05-04 17:33:23 +01:00
Peter Boyle 68a5079f33 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-05-04 14:13:54 +01:00
Peter Boyle 8634e19f1b Update 2018-05-04 14:13:35 +01:00
Azusa Yamaguchi 9ada378e38 Add timing 2018-05-04 10:58:01 +01:00
portelli bfbf2f1fa0 no threaded stencil benchmark if OpenMP is not supported 2018-05-03 16:20:01 +01:00
Azusa Yamaguchi 587bfcc0f4 Add Timing 2018-05-03 12:10:31 +01:00
Peter Boyle 8c658de179 Compressor speed up (a little); streaming stores 2018-05-02 17:52:16 +01:00
Guido Cossu ba37d51ee9 Debugging the RNG IO 2018-05-02 15:32:06 +01:00
Azusa Yamaguchi 4f4181c54a Merge branch 'feature/staggered-comms-compute' of https://github.com/paboyle/Grid into feature/staggered-comms-compute 2018-05-02 14:59:13 +01:00
Guido Cossu 4d4ac2517b Adding Scalar field theory example for Scidac format 2018-05-02 14:36:32 +01:00
Guido Cossu e568c24d1d Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-05-02 14:29:25 +01:00
Guido Cossu b458326744 Checkpointer module update 2018-05-02 14:29:22 +01:00
Guido Cossu 6e7d5e2243 HMC: added Scidac checkpointer and support for metadata 2018-05-02 14:28:59 +01:00
Azusa Yamaguchi b35169f1dd MultiShift for Staggered 2018-05-02 14:22:37 +01:00
Azusa Yamaguchi 441ad7498d add Iterative counter 2018-05-02 14:21:30 +01:00
Peter Boyle 6f6c5c549a Split off gparity 2018-05-02 14:11:23 +01:00
Peter Boyle 1584e17b54 Revert to fast versoin 2018-05-02 14:10:55 +01:00
Peter Boyle 12982a4455 Hypercube optimisation 2018-05-02 14:10:21 +01:00
Peter Boyle 172f412102 shmget reintroduce 2018-05-02 14:07:41 +01:00
Peter Boyle a64497265d TIming 2018-05-02 14:07:28 +01:00
Peter Boyle c45f24a1b5 Improvements for tesseract 2018-04-30 21:50:00 +01:00
Dr Peter Boyle aaf37ee4d7 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-04-27 11:45:13 +01:00
Dr Peter Boyle 1dddd17e3c Benchmark improvements from tesseract 2018-04-27 11:44:46 +01:00
paboyle 661f1d3e8e Merge branch 'release/0.8.0' into develop 2018-04-27 11:22:33 +01:00
paboyle edcf9b9293 Merge branch 'release/0.8.0' 2018-04-27 11:13:19 +01:00
paboyle fe6860b4dd Update with LIME library guard v0.8.0 2018-04-27 08:57:34 +01:00
paboyle d6406b13e1 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-04-27 07:52:56 +01:00
paboyle e369d7306d Rename 2018-04-27 07:51:44 +01:00
paboyle 9f8d63e104 Roll over version 2018-04-27 07:51:12 +01:00
paboyle 9b0240d101 Hot start test 2018-04-27 07:50:51 +01:00
paboyle b27f0e5a53 Control over IO 2018-04-27 07:50:15 +01:00
paboyle 75e4483407 Stronger convergence test 2018-04-27 07:49:57 +01:00
Guido Cossu 0734e9ddd4 Debugging Scatter_plane_simple 2018-04-27 14:39:01 +09:00