mirror of https://github.com/paboyle/Grid.git synced 2026-04-18 17:56:12 +01:00
Commit Graph

42 Commits

Author SHA1 Message Date
Azusa Yamaguchi d9cd4f0273 Staggered multinode block cg debugged. Missing global sum.
Code stalls and resumes on KNL at cambridge. Curious.

CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
paboyle 7e35286860 Simplified lanczos, added Eigen diagonalisation.
Curious if we can deprecate dependency on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle 0486ff8e79 Improved the Lanczos 2017-06-20 18:46:01 +01:00
Azusa Yamaguchi e9cc21900f Block solver complete for staggered. Now stable on mass 0.003 and
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.

166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.

Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi 0a8faac271 Fix make tests compile 2017-06-19 22:54:18 +01:00
Azusa Yamaguchi cfe3cd76d1 Block solver improvements 2017-06-19 14:04:21 +01:00
Guido Cossu 20999c1370 Merge branch 'develop' into feature/hmc_generalise 2017-05-05 12:47:17 +01:00
paboyle 33f0ed1a33 No compile fix 2017-05-05 11:04:30 +01:00
Guido Cossu 3344788fa1 Merge branch 'develop' into feature/hmc_generalise 2017-05-01 12:13:56 +01:00
paboyle 8e161152e4 MultiRHS solver improvements with slice operations moved into lattice and sped up.
Block solver requires a lot of performance work.
2017-04-18 10:51:55 +01:00
paboyle bf516c3b81 higher precision reduction variables in norm and inner product 2017-04-15 12:27:28 +01:00
paboyle d80d802f9d MultiRHS solver test 2017-04-10 00:12:12 +09:00
paboyle 3d99b09dba Start of blockCG 2017-04-09 23:42:10 +09:00
Guido Cossu 8c540333d5 Merge branch 'develop' into feature/hmc_generalise 2017-04-05 14:41:04 +01:00
paboyle 6af459cae4 Christoph's coefficients. 2017-03-31 17:07:43 +09:00
paboyle cb9a297a0a Chulwoo's Zmobius test 2017-03-30 13:30:25 +09:00
paboyle 4b17e8eba8 Merge branch 'develop' into feature/bgq-asm
Conflicts:
	lib/qcd/action/fermion/Fermion.h
	lib/qcd/action/fermion/WilsonFermion.cc
	lib/util/Init.cc
	tests/Test_cayley_even_odd_vec.cc
2017-03-28 04:49:30 -04:00
paboyle 18bde08d1b Merge branch 'feature/staggering' into develop 2017-03-28 15:25:55 +09:00
Chulwoo Jung 0b61f75c9e Adding ZMobius CG test 2017-03-13 00:12:43 -04:00
Guido Cossu 4e34132f4d Correcting modules use in test files 2017-03-10 23:54:53 +09:00
Guido Cossu 7270c6a150 Integrator works now 2017-02-24 17:03:42 +00:00
Guido Cossu 902afcfbaf Adding metric and the implicit steps 2017-02-21 11:30:57 +00:00
Guido Cossu 97a6b61551 Covariant laplacian and implicit integration 2017-02-20 11:17:27 +00:00
Guido Cossu bafb101e4f Testing different versions of the Laplacian 2017-02-13 15:38:11 +00:00
Guido Cossu 08fdf05528 Added and tested the covariant laplacian + CG solver 2017-02-13 15:05:01 +00:00
Guido Cossu c3d7ec65fa All tests compile. 2017-02-10 10:27:51 +00:00
Guido Cossu e0571c872b Merge branch 'develop' into feature/hmc_generalise 2017-02-09 16:12:00 +00:00
portelli a37e71f362 New automatic implementation of gamma matrices, Meson and SeqGamma are broken 2017-01-23 19:13:43 -08:00
Guido Cossu 5214846341 Adding a resource manager 2016-12-22 12:41:56 +00:00
Guido Cossu ce1a115e0b Removing redundant arguments for integrator functions, step 1 2016-12-20 17:51:30 +00:00
Guido Cossu af0ccdd8e9 Moving output order 2016-12-14 02:02:42 +00:00
Guido Cossu 2fb92dbc6e Cleaning up previous debug lines 2016-12-13 07:53:43 +00:00
Guido Cossu 5c74b6028b Commit for debugging, lot of IO 2016-12-13 06:35:30 +00:00
Guido Cossu e0be2b6e6c Adding a new test for the Ls vec CG 2016-12-13 04:59:18 +00:00
Azusa Yamaguchi 389e0a77bd Staggered Fermion 5D 2016-11-29 13:13:56 +00:00
Guido Cossu 977b0a6dd9 Merge branch 'develop' into feature/hmc_generalise 2016-10-20 17:04:41 +01:00
Guido Cossu 2e453dfbf5 Added some instrumentation to benchmark the force computation 2016-10-06 17:52:45 +01:00
Guido Cossu c065e454c3 Adding Binary IO, untested 2016-10-06 10:12:11 +01:00
portelli 629283726b build system: local Grid link flag moved to configure.ac 2016-08-03 15:07:42 +01:00
portelli 9e5b934d21 improved LAPACK configuration 2016-08-02 17:26:54 +01:00
portelli e9f30cab2c first working version for the new build system 2016-07-30 17:53:18 +01:00
paboyle a0676beeb1 Open up dependency on Eigen and FFTW 2016-07-07 22:31:07 +01:00