paboyle
7e35286860
Simplified lanczos, added Eigen diagonalisation.
...
Curious if we can deprecate dependencly on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle
0486ff8e79
Improved the lancos
2017-06-20 18:46:01 +01:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
...
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
0a8faac271
Fix make tests compile
2017-06-19 22:54:18 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
Guido Cossu
20999c1370
Merge branch 'develop' into feature/hmc_generalise
2017-05-05 12:47:17 +01:00
paboyle
33f0ed1a33
No compile fix
2017-05-05 11:04:30 +01:00
Guido Cossu
3344788fa1
Merge branch 'develop' into feature/hmc_generalise
2017-05-01 12:13:56 +01:00
paboyle
8e161152e4
MultiRHS solver improvements with slice operations moved into lattice and sped up.
...
Block solver requires a lot of performance work.
2017-04-18 10:51:55 +01:00
paboyle
bf516c3b81
higher precision reduction variables in norm and inner product
2017-04-15 12:27:28 +01:00
paboyle
d80d802f9d
MultiRHS solver test
2017-04-10 00:12:12 +09:00
paboyle
3d99b09dba
Start of blockCG
2017-04-09 23:42:10 +09:00
Guido Cossu
8c540333d5
Merge branch 'develop' into feature/hmc_generalise
2017-04-05 14:41:04 +01:00
paboyle
6af459cae4
Christoph's coefficients.
2017-03-31 17:07:43 +09:00
paboyle
cb9a297a0a
Chulwoo's Zmobius test
2017-03-30 13:30:25 +09:00
paboyle
4b17e8eba8
Merge branch 'develop' into feature/bgq-asm
...
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
2017-03-28 04:49:30 -04:00
paboyle
18bde08d1b
Merge branch 'feature/staggering' into develop
2017-03-28 15:25:55 +09:00
Chulwoo Jung
0b61f75c9e
Adding ZMobius CG test
2017-03-13 00:12:43 -04:00
Guido Cossu
4e34132f4d
Correcting modules use in test files
2017-03-10 23:54:53 +09:00
Guido Cossu
7270c6a150
Integrator works now
2017-02-24 17:03:42 +00:00
Guido Cossu
902afcfbaf
Adding metric and the implicit steps
2017-02-21 11:30:57 +00:00
Guido Cossu
97a6b61551
Covariant laplacian and implicit integration
2017-02-20 11:17:27 +00:00
Guido Cossu
bafb101e4f
Testing different versions of the Laplacian
2017-02-13 15:38:11 +00:00
Guido Cossu
08fdf05528
Added and tested the covariant laplacian + CG solver
2017-02-13 15:05:01 +00:00
Guido Cossu
c3d7ec65fa
All tests compile.
2017-02-10 10:27:51 +00:00
Guido Cossu
e0571c872b
Merge branch 'develop' into feature/hmc_generalise
2017-02-09 16:12:00 +00:00
a37e71f362
New automatic implementation of gamma matrices, Meson and SeqGamma are broken
2017-01-23 19:13:43 -08:00
Guido Cossu
5214846341
Adding a resource manager
2016-12-22 12:41:56 +00:00
Guido Cossu
ce1a115e0b
Removing redundant arguments for integrator functions, step 1
2016-12-20 17:51:30 +00:00
Guido Cossu
af0ccdd8e9
Moving output order
2016-12-14 02:02:42 +00:00
Guido Cossu
2fb92dbc6e
Cleaning up previous debug lines
2016-12-13 07:53:43 +00:00
Guido Cossu
5c74b6028b
Commit for debugging, lot of IO
2016-12-13 06:35:30 +00:00
Guido Cossu
e0be2b6e6c
Adding a new tests for the Ls vec CG
2016-12-13 04:59:18 +00:00
Azusa Yamaguchi
389e0a77bd
Staggerd Fermion 5D
2016-11-29 13:13:56 +00:00
Guido Cossu
977b0a6dd9
Merge branch 'develop' into feature/hmc_generalise
2016-10-20 17:04:41 +01:00
Guido Cossu
2e453dfbf5
Added some instrumentation to benchmark the force computation
2016-10-06 17:52:45 +01:00
Guido Cossu
c065e454c3
Adding Binrary IO, untested
2016-10-06 10:12:11 +01:00
629283726b
build system: local Grid link flag moved to configure.ac
2016-08-03 15:07:42 +01:00
9e5b934d21
improved LAPACK configuration
2016-08-02 17:26:54 +01:00
e9f30cab2c
first working version for the new build system
2016-07-30 17:53:18 +01:00
paboyle
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00