Azusa Yamaguchi
7fbbb31a50
Merge branch 'develop' into feature/staggered-comms-compute
Conflicts:
lib/qcd/action/fermion/ImprovedStaggeredFermion.cc
2018-05-21 13:07:29 +01:00
Azusa Yamaguchi
68c028b0a6
Comment
2018-05-21 12:54:25 +01:00
Azusa Yamaguchi
9ada378e38
Add timing
2018-05-04 10:58:01 +01:00
Azusa Yamaguchi
587bfcc0f4
Add Timing
2018-05-03 12:10:31 +01:00
Azusa Yamaguchi
4f4181c54a
Merge branch 'feature/staggered-comms-compute' of https://github.com/paboyle/Grid into feature/staggered-comms-compute
2018-05-02 14:59:13 +01:00
Azusa Yamaguchi
441ad7498d
add Iterative counter
2018-05-02 14:21:30 +01:00
paboyle
b27f0e5a53
Control over IO
2018-04-27 07:50:15 +01:00
paboyle
75e4483407
Stronger convergence test
2018-04-27 07:49:57 +01:00
Azusa Yamaguchi
3e125c5b61
Faster linalg on CG optimised against staggered
Sum overhead is bigger for staggered
2018-04-26 10:07:19 +01:00
Fionn O hOgain
2e88408f5c
Some changes needed for deflation interface
2018-03-02 22:27:41 +00:00
Azusa Yamaguchi
0f468e2179
OverlappedComm for Staggered 5D and 4D.
2018-02-22 12:50:09 +00:00
paboyle
559a501140
Deflation interface for solvers
2018-02-20 14:29:08 +00:00
paboyle
945684c470
updates for deflation in the RB solver
2018-02-20 14:28:38 +00:00
paboyle
c96483e3bd
Whitespace only change
2018-02-13 11:39:07 +00:00
paboyle
ae31a6a760
Move deflate to right class
2018-02-13 02:11:37 +00:00
paboyle
dd8f2a64fe
Interface to suit hadrons on Lanczos
2018-02-13 02:08:49 +00:00
azusayamaguchi
e55397bc13
Staggered CG
2017-11-24 14:18:30 +00:00
fionnoh
1d7ccc6b2c
Declaring virtual functions as pure virtual functions.
2017-11-09 19:46:57 +00:00
Azusa Yamaguchi
9b8d1cc3da
Staggered Schur decomposed matrix norm changed to not be the Schur anymore :(
Carleton wanted this for multimass / multishift
2017-11-07 14:48:45 +00:00
paboyle
360efd0088
Improved treatment of reverse asked for by Chris.
Truncate the basis.
Power method renormalises
2017-11-02 22:05:31 +00:00
paboyle
fe4d9b003c
More digits
2017-10-30 00:04:47 +00:00
paboyle
84b441800f
Merge branch 'develop' into feature/lanczos-reorg
2017-10-27 14:21:38 +01:00
paboyle
f96c800d25
Passes reload of coarse basis
2017-10-27 09:43:22 +01:00
paboyle
32a52d7583
Move the local coherence lanczos into algorithms.
Keep the I/O in the tester. Other people can copy this method to write other I/O formats.
2017-10-27 09:04:31 +01:00
paboyle
0c4ddaea0b
Cleaning up
2017-10-26 23:31:46 +01:00
Azusa Yamaguchi
034de160bf
Staggered updates : Schur fixed and added a unit test for Test_staggered_cg_schur.cc giving stronger check
2017-10-26 20:58:46 +01:00
paboyle
31f99574fa
Moving these out of algorithms
2017-10-26 07:47:42 +01:00
paboyle
a34c8a2961
Update to IRL; getting close to the structure I would like.
2017-10-26 07:45:56 +01:00
paboyle
e325929851
All codes compile against the new Lanczos call signature
2017-10-13 14:02:43 +01:00
paboyle
47af3565f4
Logging improvement; reunified the Lanczos codes
2017-10-13 13:23:07 +01:00
paboyle
4b4d187935
Reunified the Lanczos implementations
2017-10-13 13:22:44 +01:00
paboyle
9aff354ab5
Final version prior to reunification
2017-10-13 13:22:26 +01:00
paboyle
cb9ff20249
Approx tests and lanczos improvement
2017-10-13 11:30:50 +01:00
paboyle
9fe6ac71ea
Starting reorg of Blocked lanczos
2017-10-11 10:12:07 +01:00
paboyle
bf58557fb1
Block compressed Lanczos
2017-10-10 14:15:11 +01:00
paboyle
a1d80282ec
cb factorise
2017-10-10 13:49:31 +01:00
Azusa Yamaguchi
f0e084a88c
Schur staggered
2017-10-10 10:00:43 +01:00
paboyle
4f8b6f26b4
Merge branch 'develop' into feature/dwf-multirhs
2017-10-02 11:41:49 +01:00
Peter Boyle
771a1b8e79
Merge pull request #128 from paboyle/feature/CG-reliable-update
Feature/cg reliable update
2017-09-21 10:12:03 +01:00
Azusa Yamaguchi
b83b2b1415
Stability improvement to BCG. Force m_rr hermitian beyond rounding.
2017-09-04 14:09:47 +01:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
Code stalls and resumes on KNL at Cambridge. Curious.
CG iterations take 23 ms each, then there are 3200 ms pauses. Mean bandwidth reports
as 200 MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*... Could be swap to disk?
2017-08-23 15:07:18 +01:00
Christopher Kelly
9939b267d2
Added switching to fallback linear operator in reliable update CG, and added recalculation of b parameter on update.
2017-07-31 13:39:44 -04:00
Christopher Kelly
9f280b82c4
Added mixed-precision CG with reliable updates
2017-07-25 11:30:41 -04:00
paboyle
e504260f3d
Able to run a test job splitting into multiple MPI subdomains.
2017-06-22 18:53:11 +01:00
paboyle
e8b95bd35b
Clean up finished. Could shrink Lanczos to around 400 lines at a push
2017-06-21 02:50:09 +01:00
paboyle
7e35286860
Simplified Lanczos, added Eigen diagonalisation.
Curious if we can deprecate dependency on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle
0486ff8e79
Improved the Lanczos
2017-06-20 18:46:01 +01:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
paboyle
2439999ec8
Warning elimination; drop to -O2 on bad G++ versions
2017-05-06 14:44:49 +01:00