paboyle
32a52d7583
Move the local coherence lanczos into algorithms.
...
Keep the I/O in the tester. Other people can copy this method to write other I/O formats.
2017-10-27 09:04:31 +01:00
paboyle
0c4ddaea0b
Cleaning up
2017-10-26 23:31:46 +01:00
Azusa Yamaguchi
034de160bf
Staggered updates : Schur fixed and added a unit test for Test_staggered_cg_schur.cc giving stronger check
2017-10-26 20:58:46 +01:00
paboyle
31f99574fa
Moving these out of algorithms
2017-10-26 07:47:42 +01:00
paboyle
a34c8a2961
Update to IRL; getting close to the structure I would like.
2017-10-26 07:45:56 +01:00
paboyle
f6c3f6bf2d
XML serialisation of parms and initialise from parms object
2017-10-25 23:47:59 +01:00
paboyle
d83868fdbb
Identity linear op added -- useful in circumstances where a linear op may or may not be needed.
...
Supply a trivial one if not needed
2017-10-25 23:47:10 +01:00
paboyle
303e0b927d
Improvements for coarse grid compressed lanczos
2017-10-25 23:46:33 +01:00
paboyle
e325929851
ALl codes compile against the new Lanczos call signature
2017-10-13 14:02:43 +01:00
paboyle
47af3565f4
Logging improvement; reunified the Lanczos codes
2017-10-13 13:23:07 +01:00
paboyle
4b4d187935
Reunified the Lanczos implementations
2017-10-13 13:22:44 +01:00
paboyle
9aff354ab5
Final version prior to reunification
2017-10-13 13:22:26 +01:00
paboyle
cb9ff20249
Approx tests and lanczos improvement
2017-10-13 11:30:50 +01:00
paboyle
9fe6ac71ea
Starting reorg of Blocked lanczos
2017-10-11 10:12:07 +01:00
Christopher Kelly
ef61b549e6
Merge branch 'feature/Lanczos' into ckelly_develop4
2017-10-10 13:41:43 -04:00
paboyle
bf58557fb1
Block compressed Lanczos
2017-10-10 14:15:11 +01:00
paboyle
a1d80282ec
cb factorise
2017-10-10 13:49:31 +01:00
paboyle
4eb8bbbebe
Christop mods
2017-10-10 13:48:51 +01:00
Azusa Yamaguchi
bb7378cfc3
Schur for staggered
2017-10-10 12:02:18 +01:00
Azusa Yamaguchi
f0e084a88c
Schur staggered
2017-10-10 10:00:43 +01:00
paboyle
4f8b6f26b4
Merge branch 'develop' into feature/dwf-multirhs
2017-10-02 11:41:49 +01:00
paboyle
fddeb29d6b
Bug fix with spreadout FFT
2017-09-21 11:10:08 +01:00
Peter Boyle
946a8671b9
Merge pull request #129 from djm2131/feature/eofa
...
Add support for DWF with the exact one flavor algorithm
2017-09-21 10:15:21 +01:00
Peter Boyle
771a1b8e79
Merge pull request #128 from paboyle/feature/CG-reliable-update
...
Feature/cg reliable update
2017-09-21 10:12:03 +01:00
Azusa Yamaguchi
b83b2b1415
Stability improvement to BCG. Force m_rr hermitian beyond rounding.
2017-09-04 14:09:47 +01:00
Chulwoo Jung
3006663b9c
Schur solver for staggered type (hermition Mpc) opertors
2017-08-31 21:32:01 -04:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
Chulwoo Jung
0145685f96
Added Staggered Type Preconditioned operator
2017-08-18 01:44:31 -04:00
David Murphy
41f73ec083
Add ChronoForecast class for forecasting solutions across poles in the EOFA heatbath
2017-08-16 12:37:38 -04:00
Chulwoo Jung
e73e4b4002
Minor changes fixes
2017-08-11 01:35:25 -04:00
Chulwoo Jung
caa6605b43
Still tweaking memory saving routines in Lanczos
2017-08-07 00:01:04 -04:00
Christopher Kelly
9939b267d2
Added switching to fallback linear operator in reliable update CG, and added recalculation of b parameter on update.
2017-07-31 13:39:44 -04:00
Chulwoo Jung
191fbf85fc
Added ImplicitlyRestartedLanczosCJ to Algorithms.h
2017-07-28 15:33:59 -04:00
Christopher Kelly
9f280b82c4
Added mixed-precision CG with reliable updates
2017-07-25 11:30:41 -04:00
Chulwoo Jung
93650f3a61
Adding back (temporarily) dense matrix routines until Lanczos is fininalized
2017-07-24 21:49:25 -04:00
Chulwoo Jung
cab4b4d063
Deleting old include file references
2017-07-24 20:51:31 -04:00
Chulwoo Jung
cf4b30b2dd
re-adding ImplcitlyRestartedLanczos
2017-07-24 20:40:25 -04:00
Chulwoo Jung
c51d0b4078
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-07-24 20:35:29 -04:00
paboyle
e504260f3d
Able to run a test job splitting into multiple MPI subdomains.
2017-06-22 18:53:11 +01:00
paboyle
b9104f3072
Block CG
2017-06-21 21:08:03 +01:00
paboyle
e8b95bd35b
Clean up finished. Could shrink Lanczos to around 400 lines at a push
2017-06-21 02:50:09 +01:00
paboyle
7e35286860
Simplified lanczos, added Eigen diagonalisation.
...
Curious if we can deprecate dependencly on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle
0486ff8e79
Improved the lancos
2017-06-20 18:46:01 +01:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
...
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
Chulwoo Jung
2f4cbeb4d5
Minor changes
2017-06-12 18:25:18 -04:00
Chulwoo Jung
fb7c4fb815
Recovering lapack interface without array allocation
2017-06-07 00:00:59 -04:00
Chulwoo Jung
00bb71e5af
Checking in before reworking lapack interface
2017-06-06 16:26:41 -04:00
Chulwoo Jung
cfed2c1ea0
Broken Lanczos. Going back to an older verion temporarily.
2017-06-06 12:14:45 -04:00
Chulwoo Jung
b1b15f0b70
Further fixes from multidimensional array
2017-06-05 23:13:41 -04:00
Chulwoo Jung
927c7ae3ed
changed allocation for LAPACK temporaries, to avoid crashing with some compilers (reported by Christoph)
2017-05-25 21:43:53 -04:00
Chulwoo Jung
05d04ceff8
Adding SimpleLanczos
2017-05-25 12:30:47 -04:00
Chulwoo Jung
5c479ce663
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-05-24 18:58:53 -04:00
Chulwoo Jung
4bf9d65bf8
Checking in memory saving version of Lanczos
2017-05-24 18:57:32 -04:00
Chulwoo Jung
3a056c4dff
Re-adding Bisection for SimpleLanczos
2017-05-22 18:23:03 -04:00
Chulwoo Jung
b0ba651654
Turning off the final sort for now
2017-05-19 10:49:09 -04:00
Chulwoo Jung
25d4c175c3
Cleaning up Lanczos
2017-05-18 18:33:47 -04:00
paboyle
2439999ec8
Warning elimination; drop to -O2 on G++ bad versions
2017-05-06 14:44:49 +01:00
Chulwoo Jung
a8d7986e1c
Temporary (hopefully) change to run with GCC for now.
2017-05-05 10:55:07 -04:00
Guido Cossu
20999c1370
Merge branch 'develop' into feature/hmc_generalise
2017-05-05 12:47:17 +01:00
Chulwoo Jung
92ec509bfa
Commiting to move to Jlab
2017-05-04 19:32:00 -04:00
Chulwoo Jung
e80a87ff7f
Checking in before modifying
2017-05-04 16:05:07 -04:00
ea9aef7baa
New header for standard headers (was an issue with Remez.h and external compilation)
2017-05-02 18:26:11 +01:00
Chulwoo Jung
867fe93018
First Rotate reorg done.
2017-05-02 01:26:22 -04:00
Chulwoo Jung
09651c3326
Checking in before rearranging Lanczos
2017-05-02 00:47:18 -04:00
Chulwoo Jung
f87f2a3f8b
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-05-01 12:00:47 -04:00
Guido Cossu
3344788fa1
Merge branch 'develop' into feature/hmc_generalise
2017-05-01 12:13:56 +01:00
paboyle
8e161152e4
MultiRHS solver improvements with slice operations moved into lattice and sped up.
...
Block solver requires a lot of performance work.
2017-04-18 10:51:55 +01:00
paboyle
3141ebac10
MultiRHS working, starting to optimise. Block doesn't and I thought it already was; puzzled.
2017-04-17 10:50:19 +01:00
paboyle
7ede696126
Non compile of tests fixed
2017-04-16 23:40:00 +01:00
Chulwoo Jung
a07556dd5f
Added back the convergence test from evecs of tridiagonal matrix. Bugfixes
2017-04-15 09:32:15 -04:00
paboyle
a8db024c92
Cleaning up the dense matrix and lanczos sector
2017-04-15 08:54:11 +01:00
paboyle
42fb49d3fd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-04-13 14:12:47 +01:00
8ef4300412
spurious .dirstamp files removed
2017-04-10 17:00:22 +01:00
paboyle
b12dc89d26
Commenting and clean up
2017-04-10 20:38:20 +09:00
paboyle
d80d802f9d
MultiRHS solver test
2017-04-10 00:12:12 +09:00
paboyle
3d99b09dba
Start of blockCG
2017-04-09 23:42:10 +09:00
Chulwoo Jung
93cb5d4e97
Working version of Lanczos without the extra copy.
2017-04-06 23:35:30 -04:00
Chulwoo Jung
9e48b7dfda
MEM_SAVE in Lanczos seems to be working, but not pretty
2017-04-06 22:21:56 -04:00
Guido Cossu
8c540333d5
Merge branch 'develop' into feature/hmc_generalise
2017-04-05 14:41:04 +01:00
Chulwoo Jung
d0c2c9c71f
Merge branch 'develop' of https://github.com/paboyle/Grid into bugfix/dminus
2017-04-04 15:20:17 -04:00
Chulwoo Jung
c8cafa77ca
Checking in the latest Lacnzos
2017-04-04 15:18:12 -04:00
paboyle
9cbcdd65d7
No random device seed
2017-04-02 00:26:57 +09:00
Chulwoo Jung
a3bcad3804
Added preconditioned SYM2 solver (SchurRedBlackDiagTwoSolve)
2017-03-30 20:33:27 -04:00
paboyle
e099dcdae7
Merge branch 'develop' into feature/bgq-asm
2017-02-23 00:25:29 +00:00
paboyle
4e7ab3166f
Refactoring header layout
2017-02-22 18:09:33 +00:00
paboyle
3ae92fa2e6
Global changes to parallel_for structure.
...
Move the comms flags to more sensible names
2017-02-21 05:24:27 -05:00
Guido Cossu
e0571c872b
Merge branch 'develop' into feature/hmc_generalise
2017-02-09 16:12:00 +00:00
Christopher Kelly
c94133af49
Added iteration reporting to CG and mixed CG
...
Added ability to manually change the initial CG inner tolerance in mixed CG
Added .hpp files to filelist script
2017-02-02 17:04:42 -05:00
fad743fbb1
Build system sanity check: corrected several headers not in the <Grid/*> format
2017-01-26 17:00:41 -08:00
Guido Cossu
0bd296dda4
Adding check of the Dag part in the benchmark
2016-12-14 03:15:09 +00:00
Guido Cossu
2fb92dbc6e
Cleaning up previous debug lines
2016-12-13 07:53:43 +00:00
Guido Cossu
5c74b6028b
Commit for debugging, lot of IO
2016-12-13 06:35:30 +00:00
Guido Cossu
01480da0a8
Merge branch 'develop' into feature/hmc_generalise
2016-12-05 05:10:27 +00:00
97cddda49e
Merge branch 'feature/gen-simd' into feature/doxygen
...
# Conflicts:
# Makefile.am
# configure.ac
2016-11-19 13:11:13 +01:00
Guido Cossu
a783282b8b
Merge branch 'develop' into feature/hmc_generalise
2016-11-10 18:13:07 +00:00
paboyle
33dc1f51b5
Final sign off commits from Cori-1
2016-11-09 04:11:03 -08:00
azusayamaguchi
96ba42a297
omm buf
2016-11-04 22:47:25 +00:00
e74417ca12
big build system polish
2016-10-31 16:31:27 +00:00
Guido Cossu
977b0a6dd9
Merge branch 'develop' into feature/hmc_generalise
2016-10-20 17:04:41 +01:00