Christopher Kelly
0896f2cead
Added missing include guards in bigfloat_double.h
2020-04-20 10:30:38 -04:00
Christopher Kelly
181709bba4
Merge branch 'develop' into feature/zmobius_paramcompute
2020-04-20 09:12:34 -04:00
Peter Boyle
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
...
feature/CGinfo
2020-04-16 11:46:08 -04:00
Peter Boyle
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
...
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
Peter Boyle
11dec4883c
Don't throw assert
2020-04-10 11:09:11 -04:00
Peter Boyle
afa458c812
Extra solvers
2020-04-10 11:08:19 -04:00
Peter Boyle
dc50190b8f
Faster GPU basis rotation
...
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
Carleton DeTar
165c68e28e
Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift
2020-02-29 17:51:51 -06:00
Carleton DeTar
9479bc8486
Make IterationsToComplete and TrueResidual externally accessible
2020-02-19 17:43:57 -06:00
Peter Boyle
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
Peter Boyle
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
Peter Boyle
b9ca40cc44
More precise power method at start
2020-02-06 10:09:14 -05:00
Peter Boyle
2f421a5db1
Comment fix
2020-02-06 10:08:27 -05:00
Peter Boyle
2b5de5bba5
MdagM operator without norm option
2020-01-27 13:44:30 -05:00
Peter Boyle
2e85cae74e
Add Jacobi polynomials
2020-01-27 13:43:49 -05:00
Peter Boyle
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
Peter Boyle
114db3b99d
Optional MdagM without norms
2020-01-27 13:42:51 -05:00
Peter Boyle
49e123dbda
Use explicit linalg calls to get coalesce optimisations on GPU
2020-01-27 12:44:51 -05:00
Peter Boyle
8cec294ec9
Make CG a bit less verbose, as it was getting annoying in nested algorithms.
...
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
Peter Boyle
eb5b720e94
Normal Equations can be used in HDCR now
2020-01-27 12:43:29 -05:00
Peter Boyle
b2736ec80b
Make PrecGCR recursive - it can precondition itself
2020-01-27 12:42:48 -05:00
Peter Boyle
086256a032
Less sloppy convergence test on PowerMethod
2020-01-27 12:41:59 -05:00
Christopher Kelly
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
Peter Boyle
e583035614
Change to interface to minimise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
Peter Boyle
205ea4bbb2
More verbose Lanczos
2020-01-04 03:13:40 -05:00
Peter Boyle
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
Peter Boyle
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
Peter Boyle
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
Peter Boyle
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
Peter Boyle
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
Peter Boyle
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
David Murphy
843ca9350a
Fix naming conventions to be consistent with Peter
2019-12-11 11:46:18 -05:00
Peter Boyle
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
David Murphy
4180a4a8a7
Import BiCGSTAB solvers and tests
2019-12-10 17:20:35 -05:00
Peter Boyle
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
Peter Boyle
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
Peter Boyle
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
Peter Boyle
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
Christopher Kelly
114ebb7914
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions
...
Fixed Lattice::reset not compiling with new Grid explicit memory region handling
Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-08-26 16:47:44 -04:00
Peter Boyle
a23dc295ac
Remove compiler errors and warnings
2019-07-18 14:47:02 +01:00
Peter Boyle
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
Peter Boyle
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
Peter Boyle
82306913a8
Move Schur operator into correct place
2019-06-15 12:49:22 +01:00
Peter Boyle
d0d8dc8042
Thread loop changes
2019-06-15 07:45:09 +01:00
Peter Boyle
81eb1fd9f2
Accelerator loop changes for coalesced access
2019-06-15 07:44:47 +01:00
Peter Boyle
cb93d32cd9
Thread loop changes
2019-06-15 07:44:08 +01:00
Peter Boyle
8f223962ff
Thread loop changed
2019-06-15 07:43:42 +01:00
Peter Boyle
7a1569bd46
Annoying, cannot rely on equivalence of Grid ComplexD and Eigen Complex type on GPU.
...
Solve with ComplexD typecasts, but there must be a better way
2019-06-04 20:47:49 +01:00