0896f2cead
Added missing include guards in bigfloat_double.h
2020-04-20 10:30:38 -04:00
181709bba4
Merge branch 'develop' into feature/zmobius_paramcompute
2020-04-20 09:12:34 -04:00
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
...
feature/CGinfo
2020-04-16 11:46:08 -04:00
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
...
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
11dec4883c
Don't throw assert
2020-04-10 11:09:11 -04:00
afa458c812
Extra solvers
2020-04-10 11:08:19 -04:00
dc50190b8f
Faster GPU basis rotation
...
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
165c68e28e
Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift
2020-02-29 17:51:51 -06:00
9479bc8486
Make IterationsToComplete and TrueResidual externally accessible
2020-02-19 17:43:57 -06:00
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
b9ca40cc44
More precise power method at start
2020-02-06 10:09:14 -05:00
2f421a5db1
Commeent fix
2020-02-06 10:08:27 -05:00
2b5de5bba5
MdagM operator without norm option
2020-01-27 13:44:30 -05:00
2e85cae74e
Add Jacobi polynomials
2020-01-27 13:43:49 -05:00
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
114db3b99d
Optional MdagM without norms
2020-01-27 13:42:51 -05:00
49e123dbda
Use explicit linalg calls to get coalesce optimisations on GPU
2020-01-27 12:44:51 -05:00
8cec294ec9
Make CG a bit less verbose as gettign annoying in nested algorithms.
...
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
eb5b720e94
Normal Equations can be used in HDCR now
2020-01-27 12:43:29 -05:00
b2736ec80b
Make PrecGCR recursive - it can precondition itself
2020-01-27 12:42:48 -05:00
086256a032
Less sloppy convergence test on PowerMethod
2020-01-27 12:41:59 -05:00
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
e583035614
Change to interface to minise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
205ea4bbb2
More verboose Lanczos
2020-01-04 03:13:40 -05:00
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
843ca9350a
Fix naming conventions to be consistent with Peter
2019-12-11 11:46:18 -05:00
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
4180a4a8a7
Import BiCGSTAB solvers and tests
2019-12-10 17:20:35 -05:00
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
114ebb7914
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions
...
Fixed Lattice::reset not compiling with new Grid explicit memory region handling
Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-08-26 16:47:44 -04:00
a23dc295ac
Remove compiler errors and warnings
2019-07-18 14:47:02 +01:00
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
82306913a8
Move Schur operator into correct place
2019-06-15 12:49:22 +01:00
d0d8dc8042
Thread loop changes
2019-06-15 07:45:09 +01:00
81eb1fd9f2
Accelerator loop changes for coalesced access
2019-06-15 07:44:47 +01:00
cb93d32cd9
Thread loop changes
2019-06-15 07:44:08 +01:00
8f223962ff
Thread loop changed
2019-06-15 07:43:42 +01:00
7a1569bd46
Annoying, cannot rely on equivalence of Grid ComplexD adn Eigen Complex type on GPU.
...
Solve with ComplexD typecasts but must be a better way
2019-06-04 20:47:49 +01:00