1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-04-16 00:35:57 +01:00

126 Commits

Author SHA1 Message Date
Christopher Kelly
0896f2cead Added missing include guards in bigfloat_double.h 2020-04-20 10:30:38 -04:00
Christopher Kelly
181709bba4 Merge branch 'develop' into feature/zmobius_paramcompute 2020-04-20 09:12:34 -04:00
Peter Boyle
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
feature/CGinfo
2020-04-16 11:46:08 -04:00
Peter Boyle
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
Peter Boyle
11dec4883c Don't throw assert 2020-04-10 11:09:11 -04:00
Peter Boyle
afa458c812 Extra solvers 2020-04-10 11:08:19 -04:00
Peter Boyle
dc50190b8f Faster GPU basis rotation
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
Carleton DeTar
165c68e28e Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift 2020-02-29 17:51:51 -06:00
Carleton DeTar
9479bc8486 Make IterationsToComplete and TrueResidual externally accessible 2020-02-19 17:43:57 -06:00
Peter Boyle
8a5c13d5fb Still fast moving in changes 2020-02-06 17:57:26 -05:00
Peter Boyle
bdccb0c91f Working 2 types of decomposition 2020-02-06 17:26:55 -05:00
Peter Boyle
b9ca40cc44 More precise power method at start 2020-02-06 10:09:14 -05:00
Peter Boyle
2f421a5db1 Commeent fix 2020-02-06 10:08:27 -05:00
Peter Boyle
2b5de5bba5 MdagM operator without norm option 2020-01-27 13:44:30 -05:00
Peter Boyle
2e85cae74e Add Jacobi polynomials 2020-01-27 13:43:49 -05:00
Peter Boyle
76c823781e Much faster coarsening 2020-01-27 13:43:19 -05:00
Peter Boyle
114db3b99d Optional MdagM without norms 2020-01-27 13:42:51 -05:00
Peter Boyle
49e123dbda Use explicit linalg calls to get coalesce optimisations on GPU 2020-01-27 12:44:51 -05:00
Peter Boyle
8cec294ec9 Make CG a bit less verbose as gettign annoying in nested algorithms.
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
Peter Boyle
eb5b720e94 Normal Equations can be used in HDCR now 2020-01-27 12:43:29 -05:00
Peter Boyle
b2736ec80b Make PrecGCR recursive - it can precondition itself 2020-01-27 12:42:48 -05:00
Peter Boyle
086256a032 Less sloppy convergence test on PowerMethod 2020-01-27 12:41:59 -05:00
Christopher Kelly
96671bbb24 Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
Peter Boyle
e583035614 Change to interface to minise comms in evaluating coarse space operator 2020-01-06 11:43:59 -05:00
Peter Boyle
205ea4bbb2 More verboose Lanczos 2020-01-04 03:13:40 -05:00
Peter Boyle
f7e4bd1f6d Getting more optimised 2020-01-04 03:11:53 -05:00
Peter Boyle
ba40a3f763 Alternate low pass filter option 2020-01-03 05:29:09 -05:00
Peter Boyle
c0d8e4dce5 Improved Multigrid for DWF 2019-12-28 10:32:15 -05:00
Peter Boyle
9cfd64c604 Coarse grid on GPU, not fast enough yet. Need a 10x 2019-12-17 05:24:45 -05:00
Peter Boyle
d18994eddc offload more of mgrid to GPU 2019-12-13 22:08:11 -05:00
Peter Boyle
736b19485e Faster set up and some dead code ifdef'ed out 2019-12-13 21:30:48 -05:00
David Murphy
843ca9350a Fix naming conventions to be consistent with Peter 2019-12-11 11:46:18 -05:00
Peter Boyle
710fee5d26 Subspace setup testing code
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle
f2a4f13111 Must offload the Coarsened matrix if Stencil buffers are device resident 2019-12-10 19:32:12 -05:00
David Murphy
4180a4a8a7 Import BiCGSTAB solvers and tests 2019-12-10 17:20:35 -05:00
Peter Boyle
58a31f0763 QMR implemented, preserve even if not used much 2019-12-09 02:59:13 -05:00
Peter Boyle
0dfdf80407 Logging 2019-12-09 02:54:52 -05:00
Peter Boyle
2912071f83 Add non hermitian operator 2019-12-09 02:51:53 -05:00
Peter Boyle
26605ef387 HDCR back to working 2019-12-09 02:51:01 -05:00
Christopher Kelly
114ebb7914 Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions
Fixed Lattice::reset not compiling with new Grid explicit memory region handling
Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-08-26 16:47:44 -04:00
Peter Boyle
a23dc295ac Remove compiler errors and warnings 2019-07-18 14:47:02 +01:00
Peter Boyle
08904f830e Merge develop 2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b Merge branch 'develop' into feature/gpu-port 2019-07-16 11:55:17 +01:00
Peter Boyle
d7b3efe893 Compile fix 2019-06-15 17:03:15 +01:00
Peter Boyle
82306913a8 Move Schur operator into correct place 2019-06-15 12:49:22 +01:00
Peter Boyle
d0d8dc8042 Thread loop changes 2019-06-15 07:45:09 +01:00
Peter Boyle
81eb1fd9f2 Accelerator loop changes for coalesced access 2019-06-15 07:44:47 +01:00
Peter Boyle
cb93d32cd9 Thread loop changes 2019-06-15 07:44:08 +01:00
Peter Boyle
8f223962ff Thread loop changed 2019-06-15 07:43:42 +01:00
Peter Boyle
7a1569bd46 Annoying, cannot rely on equivalence of Grid ComplexD adn Eigen Complex type on GPU.
Solve with ComplexD typecasts but must be a better way
2019-06-04 20:47:49 +01:00