55cdb17691
Integer divide for blocking
2020-01-27 12:27:45 -05:00
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximations, with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
e583035614
Change to interface to minimise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
3c3d6a94f3
Optimising the force term a bit
2020-01-04 03:16:23 -05:00
205ea4bbb2
More verbose Lanczos
2020-01-04 03:13:40 -05:00
039eb7b2eb
Make the force term and coarsening multigrid more optimised
2020-01-04 03:12:17 -05:00
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
0ca1992151
Remove warning in tensor layout comparison. Make default names and index names visible for PerambTensor and NoiseTensor
2019-12-20 13:53:27 +00:00
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x speed-up
2019-12-17 05:24:45 -05:00
9aafd20468
Simple block project promote runs faster on GPU
2019-12-17 05:01:39 -05:00
f7373e97a4
Missing conjugate in MooeeInvDag
2019-12-16 10:05:50 +01:00
9e15474999
Accelerator loop attempt at speed up
2019-12-14 05:28:16 -05:00
152b525a4d
Typo fix
2019-12-13 22:44:42 -05:00
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
843ca9350a
Fix naming conventions to be consistent with Peter
2019-12-11 11:46:18 -05:00
5bfd1470ad
Merge branch 'develop' into feature/hdcr
2019-12-10 21:51:06 -05:00
d73f0b8618
Verbose for temporary debug
2019-12-10 21:50:06 -05:00
0b3a3562c3
Some MPI (Summit) create SIGUSR2, so trap that
2019-12-10 21:49:12 -05:00
710fee5d26
Subspace setup testing code and timing verbose
2019-12-10 21:48:42 -05:00
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
4180a4a8a7
Import BiCGSTAB solvers and tests
2019-12-10 17:20:35 -05:00
6446671a9c
Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
...
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
9b6b0caa55
Junk commit fix
2019-12-09 03:01:58 -05:00
2a48617ac5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:00:00 -05:00
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
3d2fe80780
Temporary size depends on checkerboard/uncheckerboard. The Mdir cares
2019-12-09 02:58:24 -05:00
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
f7698b93ca
corrected comments about quark line directions
2019-12-06 09:46:52 +00:00
a54157e682
more definitions changed
2019-12-05 17:08:09 +00:00
b766038810
new syntax after merge
2019-12-04 18:08:00 +00:00
cd9fd80a5d
merged in develop
2019-12-04 17:12:46 +00:00
e940f4db7e
removed unused parameter parity
2019-12-03 12:01:31 +00:00
7983ff2fdd
Merge branch 'develop' into feature/distil
...
* develop:
Change to reporting
NVCC timer support
Fix nocompile under NVCC
--enable-summit flag
IBM Summit optimisation. Synchronise in node is still between 2 halves of AC922, so could be a little faster
Sliced propagator contraction was not producing any results because buf.size()=0
several typos in hadrons
2019-11-30 16:47:03 +00:00
2db814f2b7
Resolve conflicts in BaryonUtils (just use latest from develop)
2019-11-29 18:19:35 +00:00
799ff0c96e
speed-up
2019-11-26 15:28:47 +00:00
5fd5c25114
now two separate functions for Eye and NonEye
2019-11-26 13:44:55 +00:00
d1a89af8c9
Change to reporting
2019-11-22 10:49:10 -05:00
d91ba1f6cc
NVCC timer support
2019-11-21 20:11:19 +00:00
f4d27e7090
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-11-21 20:09:31 +00:00
feb1ff3494
Fix nocompile under NVCC
2019-11-21 20:03:39 +00:00
98ea67b636
IBM Summit optimisation. Synchronise in node is still between 2 halves of AC922, so could be a little faster
2019-11-21 15:00:46 -05:00
421a4395af
Sigma to Nucleon contractions
2019-11-21 17:25:37 +00:00
22c654182a
Fixes for GPU compile
2019-11-04 17:24:34 +00:00
efe2f2d48b
Merge branch 'develop' into feature/distil
...
* develop:
Summit jsrun GPU mapping updates. Configure with --enable-jsrun
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions. Fixed Lattice::reset not compiling with new Grid explicit memory region handling. Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated.
2019-11-01 15:38:48 +00:00