Peter Boyle
e5d1c09665
Faster DhopDirAll for little dirac operator coarsening
2020-01-27 12:38:54 -05:00
Peter Boyle
8016a465ae
Remove extraneous variable
2020-01-27 12:35:37 -05:00
Peter Boyle
d8b9742092
DhopDirAll for faster matrix elements of little Dirac operator
2020-01-27 12:34:54 -05:00
Peter Boyle
1bd87c35d7
Read coalescing on Nvidia
2020-01-27 12:29:56 -05:00
Peter Boyle
fa856c9669
Disable information message
2020-01-27 12:28:46 -05:00
Peter Boyle
48008e4d8b
Thread coordinate creation loop
2020-01-27 12:28:16 -05:00
Peter Boyle
55cdb17691
Integer divide for blocking
2020-01-27 12:27:45 -05:00
Christopher Kelly
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
Peter Boyle
e583035614
Change to interface to minimise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
Peter Boyle
3c3d6a94f3
Optimising the force term a bit
2020-01-04 03:16:23 -05:00
Peter Boyle
205ea4bbb2
More verbose Lanczos
2020-01-04 03:13:40 -05:00
Peter Boyle
039eb7b2eb
Make the force term and coarsening multigrid more optimised
2020-01-04 03:12:17 -05:00
Peter Boyle
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
Peter Boyle
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
Peter Boyle
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
Michael Marshall
0ca1992151
Remove warning in tensor layout comparison. Make default names and index names visible for PerambTensor and NoiseTensor
2019-12-20 13:53:27 +00:00
Peter Boyle
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
Peter Boyle
9aafd20468
Simple block project promote runs faster on GPU
2019-12-17 05:01:39 -05:00
gfilaci
f7373e97a4
Missing conjugate in MooeeInvDag
2019-12-16 10:05:50 +01:00
Peter Boyle
9e15474999
Accelerator loop attempt at speed up
2019-12-14 05:28:16 -05:00
Peter Boyle
152b525a4d
Typo fix
2019-12-13 22:44:42 -05:00
Peter Boyle
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
Peter Boyle
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
David Murphy
843ca9350a
Fix naming conventions to be consistent with Peter
2019-12-11 11:46:18 -05:00
Peter Boyle
5bfd1470ad
Merge branch 'develop' into feature/hdcr
2019-12-10 21:51:06 -05:00
Peter Boyle
d73f0b8618
Verbose for temporary debug
2019-12-10 21:50:06 -05:00
Peter Boyle
0b3a3562c3
Some MPI implementations (Summit) raise SIGUSR2, so trap that
2019-12-10 21:49:12 -05:00
Peter Boyle
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
Peter Boyle
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
David Murphy
4180a4a8a7
Import BiCGSTAB solvers and tests
2019-12-10 17:20:35 -05:00
6446671a9c
Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
...
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
Peter Boyle
9b6b0caa55
Junk commit fix
2019-12-09 03:01:58 -05:00
Peter Boyle
2a48617ac5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:00:00 -05:00
Peter Boyle
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
Peter Boyle
3d2fe80780
Temporary size depends on checkerboard/uncheckerboard. The Mdir cares
2019-12-09 02:58:24 -05:00
Peter Boyle
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
Peter Boyle
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
Peter Boyle
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
ferben
f7698b93ca
corrected comments about quark line directions
2019-12-06 09:46:52 +00:00
ferben
a54157e682
more definitions changed
2019-12-05 17:08:09 +00:00
ferben
b766038810
new syntax after merge
2019-12-04 18:08:00 +00:00
ferben
cd9fd80a5d
merged in develop
2019-12-04 17:12:46 +00:00
ferben
e940f4db7e
removed unused parameter parity
2019-12-03 12:01:31 +00:00
Michael Marshall
7983ff2fdd
Merge branch 'develop' into feature/distil
...
* develop:
Change to reporting
NVCC timer support
Fix nocompile under NVCC
--enable-summit flag
IBM Summit optimisation. Synchronise in node is still between 2 halves of AC922, so could be a little faster
Sliced propagator contraction was not producing any results because buf.size()=0
several typos in hadrons
2019-11-30 16:47:03 +00:00
Michael Marshall
2db814f2b7
Resolve conflicts in BaryonUtils (just use latest from develop)
2019-11-29 18:19:35 +00:00
799ff0c96e
speed-up
2019-11-26 15:28:47 +00:00
5fd5c25114
now two separate functions for Eye and NonEye
2019-11-26 13:44:55 +00:00
Peter Boyle
d1a89af8c9
Change to reporting
2019-11-22 10:49:10 -05:00
Peter Boyle
d91ba1f6cc
NVCC timer support
2019-11-21 20:11:19 +00:00