Peter Boyle
3c3d6a94f3
OPtimising the force term a bit
2020-01-04 03:16:23 -05:00
Peter Boyle
205ea4bbb2
More verboose Lanczos
2020-01-04 03:13:40 -05:00
Peter Boyle
039eb7b2eb
Make the force term and coarsening multigrid more optimised
2020-01-04 03:12:17 -05:00
Peter Boyle
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
Peter Boyle
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
Peter Boyle
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
Peter Boyle
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
Peter Boyle
9aafd20468
Simple block project promote runs faster on GPU
2019-12-17 05:01:39 -05:00
Peter Boyle
9e15474999
Accelerator loop attempt at speed up
2019-12-14 05:28:16 -05:00
Peter Boyle
152b525a4d
Typo fix
2019-12-13 22:44:42 -05:00
Peter Boyle
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
Peter Boyle
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
Peter Boyle
5bfd1470ad
Merge branch 'develop' into feature/hdcr
2019-12-10 21:51:06 -05:00
Peter Boyle
d73f0b8618
Verbose for temporary debug
2019-12-10 21:50:06 -05:00
Peter Boyle
0b3a3562c3
Some MPI (summit) create sigusr2, so trap that
2019-12-10 21:49:12 -05:00
Peter Boyle
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
Peter Boyle
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
6446671a9c
Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
...
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
Peter Boyle
9b6b0caa55
Junk commit fix
2019-12-09 03:01:58 -05:00
Peter Boyle
2a48617ac5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:00:00 -05:00
Peter Boyle
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
Peter Boyle
3d2fe80780
Temporary size depends on checkerboard/uncheckerboard. The Mdir cares
2019-12-09 02:58:24 -05:00
Peter Boyle
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
Peter Boyle
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
Peter Boyle
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
ferben
f7698b93ca
corrected comments about quark line directions
2019-12-06 09:46:52 +00:00
ferben
a54157e682
more definitions changed
2019-12-05 17:08:09 +00:00
ferben
b766038810
new syntax after merge
2019-12-04 18:08:00 +00:00
ferben
cd9fd80a5d
merged in develop
2019-12-04 17:12:46 +00:00
ferben
e940f4db7e
removed unused parameter parity
2019-12-03 12:01:31 +00:00
Michael Marshall
7983ff2fdd
Merge branch 'develop' into feature/distil
...
* develop:
Change to reporting
NVCC timer support
Fix nocompilee under NVCC
--enable-summit flag
IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could be a little faster
Sliced propagator contraction was not producing any results because buf.size()=0
several typos in hadrons
2019-11-30 16:47:03 +00:00
Michael Marshall
2db814f2b7
Resolve conflicts in BaryonUtils (just use latest from develop)
2019-11-29 18:19:35 +00:00
799ff0c96e
speed-up
2019-11-26 15:28:47 +00:00
5fd5c25114
now two seperate functions for Eye and NonEye
2019-11-26 13:44:55 +00:00
Peter Boyle
d1a89af8c9
Change to reporting
2019-11-22 10:49:10 -05:00
Peter Boyle
d91ba1f6cc
NVCC timer support
2019-11-21 20:11:19 +00:00
Peter Boyle
f4d27e7090
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-11-21 20:09:31 +00:00
Peter Boyle
feb1ff3494
Fix nocompilee under NVCC
2019-11-21 20:03:39 +00:00
Peter Boyle
98ea67b636
IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could
...
be a little faster
2019-11-21 15:00:46 -05:00
ferben
421a4395af
Sigma to Nucleon contractions
2019-11-21 17:25:37 +00:00
Michael Marshall
22c654182a
Fixes for GPU compile
2019-11-04 17:24:34 +00:00
Michael Marshall
efe2f2d48b
Merge branch 'develop' into feature/distil
...
* develop:
Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions Fixed Lattice::reset not compiling with new Grid explicit memory region handling Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-11-01 15:38:48 +00:00
Peter Boyle
ac614cbc53
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-10-31 11:46:43 -04:00
Peter Boyle
ec8e060ec7
Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
2019-10-31 11:46:09 -04:00
Michael Marshall
3b3680c64e
Reversed Felix's interim A2Autils.h changes ... these were finished and went into develop via a separate branch
2019-10-30 15:50:04 +00:00
Michael Marshall
2a926b3dc6
Merged latest changes from develop, in preparation for release.
2019-10-30 14:52:34 +00:00
Chris K
845a045493
Merge pull request #233 from giltirn/lanczos_fix
...
A few run /compile / memory leak fixes
2019-10-30 10:21:59 -04:00
Michael Marshall
eb8848a071
Merge branch 'develop' into feature/distil
...
* develop: (27 commits)
Update README.md
result layout standardised, iterator size more elegant
updated syntac in Test_hadrons_spectrum
chroma-regression test now prints difference correctly
baryon input strings are now pairs of pairs of gammas - still ugly!!
second update to pull request
Changing back interface for Gamma3pt
Removing old debug code
Changes to A2Autils
suggested changes for 1st pull request implemented
changed input parameters for easier use
Should compile everywhere now
changed baryon interface
added author information
ready for pull request
code compiling now - still need to test
Baryons module works in 1 of 3 cases - still need SlicedProp and Msource part!!
thread_for caused the problems - slow for loop for now
still bugfix
weird bug...
...
# Conflicts:
# Hadrons/Modules.hpp
# Hadrons/modules.inc
2019-10-30 14:13:00 +00:00
c97f780784
Merge pull request #243 from fionnoh/feature/A2A_current_insertion
...
Feature/a2 a current insertion
2019-10-22 13:55:53 +01:00