1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-05-07 18:54:31 +01:00
Commit Graph

532 Commits

Author SHA1 Message Date
Peter Boyle 3c3d6a94f3 OPtimising the force term a bit 2020-01-04 03:16:23 -05:00
Peter Boyle 205ea4bbb2 More verboose Lanczos 2020-01-04 03:13:40 -05:00
Peter Boyle 039eb7b2eb Make the force term and coarsening multigrid more optimised 2020-01-04 03:12:17 -05:00
Peter Boyle f7e4bd1f6d Getting more optimised 2020-01-04 03:11:53 -05:00
Peter Boyle ba40a3f763 Alternate low pass filter option 2020-01-03 05:29:09 -05:00
Peter Boyle c0d8e4dce5 Improved Multigrid for DWF 2019-12-28 10:32:15 -05:00
Peter Boyle 9cfd64c604 Coarse grid on GPU, not fast enough yet. Need a 10x 2019-12-17 05:24:45 -05:00
Peter Boyle 9aafd20468 Simple block project promote runs faster on GPU 2019-12-17 05:01:39 -05:00
Peter Boyle 9e15474999 Accelerator loop attempt at speed up 2019-12-14 05:28:16 -05:00
Peter Boyle 152b525a4d Typo fix 2019-12-13 22:44:42 -05:00
Peter Boyle d18994eddc offload more of mgrid to GPU 2019-12-13 22:08:11 -05:00
Peter Boyle 736b19485e Faster set up and some dead code ifdef'ed out 2019-12-13 21:30:48 -05:00
Peter Boyle 5bfd1470ad Merge branch 'develop' into feature/hdcr 2019-12-10 21:51:06 -05:00
Peter Boyle d73f0b8618 Verbose for temporary debug 2019-12-10 21:50:06 -05:00
Peter Boyle 0b3a3562c3 Some MPI (summit) create sigusr2, so trap that 2019-12-10 21:49:12 -05:00
Peter Boyle 710fee5d26 Subspace setup testing code
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle 848079e8ba Merge pull request #235 from grid-test-organisation/feature/5d-improvement
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
Peter Boyle f2a4f13111 Must offload the Coarsened matrix if Stencil buffers are device resident 2019-12-10 19:32:12 -05:00
portelli 6446671a9c Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
Peter Boyle 9b6b0caa55 Junk commit fix 2019-12-09 03:01:58 -05:00
Peter Boyle 2a48617ac5 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-12-09 03:00:00 -05:00
Peter Boyle 58a31f0763 QMR implemented, preserve even if not used much 2019-12-09 02:59:13 -05:00
Peter Boyle 3d2fe80780 Temporary size depends on checkerboard/uncheckerboard. The Mdir cares 2019-12-09 02:58:24 -05:00
Peter Boyle 0dfdf80407 Logging 2019-12-09 02:54:52 -05:00
Peter Boyle 2912071f83 Add non hermitian operator 2019-12-09 02:51:53 -05:00
Peter Boyle 26605ef387 HDCR back to working 2019-12-09 02:51:01 -05:00
ferben f7698b93ca corrected comments about quark line directions 2019-12-06 09:46:52 +00:00
ferben a54157e682 more definitions changed 2019-12-05 17:08:09 +00:00
ferben b766038810 new syntax after merge 2019-12-04 18:08:00 +00:00
ferben cd9fd80a5d merged in develop 2019-12-04 17:12:46 +00:00
ferben e940f4db7e removed unused parameter parity 2019-12-03 12:01:31 +00:00
Michael Marshall 7983ff2fdd Merge branch 'develop' into feature/distil
* develop:
  Change to reporting
  NVCC timer support
  Fix nocompilee under NVCC
  --enable-summit flag
  IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could be a little faster
  Sliced propagator contraction was not producing any results because buf.size()=0
  several typos in hadrons
2019-11-30 16:47:03 +00:00
Michael Marshall 2db814f2b7 Resolve conflicts in BaryonUtils (just use latest from develop) 2019-11-29 18:19:35 +00:00
ferben 799ff0c96e speed-up 2019-11-26 15:28:47 +00:00
ferben 5fd5c25114 now two seperate functions for Eye and NonEye 2019-11-26 13:44:55 +00:00
Peter Boyle d1a89af8c9 Change to reporting 2019-11-22 10:49:10 -05:00
Peter Boyle d91ba1f6cc NVCC timer support 2019-11-21 20:11:19 +00:00
Peter Boyle f4d27e7090 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-11-21 20:09:31 +00:00
Peter Boyle feb1ff3494 Fix nocompilee under NVCC 2019-11-21 20:03:39 +00:00
Peter Boyle 98ea67b636 IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could
be a little faster
2019-11-21 15:00:46 -05:00
ferben 421a4395af Sigma to Nucleon contractions 2019-11-21 17:25:37 +00:00
Michael Marshall 22c654182a Fixes for GPU compile 2019-11-04 17:24:34 +00:00
Michael Marshall efe2f2d48b Merge branch 'develop' into feature/distil
* develop:
  Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
  Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions Fixed Lattice::reset not compiling with new Grid explicit memory region handling Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-11-01 15:38:48 +00:00
Peter Boyle ac614cbc53 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-10-31 11:46:43 -04:00
Peter Boyle ec8e060ec7 Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun 2019-10-31 11:46:09 -04:00
Michael Marshall 3b3680c64e Reversed Felix's interim A2Autils.h changes ... these were finished and went into develop via a separate branch 2019-10-30 15:50:04 +00:00
Michael Marshall 2a926b3dc6 Merged latest changes from develop, in preparation for release. 2019-10-30 14:52:34 +00:00
Chris K 845a045493 Merge pull request #233 from giltirn/lanczos_fix
A few run /compile / memory leak fixes
2019-10-30 10:21:59 -04:00
Michael Marshall eb8848a071 Merge branch 'develop' into feature/distil
* develop: (27 commits)
  Update README.md
  result layout standardised, iterator size more elegant
  updated syntac in Test_hadrons_spectrum
  chroma-regression test now prints difference correctly
  baryon input strings are now pairs of pairs of gammas - still ugly!!
  second update to pull request
  Changing back interface for Gamma3pt
  Removing old debug code
  Changes to A2Autils
  suggested changes for 1st pull request implemented
  changed input parameters for easier use
  Should compile everywhere now
  changed baryon interface
  added author information
  ready for pull request
  code compiling now - still need to test
  Baryons module works in 1 of 3 cases - still need SlicedProp and Msource part!!
  thread_for caused the problems - slow for loop for now
  still bugfix
  weird bug...
  ...

# Conflicts:
#	Hadrons/Modules.hpp
#	Hadrons/modules.inc
2019-10-30 14:13:00 +00:00
portelli c97f780784 Merge pull request #243 from fionnoh/feature/A2A_current_insertion
Feature/a2 a current insertion
2019-10-22 13:55:53 +01:00