1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-05-17 23:54:31 +01:00
Commit Graph

526 Commits

Author SHA1 Message Date
nmeyer-ur 77fa586f6c introduced A64FX Wilson kernels 2020-04-09 13:30:06 +02:00
nmeyer-ur 15238e8d5e reduce acle works, clean up 2020-04-03 20:40:44 +02:00
nmeyer-ur b27e31957a reduce acle revised 2020-04-03 19:46:15 +02:00
nmeyer-ur 46927771e3 reduce acle still needs overhaul 2020-04-03 19:30:48 +02:00
nmeyer-ur d8cea77707 define simd width in header 2020-04-03 19:22:25 +02:00
nmeyer-ur 5f8a76d490 clean up, reduction in acle 2020-04-03 19:18:24 +02:00
nmeyer-ur 28d49a3b60 build problem resolved 2020-04-03 16:52:48 +02:00
nmeyer-ur b4c624ece6 added A64FX support 2020-04-03 15:43:23 +02:00
Michael Marshall c69a3b6ef6 When saving eigenvectors, LapEvec now saves eigenvalues for every timeslice as well.
I.e. nT x nVec eigenvalues are saved in FileName.evals.conf.h5.
A new named tensor, "TimesliceEvals" can be used to simplify restoring these from disk.
NB: The changes in BaseIO add support so that Eigen tensors can be easily used in MPI operations, e.g. GlobalSum.
See LapEvec.hpp for an example of how this is done.
2020-01-29 21:20:20 +00:00
Michael Marshall 0ca1992151 Remove warning in tensor layout comparison. Make default names and index names visible for PerambTensor and NoiseTensor 2019-12-20 13:53:27 +00:00
gfilaci f7373e97a4 Missing conjugate in MooeeInvDag 2019-12-16 10:05:50 +01:00
Peter Boyle 848079e8ba Merge pull request #235 from grid-test-organisation/feature/5d-improvement
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
portelli 6446671a9c Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
Peter Boyle 9b6b0caa55 Junk commit fix 2019-12-09 03:01:58 -05:00
Peter Boyle 2a48617ac5 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-12-09 03:00:00 -05:00
Peter Boyle 58a31f0763 QMR implemented, preserve even if not used much 2019-12-09 02:59:13 -05:00
Peter Boyle 3d2fe80780 Temporary size depends on checkerboard/uncheckerboard. The Mdir cares 2019-12-09 02:58:24 -05:00
Peter Boyle 0dfdf80407 Logging 2019-12-09 02:54:52 -05:00
Peter Boyle 2912071f83 Add non hermitian operator 2019-12-09 02:51:53 -05:00
Peter Boyle 26605ef387 HDCR back to working 2019-12-09 02:51:01 -05:00
ferben f7698b93ca corrected comments about quark line directions 2019-12-06 09:46:52 +00:00
ferben a54157e682 more definitions changed 2019-12-05 17:08:09 +00:00
ferben b766038810 new syntax after merge 2019-12-04 18:08:00 +00:00
ferben cd9fd80a5d merged in develop 2019-12-04 17:12:46 +00:00
ferben e940f4db7e removed unused parameter parity 2019-12-03 12:01:31 +00:00
Michael Marshall 7983ff2fdd Merge branch 'develop' into feature/distil
* develop:
  Change to reporting
  NVCC timer support
  Fix nocompilee under NVCC
  --enable-summit flag
  IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could be a little faster
  Sliced propagator contraction was not producing any results because buf.size()=0
  several typos in hadrons
2019-11-30 16:47:03 +00:00
Michael Marshall 2db814f2b7 Resolve conflicts in BaryonUtils (just use latest from develop) 2019-11-29 18:19:35 +00:00
ferben 799ff0c96e speed-up 2019-11-26 15:28:47 +00:00
ferben 5fd5c25114 now two seperate functions for Eye and NonEye 2019-11-26 13:44:55 +00:00
Peter Boyle d1a89af8c9 Change to reporting 2019-11-22 10:49:10 -05:00
Peter Boyle d91ba1f6cc NVCC timer support 2019-11-21 20:11:19 +00:00
Peter Boyle f4d27e7090 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-11-21 20:09:31 +00:00
Peter Boyle feb1ff3494 Fix nocompilee under NVCC 2019-11-21 20:03:39 +00:00
Peter Boyle 98ea67b636 IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could
be a little faster
2019-11-21 15:00:46 -05:00
ferben 421a4395af Sigma to Nucleon contractions 2019-11-21 17:25:37 +00:00
Michael Marshall 22c654182a Fixes for GPU compile 2019-11-04 17:24:34 +00:00
Michael Marshall efe2f2d48b Merge branch 'develop' into feature/distil
* develop:
  Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
  Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions Fixed Lattice::reset not compiling with new Grid explicit memory region handling Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-11-01 15:38:48 +00:00
Peter Boyle ac614cbc53 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2019-10-31 11:46:43 -04:00
Peter Boyle ec8e060ec7 Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun 2019-10-31 11:46:09 -04:00
Michael Marshall 3b3680c64e Reversed Felix's interim A2Autils.h changes ... these were finished and went into develop via a separate branch 2019-10-30 15:50:04 +00:00
Michael Marshall 2a926b3dc6 Merged latest changes from develop, in preparation for release. 2019-10-30 14:52:34 +00:00
Chris K 845a045493 Merge pull request #233 from giltirn/lanczos_fix
A few run /compile / memory leak fixes
2019-10-30 10:21:59 -04:00
Michael Marshall eb8848a071 Merge branch 'develop' into feature/distil
* develop: (27 commits)
  Update README.md
  result layout standardised, iterator size more elegant
  updated syntac in Test_hadrons_spectrum
  chroma-regression test now prints difference correctly
  baryon input strings are now pairs of pairs of gammas - still ugly!!
  second update to pull request
  Changing back interface for Gamma3pt
  Removing old debug code
  Changes to A2Autils
  suggested changes for 1st pull request implemented
  changed input parameters for easier use
  Should compile everywhere now
  changed baryon interface
  added author information
  ready for pull request
  code compiling now - still need to test
  Baryons module works in 1 of 3 cases - still need SlicedProp and Msource part!!
  thread_for caused the problems - slow for loop for now
  still bugfix
  weird bug...
  ...

# Conflicts:
#	Hadrons/Modules.hpp
#	Hadrons/modules.inc
2019-10-30 14:13:00 +00:00
portelli c97f780784 Merge pull request #243 from fionnoh/feature/A2A_current_insertion
Feature/a2 a current insertion
2019-10-22 13:55:53 +01:00
portelli 202f025fc7 Merge pull request #242 from mmphys/feature/baryons
Feature/baryons
2019-10-16 15:06:32 +01:00
Michael Marshall 519ce19128 Fixes to enable GPU build. NB: Contractor and ContractorBenchmark still not working 2019-10-14 22:40:13 +01:00
Felix Erben 548b3bf43c second update to pull request 2019-10-09 14:52:33 +01:00
Fionn O hOgain 5de9547db5 Removing old debug code 2019-10-08 15:51:28 +01:00
Fionn O hOgain 6a3b09cf02 Merge branch 'develop' of github.com:fionnoh/Grid into feature/A2A_current_insertion 2019-10-08 13:25:51 +01:00
Fionn O hOgain 10de4bfc23 Changes to A2Autils 2019-10-08 13:24:56 +01:00