mirror of https://github.com/paboyle/Grid.git synced 2026-06-21 03:08:15 +01:00

Commit Graph

  • 093bb02633 Hadrons: execute message for time diluted noise portelli 2018-08-10 16:07:48 +01:00
  • 99a85116f8 Hadrons: module and VM instrumentation portelli 2018-08-10 16:07:30 +01:00
  • 27cdb79063 Sha used to seed from a unique string paboyle 2018-08-10 15:11:01 +01:00
  • f4cbfd63ff Hadrons: more meson field cleaning, needs IO now portelli 2018-08-09 18:39:58 +01:00
  • 2b794b6aa7 Hadrons: module generating random lattices for testing purposes portelli 2018-08-09 17:16:42 +01:00
  • d0244a059f Hadrons: cleaning cleaning... portelli 2018-08-09 00:38:17 +01:00
  • dcdd891d7d Hadrons: precision fix portelli 2018-08-09 00:13:53 +01:00
  • 6d2df9de79 Hadrons: even more cleaning portelli 2018-08-08 23:15:55 +01:00
  • 41d4e37bae Hadrons: more cleaning portelli 2018-08-08 19:04:44 +01:00
  • ee5c0cc9b6 Hadrons: code cleaning portelli 2018-08-08 18:45:06 +01:00
  • 0a4020eb4d Hadrons: copyright fix portelli 2018-08-07 18:42:52 +01:00
  • b2de26589b Hadrons: code cleaning and copyright update portelli 2018-08-07 18:40:48 +01:00
  • 0677adb4dd Hadrons: overhaul of A2A for production portelli 2018-08-07 18:27:59 +01:00
  • 231cc95be6 Hadrons: eigenvalues precision fix portelli 2018-08-07 18:27:19 +01:00
  • 639f9cab82 Hadrons: schedule loading fix portelli 2018-08-07 18:26:49 +01:00
  • 4eac4e575e Hadrons: meson fields indentation fix portelli 2018-08-06 12:42:25 +01:00
  • 3f0f92cda6 Hadrons: first cleaning/integration of A2A/meson fields portelli 2018-08-06 12:11:52 +01:00
  • d2650e89bd Hadrons: VM exception for object type (solves infinite loop in scheduler) portelli 2018-08-06 12:11:00 +01:00
  • 2962123cba Hadrons: diluted noise polish portelli 2018-08-05 01:44:37 +01:00
  • 830168ec37 Hadrons: first try at diluted noise class (tested) portelli 2018-08-04 12:32:58 +01:00
  • 584c921ca0 Eigen support fix (use of Grid as a library was broken) portelli 2018-08-03 21:07:58 +01:00
  • 81347b4d16 gitignore update portelli 2018-08-03 19:58:52 +01:00
  • 2cfa0b0e6b Merge pull request #174 from fionnoh/a2a_basics portelli 2018-08-03 16:32:14 +01:00
  • fa5dee76b1 Included Peter's A2AMeson field and Eigen changes fionnoh 2018-08-03 15:15:54 +01:00
  • 8d1679c6b8 Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into a2a_basics fionnoh 2018-08-03 15:12:24 +01:00
  • 3791a38f7c Optimised the MesonField a bit more Peter Boyle 2018-08-01 08:27:27 +01:00
  • 142f7b0c86 Updated the A2A Meson Field module Peter Boyle 2018-07-31 15:58:02 +01:00
  • 891ad66eab Included changes to Hadrons RBPrecCG solver needed for subtraction of guess fionnoh 2018-07-31 11:26:07 +01:00
  • 60c43151c5 Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into feature/hadrons-a2a Peter Boyle 2018-07-31 01:09:02 +01:00
  • e036800261 Eigen fix paboyle 2018-07-31 01:08:42 +01:00
  • 62900def36 Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into feature/hadrons-a2a Peter Boyle 2018-07-31 00:36:26 +01:00
  • e3a309a73f Eigen happiness paboyle 2018-07-31 00:35:17 +01:00
  • ad6c1c0c4e The basics of what is needed in Grid and Hadrons for the A2A class and module, with none of the contraction or MF code. fionnoh 2018-07-30 18:40:50 +01:00
  • 00b92a91b5 Optimising Peter Boyle 2018-07-28 23:46:22 +01:00
  • 65533741f7 7 moms paboyle 2018-07-28 16:17:47 +01:00
  • dc0259fbda Merge pull request #173 from fionnoh/feature/hadrons-a2a Peter Boyle 2018-07-27 23:03:56 +01:00
  • 131a6785d4 Merge branch 'feature/hadrons-a2a' into feature/hadrons-a2a Peter Boyle 2018-07-27 23:03:42 +01:00
  • 44f4f5c8e2 Momentum loop paboyle 2018-07-27 23:00:16 +01:00
  • 2679df034f Changes to meson field benchmark. Now includes the gammas in the final part of the naive method, both methods compute lhs^dag*Gamma*rhs (previously Gamma*lhs^dag*rhs), and checks results. fionnoh 2018-07-27 18:31:10 +01:00
  • bf71162b97 Hadrons: backtrace on abort portelli 2018-07-26 19:20:12 +01:00
  • 299e828d83 Merge branch 'develop' into feature/hadrons portelli 2018-07-26 16:49:49 +01:00
  • ef5452cddf Hadrons: smarter memory profiler portelli 2018-07-26 16:47:45 +01:00
  • 80de748737 Hadrons: new exceptions which can save an integer portelli 2018-07-26 16:47:25 +01:00
  • 71e1006ba8 Updated meson field benchmark for dirac structures paboyle 2018-07-26 09:09:29 +01:00
  • 00f31ae83f Merge pull request #163 from goracle/unstaged portelli 2018-07-25 19:00:00 +00:00
  • cce339deaf Merge pull request #172 from fionnoh/feature/hadrons portelli 2018-07-25 17:20:19 +00:00
  • 24128ff109 Changes needed for MF benchmark to work with comms correctly fionnoh 2018-07-23 15:51:37 +01:00
  • da17a015c7 Pack the stencil smaller for 128 bit access Peter Boyle 2018-07-23 06:12:45 -04:00
  • 1fd08c21ac make simd width configure time option for GPU Peter Boyle 2018-07-23 06:10:55 -04:00
  • 28db0631ff Hack to force 128bit accesses Peter Boyle 2018-07-23 06:10:27 -04:00
  • b35401b86b Fix CUDA_ARCH. Need to simplify. See when new eigen release happens Peter Boyle 2018-07-23 06:09:33 -04:00
  • a0714de8ec Define vector length for GPU Peter Boyle 2018-07-23 06:09:05 -04:00
  • 21a1710b43 Verbose vector length Peter Boyle 2018-07-23 06:08:39 -04:00
  • 34e9d3f0ca Moved the creation and resizing of the v and w high modes from the A2A class to the A2A module and made them an output of the module. This means that they have to be inputs of the contraction modules and they will be freed from memory if they are no longer needed. fionnoh 2018-07-22 14:40:31 +01:00
  • c995788259 Added ImportUnphysicalFermion and included appropriate logic for 5d w vectors in A2A code fionnoh 2018-07-21 00:08:11 +01:00
  • 94c7198001 Added ZFIMPL to A2AMeson contraction fionnoh 2018-07-20 23:08:22 +01:00
  • 04d86fe9f3 Removed overly verbose print statement fionnoh 2018-07-20 21:38:19 +01:00
  • b78074b6a0 Removed a Dminus from high mode v and removed duplication of D_oo code fionnoh 2018-07-20 16:55:24 +01:00
  • 7dfd3cdae8 Inclusion of ExportPhysicalFermionSource that fixes a bug in the low mode w vectors fionnoh 2018-07-20 15:45:43 +01:00
  • cecee1ef2c Merge branch 'develop' of github.com:paboyle/Grid into feature/hadrons fionnoh 2018-07-20 13:37:50 +01:00
  • 355d4b58be Merge branch 'feature/hadrons' of github.com:fionnoh/Grid into feature/hadrons fionnoh 2018-07-19 16:07:54 +01:00
  • 2c54a536f3 Moved the meson field inner product to its own header file fionnoh 2018-07-19 15:56:52 +01:00
  • d868a45120 Cleaned up some stuff that was erroneously included in a previous "trash" commit. Leaving in the mySliceInnerProdct function for now as it speeds up mesonfield creation quite a lot for 24^3 tests fionnoh 2018-07-16 16:19:59 +01:00
  • 9deae8c962 A2A meson field contraction code fionnoh 2018-07-16 14:18:45 +01:00
  • b2b5137d28 Finally starting to get decent performance on Volta Peter Boyle 2018-07-13 12:06:18 -04:00
  • 751fae9f0d Changing boundary phase to be always double feature/BCG Chulwoo Jung 2018-07-10 12:18:12 -07:00
  • db86cdd7bd Possible trash commit fionnoh 2018-07-10 13:30:45 +01:00
  • ec9939c1ba Test for faster implementation of meson field inner loop This should be possible to cache block at outer levels, global sum across nodes not performed and deferred to caller to block them all into a big all reduce. Nc=3 and Fermion is hard coded in an ugly way. We might think about benchmarking whether a product without the conjugate should be made available by Grid. paboyle 2018-07-10 12:38:51 +01:00
  • 2cc07450f4 Fastest option for the dslash Peter Boyle 2018-07-05 09:57:55 -04:00
  • c0e8bc9da9 Current version gets 250 - 320 GF/s on Volta on the target 12^4 volume. Peter Boyle 2018-07-05 07:10:25 -04:00
  • b1265ae867 Prettify code Peter Boyle 2018-07-05 07:08:06 -04:00
  • 32bb85ea4c Standard extractLane is fast Peter Boyle 2018-07-05 07:07:30 -04:00
  • ca0607b6ef Clearer kernel call meaning Peter Boyle 2018-07-05 07:06:15 -04:00
  • 19b527e83f Better extract merge for GPU. Let the SIMD header files define the pointer type for access. GPU redirects through builtin float2, double2 for complex Peter Boyle 2018-07-05 07:05:13 -04:00
  • 4730d4692a Fast lane extract, saturates bandwidth on Volta for SU3 benchmarks Peter Boyle 2018-07-05 07:03:33 -04:00
  • 1bb456c0c5 Minor GPU vector width change Peter Boyle 2018-07-05 07:02:04 -04:00
  • 4b04ae3611 Printing improvement Peter Boyle 2018-07-05 06:59:38 -04:00
  • 2f776d51c6 Gpu specific benchmark saturates memory. Can enhance Grid to do this for expressions, but a bit of (known) work. Peter Boyle 2018-07-05 06:58:37 -04:00
  • f74617c124 Added ZFIMPL to meson field module fionnoh 2018-07-03 14:04:53 +01:00
  • 8c6a3921ed Merge remote-tracking branch 'upstream/feature/hadrons' into feature/hadrons fionnoh 2018-07-03 11:35:14 +01:00
  • a8a15dd9d0 Hadrons: code cleaning portelli 2018-07-02 17:52:39 +01:00
  • 3ce68a751a Hadrons: stout smearing module portelli 2018-07-02 17:52:04 +01:00
  • daa0977d01 Included a print statement that indicates that the guess is being subtracted from the solve. fionnoh 2018-06-28 16:34:56 +01:00
  • a2929f4384 Removed A2A contraction module and replaced it with the beginnings of a meson field module fionnoh 2018-06-28 16:17:26 +01:00
  • 7fe3974c0a Included eigenPacks and action as references, not inputs, of A2A module. They now no longer need to be parameters in the meson field modules. fionnoh 2018-06-28 16:14:49 +01:00
  • f7e86f81a0 Changes A2A class to make use of the new Solver class fionnoh 2018-06-28 16:14:16 +01:00
  • fecec803d9 Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons fionnoh 2018-06-28 16:13:43 +01:00
  • 8fe9a13cdd Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons fionnoh 2018-06-28 16:13:07 +01:00
  • 3a50afe7e7 GPU dslash updates paboyle 2018-06-27 22:32:21 +01:00
  • f8e880b445 Loop for s and xyzt offlow paboyle 2018-06-27 21:49:57 +01:00
  • 3e947527cb Move looping over "s" and "site" into kernels for GPU optimisation paboyle 2018-06-27 21:29:43 +01:00
  • 31f65beac8 Move site and Ls looping into the kernels paboyle 2018-06-27 21:28:48 +01:00
  • 38e2a32ac9 Single SIMD lane operations for CUDA paboyle 2018-06-27 21:28:06 +01:00
  • efa84ca50a Keep Cuda 9.1 happy paboyle 2018-06-27 21:27:32 +01:00
  • 5e96d6d04c Keep CUDA happy paboyle 2018-06-27 21:27:11 +01:00
  • df30bdc599 CUDA happy paboyle 2018-06-27 21:26:49 +01:00
  • 7f45222924 Diagnostics on memory alloc fail paboyle 2018-06-27 21:26:20 +01:00
  • dd891f5e3b Use NVCC to suppress device Eigen paboyle 2018-06-27 21:25:17 +01:00
  • d2c42e6f42 Hadrons: scaled DWF action portelli 2018-06-26 14:59:33 +01:00
  • 2881b3e8e5 WilsonMG: Remove unnecessary static assertions Daniel Richtmann 2018-06-26 14:40:59 +02:00