mirror of https://github.com/paboyle/Grid.git synced 2026-04-25 13:06:00 +01:00

Commit Graph

  • 48a340a9d1 GEN seems to defined by default -> some fixes applied nmeyer-ur 2020-05-08 10:47:49 +02:00
  • f45621109b placed typedefs in Optimization nmeyer-ur 2020-05-08 10:41:52 +02:00
  • 32d1a0bbea added even more debug output nmeyer-ur 2020-05-08 10:39:26 +02:00
  • 267cce66a1 added more debug output nmeyer-ur 2020-05-08 10:29:28 +02:00
  • 3417147b11 added real fma, corrected typos in tbls; integrated, must supply A64FXGCC with GEN in configure nmeyer-ur 2020-05-08 10:20:19 +02:00
  • b338719bc8 first transition to fixed-size done, excl. Exch; next step: integration nmeyer-ur 2020-05-07 22:33:28 +02:00
  • 2b81cbe2c2 first attempt to introduce tables using fixed-size; still incomplete nmeyer-ur 2020-05-07 22:01:19 +02:00
  • acff9d6ed2 transition to fixed size data types almost done; still incomplete nmeyer-ur 2020-05-07 21:24:07 +02:00
  • 053b4dd495 Merge pull request #282 from felixerben/baryon-reversal portelli 2020-05-07 18:09:17 +01:00
  • a306a49788 first mods for fixed size; still incomplete nmeyer-ur 2020-05-07 19:07:49 +02:00
  • 42bb5f0721 asserrtion ferben 2020-05-07 18:06:12 +01:00
  • 253bcc3426 back to old version ferben 2020-05-07 18:03:17 +01:00
  • a887206413 Merge pull request #281 from felixerben/feature/baryonSpeedup portelli 2020-05-07 13:41:29 +01:00
  • 591ebb6213 Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup ferben 2020-05-07 11:13:21 +01:00
  • 56e2f7d088 deleted test routines. cleaned up fast version. assert Ns=4,Nc=3. ferben 2020-05-07 10:03:45 +01:00
  • 7ef03c5368 updated SVE readme nmeyer-ur 2020-05-06 16:30:37 +02:00
  • 525418abfb Merge pull request #273 from lehner/feature/gpt Peter Boyle 2020-05-06 10:10:51 -04:00
  • 5f780806c2 Merge pull request #279 from paboyle/bugfix/nvcc-config Peter Boyle 2020-05-06 10:07:52 -04:00
  • 3c6ffcb48c Merge branch 'develop' into feature/gpt Christoph Lehner 2020-05-06 15:03:35 +02:00
  • 87984ece7d add Lattice_basis.h Christoph Lehner 2020-05-06 08:47:18 -04:00
  • e9b295f967 Synchronize blocking infrastructure with GPT Christoph Lehner 2020-05-06 08:42:28 -04:00
  • 224cbf0453 Merge pull request #280 from mmphys/bugfix/ET_go_home Peter Boyle 2020-05-05 17:56:51 -04:00
  • c1e57d4357 Merge branch 'develop' into bugfix/ET_go_home Michael Marshall 2020-05-05 22:35:04 +01:00
  • 28a1fcaaff First compile against SYCL Peter Boyle 2020-05-05 11:13:27 -07:00
  • 6b64727161 disable comments Christoph Lehner 2020-05-05 05:05:36 -04:00
  • 04863f8f38 debug new AcceleratorView Christoph Lehner 2020-05-04 16:07:03 -04:00
  • 04927d2e40 SYCL prep - no sycl just make it compile through DPC++ u37294 2020-05-04 10:28:29 -07:00
  • 7caed4edd9 dpc++ didn't like rdtsc() u37294 2020-05-04 10:27:05 -07:00
  • 59c51d2c35 Make compile if HAVE_LIME=0 u37294 2020-05-04 10:26:20 -07:00
  • ff53b231c8 Merge branch 'develop' of https://github.com/paboyle/Grid into develop u37294 2020-05-04 10:25:10 -07:00
  • fc19cf905b Lime optional u37294 2020-05-04 10:24:48 -07:00
  • 2a1387e992 rankInnerProduct Christoph Lehner 2020-05-03 17:27:11 -04:00
  • 9bfa51bffb cleanup comment Christoph Lehner 2020-05-03 09:12:52 -04:00
  • 38532753f4 interface cleanup Christoph Lehner 2020-05-03 08:58:32 -04:00
  • 949be9605c fix pragmas Christoph Lehner 2020-05-02 16:20:03 -04:00
  • 63cf201ee7 Add AdviseInfrequentUse Christoph Lehner 2020-05-02 11:38:42 -04:00
  • c8af498a2a BinaryIO fix for alternative little-endian format name (used in 96I ensemble) Christoph Lehner 2020-05-01 03:45:50 -04:00
  • ddb192bac7 re-work double precision promotion for summit Christoph Lehner 2020-04-30 16:09:57 -04:00
  • 7666300a6f Merge branch 'develop' into bugfix/ET_go_home Michael Marshall 2020-04-30 20:10:32 +01:00
  • 4a4b9e305d Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0". Michael Marshall 2020-04-30 19:40:04 +01:00
  • 9b2d2d0fc3 Basis rotate stack passig to GPU reduction Peter Boyle 2020-04-30 12:31:07 -04:00
  • 5011753f4f Clean up warning Peter Boyle 2020-04-30 10:23:48 -04:00
  • dbaeefaeef All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized) Michael Marshall 2020-04-30 15:02:51 +01:00
  • dee96cbf82 Added workaround in configure to still catch Cuda compiler when nvcc with extra arguments (eg -ccbin) is used as CXX Christopher Kelly 2020-04-29 10:37:11 -04:00
  • dd3ebc2ce4 Slow compile on NVCC switch off conserved current Peter Boyle 2020-04-29 08:43:12 -04:00
  • 103e7ae2f0 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-04-29 03:05:36 -04:00
  • 29ae5615c0 Seqeuential fix Peter Boyle 2020-04-29 03:05:15 -04:00
  • 6240e02619 added assertion to avoid potential infinite loop ferben 2020-04-27 18:50:53 +01:00
  • f4033ad8cb baryon speedup by a factor 2 ferben 2020-04-27 17:46:14 +01:00
  • 5abec5b8a9 SVE_readme update, update Grid_vector_types.h nmeyer-ur 2020-04-25 13:48:26 +02:00
  • 499edc0636 updated SVE_README.txt; defined ARMCLANGCOMPAT macro nmeyer-ur 2020-04-25 13:41:24 +02:00
  • d990e61be3 armclang 20.1 settings in SVE readme nmeyer-ur 2020-04-25 12:11:43 +02:00
  • 3edb2dc2da removed -static from gcc CXXFLAGS nmeyer-ur 2020-04-24 13:04:34 +02:00
  • f1fe444d4f blocked precision promotion infrastructure upgrade Christoph Lehner 2020-04-24 06:27:20 -04:00
  • 345721220e resolved merge conflict nils meyer 2020-04-24 10:14:21 +02:00
  • 6db68d6ecb added SVE configure for armclang and gcc nils meyer 2020-04-24 10:10:47 +02:00
  • dae820aa96 Merge pull request #277 from mmphys/bugfix/grid-config Peter Boyle 2020-04-23 10:26:54 -04:00
  • 5daf176f4a Updated to expose GRID_CXXLD in addition to CXXLD. NB: CXXLD required as this is what drives linking behaviour. Michael Marshall 2020-04-23 15:25:53 +01:00
  • e96c86ec14 Make grid-config message more specific for --cxx and --cxxld Michael Marshall 2020-04-23 13:10:45 +01:00
  • 09f0963d1f changes in configure.ac ; to be verified nmeyer-ur 2020-04-23 11:27:03 +02:00
  • 6f44e3c192 reverted changes in configure.ac ; included SVE configure readme nils meyer 2020-04-23 11:18:50 +02:00
  • c2c3cad20d Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-04-23 04:35:42 -04:00
  • edec9ee2e2 Conserved current rewrite done. Zmobius working Peter Boyle 2020-04-23 04:34:01 -04:00
  • ed70cce542 Test for 5D DWF obserevables Peter Boyle 2020-04-23 04:29:45 -04:00
  • 4701201b5f grid-config: Expose CXXLD (for GPU build) and update help Michael Marshall 2020-04-22 18:42:30 +01:00
  • 5893888f87 removed default no-strict-aliasing for gcc-10.0.1 exclusively nils meyer 2020-04-22 19:29:55 +02:00
  • 39b448affb Merge remote-tracking branch 'origin/develop' into feature/a64fx-2 nmeyer-ur 2020-04-22 17:34:12 +02:00
  • e54a8f05a9 Exchange1 with generic version for now, should use svtbl2 in final version nils meyer 2020-04-20 22:45:27 +02:00
  • 0782b76ed4 Merge pull request #274 from paboyle/feature/zmobius_paramcompute Peter Boyle 2020-04-20 14:39:29 -04:00
  • 0896f2cead Added missing include guards in bigfloat_double.h Christopher Kelly 2020-04-20 10:30:38 -04:00
  • 181709bba4 Merge branch 'develop' into feature/zmobius_paramcompute Christopher Kelly 2020-04-20 09:12:34 -04:00
  • 64b72fc17f testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only nils meyer 2020-04-19 01:25:40 +02:00
  • 091d5c605e towards more precise blocking Christoph Lehner 2020-04-17 04:25:28 -04:00
  • 6fdce60492 revised BodyA64FX; 990 GiB/s Wilson, 687 GiB/s DW using intrinsics (armclang 20.0) nils meyer 2020-04-16 22:43:32 +02:00
  • 90229cfb0f Merge pull request #270 from milc-qcd/feature/CGinfo Peter Boyle 2020-04-16 11:46:08 -04:00
  • 0475c46ecb Merge pull request #256 from djm2131/feature/BiCGSTAB Peter Boyle 2020-04-16 11:45:15 -04:00
  • 3cca10e617 Merge pull request #276 from nils-asmussen/fix/regression_nt Peter Boyle 2020-04-16 11:42:39 -04:00
  • 327da332bb Merge branch 'develop' of https://github.com/paboyle/Grid into feature/gpt Christoph Lehner 2020-04-16 11:30:17 -04:00
  • 852db4626a re-introduced HOTFIX cause Grid binaries give wrong results otherwise; checked in good gridverter.py nils meyer 2020-04-15 18:22:19 +02:00
  • 43dc2814dd fix regression in core/Test_qed.cc asmussen 2020-04-15 16:10:15 +01:00
  • 6504a098cc 999 GiB/s Wilson; 694 GiB/s DW (DP) nils meyer 2020-04-15 15:06:52 +02:00
  • 79a385faca disabled armclang hotfix cause armclang 20.0 performance gets a little nils meyer 2020-04-15 11:46:55 +02:00
  • c12a67030a 980 GiB/s Wilson; 680 GiB/s DW (DP) nils meyer 2020-04-15 10:55:06 +02:00
  • 581392f2f2 now with pf, best results so far using intrinsics+pf nils meyer 2020-04-12 22:06:14 +02:00
  • 113f277b6a enable dslash asm using -DA64FXASM, additionaly -DDSLASHINTRIN for intrinsics impl nils meyer 2020-04-11 04:55:01 +02:00
  • f3a8d039a2 Merge branch 'feature/hdcr' into develop Peter Boyle 2020-04-10 22:01:52 -04:00
  • 974586bedc Dslash finally works; cleaned up; uses MOVPRFX in assembly nils meyer 2020-04-10 22:26:40 +02:00
  • 4e864e56c9 develop pull portelli 2020-04-10 17:19:18 +01:00
  • 014dbfa464 Compile fix with OpDirAll Peter Boyle 2020-04-10 11:57:09 -04:00
  • 3b0e07882f Adding another form of polynomial Peter Boyle 2020-04-10 11:28:33 -04:00
  • 8e81a811d0 Merge branch 'feature/hdcr' into develop Peter Boyle 2020-04-10 11:14:49 -04:00
  • aa13118127 Missing conjugate already fixed in develop feature/hdcr Peter Boyle 2020-04-10 11:11:24 -04:00
  • 6cdb09c884 Faster copy region Peter Boyle 2020-04-10 11:10:52 -04:00
  • a65bc64f10 Accelerator peek poke Peter Boyle 2020-04-10 11:09:59 -04:00
  • 11dec4883c Don't throw assert Peter Boyle 2020-04-10 11:09:11 -04:00
  • afa458c812 Extra solvers Peter Boyle 2020-04-10 11:08:19 -04:00
  • dc50190b8f Faster GPU basis rotation May need to later include Regensburg optimised CPU variant Peter Boyle 2020-04-10 11:06:04 -04:00
  • 160f78c1e4 changed debug output to variable direct 3 nmeyer-ur 2020-04-10 12:23:07 +02:00
  • 7e4e1bbbc2 changed debug output to variable direct 2 nmeyer-ur 2020-04-10 12:22:04 +02:00
  • e699b7e9f9 changed debug output to variable direct nmeyer-ur 2020-04-10 12:18:30 +02:00