1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-23 18:22:02 +01:00

Commit Graph

  • d24d8e8398 Use X-direction as more bits meaningful on CUDA. 2^31-1 shoulddd always bee enough for SIMD and thread reduced local volume Peter Boyle 2020-05-12 10:35:49 -04:00
  • 162e4bb567 no automatic prefetching for now Christoph Lehner 2020-05-12 07:01:23 -04:00
  • 07c0c02f8c Speed up Cshift Peter Boyle 2020-05-11 17:02:01 -04:00
  • 8c31c065b5 Keep the Vector fixed to protect it from realloc Peter Boyle 2020-05-11 17:00:30 -04:00
  • b7c76ede29 Removed some assertions in Test_simd and removed exit() in Reduce nmeyer-ur 2020-05-11 22:43:00 +02:00
  • 05edf803bd corrected typo nmeyer-ur 2020-05-12 03:59:59 +09:00
  • b1c86900b2 Merge pull request #4 from paboyle/develop Christoph Lehner 2020-05-11 20:59:29 +02:00
  • 78b8e40f83 switched to gcc's internal data types nmeyer-ur 2020-05-11 18:11:23 +02:00
  • fc2e9850d3 temporarily enable TOFU by default when using A64FX or A64FXFIXEDSIZE nmeyer-ur 2020-05-11 13:25:02 +02:00
  • ffaaed679e MPI_THREAD_SINGLE hack for Fugaku, enabled by -DTOFU nmeyer-ur 2020-05-11 13:21:39 +02:00
  • bbbee5660d First compiile on HiP Peter Boyle 2020-05-10 05:28:09 -04:00
  • ea08f193e7 Allocator cache spliit into large/small pools Peter Boyle 2020-05-10 05:24:26 -04:00
  • 2bb2c68e15 Separate pools for small and large allocations cache Peter Boyle 2020-05-09 22:57:21 -04:00
  • efe5bc6a3c Split allocator cache into two pools of different sizes Peter Boyle 2020-05-09 22:27:56 -04:00
  • b2fd8b993a fixed-size clean up nmeyer-ur 2020-05-09 22:53:42 +02:00
  • 291ee8c3d0 updated fixed-size implementation; only Exch1 and prefetches missing nmeyer-ur 2020-05-09 22:18:02 +02:00
  • e1a5b3ea49 unions for tables eliminate explicit loads, gcc does not complain nmeyer-ur 2020-05-09 21:21:57 +02:00
  • 55a55660cb reverted changes nmeyer-ur 2020-05-09 12:48:42 +02:00
  • 384da487bd Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-05-08 18:55:11 -04:00
  • ee1de82a53 Working ITT benchmark again Peter Boyle 2020-05-08 18:54:50 -04:00
  • 2b576fc185 Comment deadd codde remove Peter Boyle 2020-05-08 18:54:29 -04:00
  • 52081acfa5 NVCC compile fixes Peter Boyle 2020-05-08 13:14:12 -04:00
  • b01b7f761a Merge pull request #283 from DanielRichtmann/feature/minor-fixes Peter Boyle 2020-05-08 10:52:03 -04:00
  • c83471bfd0 Fix missing checkerboards for adj und conjugate Daniel Richtmann 2020-04-23 10:54:19 +02:00
  • ab0c5d77fb Correct NonHermitianSchurOperatorBase Daniel Richtmann 2020-04-22 19:50:30 +02:00
  • 779e3c7442 Const-correctness for retrieval routines of GridStopWatch Daniel Richtmann 2020-04-21 13:30:08 +02:00
  • 0c570824f2 Add missing declaration of GridCmdOptionInt Daniel Richtmann 2020-04-21 13:26:43 +02:00
  • f8b8e00090 Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc Aim to reduce the amount of cuda and other code variations floating around all over the place. Peter Boyle 2020-05-08 06:23:55 -07:00
  • 0dd1bdfa94 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-05-08 09:21:43 -04:00
  • 1d65e2f62c Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics Lifteed from HDCR setup Peter Boyle 2020-05-08 09:20:54 -04:00
  • 93920c4811 Remove verbose Peter Boyle 2020-05-08 09:19:54 -04:00
  • 6859a3e1d4 Schur operator Peter Boyle 2020-05-08 09:19:12 -04:00
  • 21ca182c36 Comments remove Peter Boyle 2020-05-08 09:18:24 -04:00
  • ceb8b374da API change v3 nmeyer-ur 2020-05-08 15:04:44 +02:00
  • 4bc2ad2894 API change v2 nmeyer-ur 2020-05-08 15:00:25 +02:00
  • 798af3e68f retry changing StoD API nmeyer-ur 2020-05-08 14:34:59 +02:00
  • b0ef2367f3 testing alternate call to PrecisionChange nmeyer-ur 2020-05-08 14:22:44 +02:00
  • 71a7350a85 changed 2nd argument in Reduce to native vector type nmeyer-ur 2020-05-08 12:26:51 +02:00
  • 6f79369955 trying to get rid of macro definition error nmeyer-ur 2020-05-08 12:19:24 +02:00
  • f9cb6b979f corrected more typos nmeyer-ur 2020-05-08 12:11:01 +02:00
  • ed4d9d17f8 corrected type nmeyer-ur 2020-05-08 12:09:22 +02:00
  • fbed02690d some changes in breaking out A64FX: use -DA64FXFIXEDSIZE for fixed size, but also define GEN nmeyer-ur 2020-05-08 12:05:31 +02:00
  • 39f3ae5b1d corrected more types nmeyer-ur 2020-05-08 11:07:14 +02:00
  • e64bec8c8e pulled SVE typedefs out of Optimization nmeyer-ur 2020-05-08 11:04:21 +02:00
  • 0893b4e552 fixed typos in PrecisionChange nmeyer-ur 2020-05-08 10:59:07 +02:00
  • 92f0f29670 fixed double overloading vecf in Div, corrected typos nmeyer-ur 2020-05-08 10:57:23 +02:00
  • 48a340a9d1 GEN seems to defined by default -> some fixes applied nmeyer-ur 2020-05-08 10:47:49 +02:00
  • f45621109b placed typedefs in Optimization nmeyer-ur 2020-05-08 10:41:52 +02:00
  • 32d1a0bbea added even more debug output nmeyer-ur 2020-05-08 10:39:26 +02:00
  • 267cce66a1 added more debug output nmeyer-ur 2020-05-08 10:29:28 +02:00
  • 3417147b11 added real fma, corrected typos in tbls; integrated, must supply A64FXGCC with GEN in configure nmeyer-ur 2020-05-08 10:20:19 +02:00
  • b338719bc8 first transition to fixed-size done, excl. Exch; next step: integration nmeyer-ur 2020-05-07 22:33:28 +02:00
  • 2b81cbe2c2 first attempt to introduce tables using fixed-size; still incomplete nmeyer-ur 2020-05-07 22:01:19 +02:00
  • acff9d6ed2 transition to fixed size data types almost done; still incomplete nmeyer-ur 2020-05-07 21:24:07 +02:00
  • 053b4dd495 Merge pull request #282 from felixerben/baryon-reversal Antonin Portelli 2020-05-07 18:09:17 +01:00
  • a306a49788 first mods for fixed size; still incomplete nmeyer-ur 2020-05-07 19:07:49 +02:00
  • 42bb5f0721 asserrtion ferben 2020-05-07 18:06:12 +01:00
  • 253bcc3426 back to old version ferben 2020-05-07 18:03:17 +01:00
  • a887206413 Merge pull request #281 from felixerben/feature/baryonSpeedup Antonin Portelli 2020-05-07 13:41:29 +01:00
  • 591ebb6213 Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup ferben 2020-05-07 11:13:21 +01:00
  • 56e2f7d088 deleted test routines. cleaned up fast version. assert Ns=4,Nc=3. ferben 2020-05-07 10:03:45 +01:00
  • 7ef03c5368 updated SVE readme nmeyer-ur 2020-05-06 16:30:37 +02:00
  • 525418abfb Merge pull request #273 from lehner/feature/gpt Peter Boyle 2020-05-06 10:10:51 -04:00
  • 5f780806c2 Merge pull request #279 from paboyle/bugfix/nvcc-config Peter Boyle 2020-05-06 10:07:52 -04:00
  • 3c6ffcb48c Merge branch 'develop' into feature/gpt Christoph Lehner 2020-05-06 15:03:35 +02:00
  • 87984ece7d add Lattice_basis.h Christoph Lehner 2020-05-06 08:47:18 -04:00
  • e9b295f967 Synchronize blocking infrastructure with GPT Christoph Lehner 2020-05-06 08:42:28 -04:00
  • 224cbf0453 Merge pull request #280 from mmphys/bugfix/ET_go_home Peter Boyle 2020-05-05 17:56:51 -04:00
  • c1e57d4357 Merge branch 'develop' into bugfix/ET_go_home Michael Marshall 2020-05-05 22:35:04 +01:00
  • 28a1fcaaff First compile against SYCL Peter Boyle 2020-05-05 11:13:27 -07:00
  • 6b64727161 disable comments Christoph Lehner 2020-05-05 05:05:36 -04:00
  • 04863f8f38 debug new AcceleratorView Christoph Lehner 2020-05-04 16:07:03 -04:00
  • 04927d2e40 SYCL prep - no sycl just make it compile through DPC++ u37294 2020-05-04 10:28:29 -07:00
  • 7caed4edd9 dpc++ didn't like rdtsc() u37294 2020-05-04 10:27:05 -07:00
  • 59c51d2c35 Make compile if HAVE_LIME=0 u37294 2020-05-04 10:26:20 -07:00
  • ff53b231c8 Merge branch 'develop' of https://github.com/paboyle/Grid into develop u37294 2020-05-04 10:25:10 -07:00
  • fc19cf905b Lime optional u37294 2020-05-04 10:24:48 -07:00
  • 2a1387e992 rankInnerProduct Christoph Lehner 2020-05-03 17:27:11 -04:00
  • 9bfa51bffb cleanup comment Christoph Lehner 2020-05-03 09:12:52 -04:00
  • 38532753f4 interface cleanup Christoph Lehner 2020-05-03 08:58:32 -04:00
  • 949be9605c fix pragmas Christoph Lehner 2020-05-02 16:20:03 -04:00
  • 63cf201ee7 Add AdviseInfrequentUse Christoph Lehner 2020-05-02 11:38:42 -04:00
  • c8af498a2a BinaryIO fix for alternative little-endian format name (used in 96I ensemble) Christoph Lehner 2020-05-01 03:45:50 -04:00
  • ddb192bac7 re-work double precision promotion for summit Christoph Lehner 2020-04-30 16:09:57 -04:00
  • 7666300a6f Merge branch 'develop' into bugfix/ET_go_home Michael Marshall 2020-04-30 20:10:32 +01:00
  • 4a4b9e305d Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0". Michael Marshall 2020-04-30 19:40:04 +01:00
  • 9b2d2d0fc3 Basis rotate stack passig to GPU reduction Peter Boyle 2020-04-30 12:31:07 -04:00
  • 5011753f4f Clean up warning Peter Boyle 2020-04-30 10:23:48 -04:00
  • dbaeefaeef All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized) Michael Marshall 2020-04-30 15:02:51 +01:00
  • dee96cbf82 Added workaround in configure to still catch Cuda compiler when nvcc with extra arguments (eg -ccbin) is used as CXX Christopher Kelly 2020-04-29 10:37:11 -04:00
  • dd3ebc2ce4 Slow compile on NVCC switch off conserved current Peter Boyle 2020-04-29 08:43:12 -04:00
  • 103e7ae2f0 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-04-29 03:05:36 -04:00
  • 29ae5615c0 Seqeuential fix Peter Boyle 2020-04-29 03:05:15 -04:00
  • 6240e02619 added assertion to avoid potential infinite loop ferben 2020-04-27 18:50:53 +01:00
  • f4033ad8cb baryon speedup by a factor 2 ferben 2020-04-27 17:46:14 +01:00
  • 5abec5b8a9 SVE_readme update, update Grid_vector_types.h nmeyer-ur 2020-04-25 13:48:26 +02:00
  • 499edc0636 updated SVE_README.txt; defined ARMCLANGCOMPAT macro nmeyer-ur 2020-04-25 13:41:24 +02:00
  • d990e61be3 armclang 20.1 settings in SVE readme nmeyer-ur 2020-04-25 12:11:43 +02:00
  • 3edb2dc2da removed -static from gcc CXXFLAGS nmeyer-ur 2020-04-24 13:04:34 +02:00
  • f1fe444d4f blocked precision promotion infrastructure upgrade Christoph Lehner 2020-04-24 06:27:20 -04:00