1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-23 02:02:02 +01:00

Commit Graph

  • ddbb008694 Merge pull request #10 from lehner/feature/gpt-sycl Christoph Lehner 2020-07-30 13:12:09 +02:00
  • 7997e0a449 Merge branch 'feature/gpt' into feature/gpt-sycl Christoph Lehner 2020-07-30 13:11:31 +02:00
  • 197612bc7a fast cpu basisRotate and other small cleanups Christoph Lehner 2020-07-30 07:08:54 -04:00
  • 0e88bf4bff remove Nils's default pragma Christoph Lehner 2020-07-29 10:24:35 -04:00
  • 3e64d78469 include versions.h again and add back asserts in Test_simd Christoph Lehner 2020-07-29 10:18:05 -04:00
  • 2004611def Merge pull request #9 from nmeyer-ur/feature/a64fx-2 Christoph Lehner 2020-07-29 14:54:20 +02:00
  • a2868c96a4 Merge pull request #8 from paboyle/develop Christoph Lehner 2020-07-29 14:10:07 +02:00
  • 69bd7082f1 Merge branch 'rmhmc_fix' into feature/rmhmc feature/rmhmc Chulwoo Jung 2020-07-23 14:47:24 -04:00
  • 873f039c72 Test_hmc_WilsonGauge_Implicit compiles Chulwoo Jung 2020-07-23 14:45:47 -04:00
  • 16303e5f16 T_hmc_WilsonGauge_Implicit not compiling with latest develop Chulwoo Jung 2020-07-22 23:00:58 -04:00
  • 88683fa648 Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rmhmc Chulwoo Jung 2020-07-22 15:54:07 -04:00
  • 43298ef681 Make precision switchable for cublas routines Chulwoo Jung 2020-07-22 14:49:32 -04:00
  • 7cf7f11e1a Doc recompile Peter Boyle 2020-07-22 14:44:11 -04:00
  • ea7f8fda5e fix typo nmeyer-ur 2020-07-22 09:34:05 +02:00
  • 906b78811b exit in Init when using --comms-overlap nmeyer-ur 2020-07-22 08:57:01 +02:00
  • 7e70df27e4 Confirmed double precision working Chulwoo Jung 2020-07-21 00:48:46 -04:00
  • c55d657736 Merge branch 'dev-BlockLanczosOpt' of https://github.com/yongchull/Grid into feature/block_lanczos Chulwoo Jung 2020-07-20 16:36:46 -04:00
  • fe5b23e144 RMHMC implementation, originaly from Guido Cossu Chulwoo Jung 2020-07-19 01:42:31 -04:00
  • e1327e7ea0 Optional bounds check debug code Peter Boyle 2020-07-16 16:57:46 -04:00
  • 569f78c2cf Stenccil improvement Peter Boyle 2020-07-16 16:57:13 -04:00
  • 488c79d5a1 Bound improvement Peter Boyle 2020-07-15 19:58:08 -04:00
  • 97703b181b Merge pull request #7 from paboyle/develop Christoph Lehner 2020-07-12 16:24:53 +02:00
  • d9474c6cb6 compiler-independent build using --enable-simd=A64FX nmeyer-ur 2020-07-09 10:07:02 +02:00
  • bbd145382b enable --enable-simd=A64FX in configure nmeyer-ur 2020-07-08 12:43:51 +02:00
  • 1b08cb7300 Merge branch 'develop' into feature/a64fx-2 nmeyer-ur 2020-07-08 08:18:18 +02:00
  • 337d9dc043 move barrier in Benchmark_wilson nmeyer-ur 2020-07-08 08:13:40 +02:00
  • 8726e94ea7 merge upstream develop nmeyer-ur 2020-07-07 20:26:47 +02:00
  • 67db4993c2 reset head, update SVE readme nmeyer-ur 2020-07-07 19:54:52 +02:00
  • f1f655d92b Merge pull request #304 from Heinrich-BR/develop syck Antonin Portelli 2020-07-06 10:16:03 +01:00
  • 43334e88c3 Tiny change in a comment for clarity Henrique B.R 2020-07-04 16:11:16 +01:00
  • 4f1e66b044 Fixed HMC SU(N) integrator which was causing fields to leave Lie Algebra manifold for N>2 Henrique B.R 2020-07-04 03:53:06 +01:00
  • dc6b0f20b2 Fixed array bounds Peter Boyle 2020-07-02 12:20:20 -04:00
  • c0badc3e16 Summit bounce back to git Peter Boyle 2020-07-02 10:48:39 -04:00
  • fd3c8b0e85 correct build instructions qp4 nmeyer-ur 2020-07-01 09:00:38 +02:00
  • 58f6529b55 Slowly piecing together Peter Boyle 2020-06-30 16:42:03 -04:00
  • e3f056dfbb Hw multigrid operator Peter Boyle 2020-06-30 16:10:16 -04:00
  • da0ffa7a79 Two spin update defer commit to repository Peter Boyle 2020-06-30 16:09:48 -04:00
  • fcc7640b9c Detect a coarsened vector Peter Boyle 2020-06-30 14:17:45 -04:00
  • 0cbe2859e0 Making progress on Hw based 5d coarse matrix Peter Boyle 2020-06-30 14:17:20 -04:00
  • 1635c263ee disable TOFU by default nmeyer-ur 2020-06-30 19:27:08 +02:00
  • 64fe5b21b4 Merge pull request #298 from rrhodgson/feature/baryon Antonin Portelli 2020-06-29 18:45:00 +01:00
  • ee9889821d Runs through to coarse space solve Peter Boyle 2020-06-29 12:59:52 -04:00
  • eb470aa6dc Update to baryon and added comments/fix whitespace Raoul Hodgson 2020-06-29 09:43:01 +01:00
  • 77af9a3ddc Baryon revert sign Raoul Hodgson 2020-06-26 10:08:42 +01:00
  • 102089798c BaryonUtils: update to autoView Raoul Hodgson 2020-06-25 16:41:58 +01:00
  • 39cea8b5a7 Merge branch 'develop' into feature/baryon Raoul Hodgson 2020-06-25 16:24:07 +01:00
  • a65f66d2db Merge branch 'feature/baryon3pt' into feature/baryon Raoul Hodgson 2020-06-25 16:20:59 +01:00
  • 936c5ecf69 Reduction GPU no compile fix Peter Boyle 2020-06-24 17:28:31 -04:00
  • 22cfbdbbb3 Boost precision in inner products in single Peter Boyle 2020-06-24 12:52:31 -04:00
  • 093d1ee21b Force initial values Peter Boyle 2020-06-24 08:54:49 -04:00
  • d6ba2581ce Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-06-24 08:25:08 -04:00
  • 577c064184 Memory manager initialise earlier Peter Boyle 2020-06-24 08:24:38 -04:00
  • 2ff1fa6fad UVM used shared for CPU alloccations andd ddont migrate Peter Boyle 2020-06-23 22:14:56 -04:00
  • 70be1bd8be Adding code under development Peter Boyle 2020-06-23 10:24:21 -04:00
  • 4ef50ba31f Baryon speedup Raoul Hodgson 2020-06-23 11:44:20 +01:00
  • 3e97a26f90 BaryonGamm3pt threads -> accelerator Raoul Hodgson 2020-06-23 11:35:32 +01:00
  • 599f28f6ef Baryon bug fixes Raoul Hodgson 2020-06-23 11:10:26 +01:00
  • c48da35921 Memory Vector UVM and Lattice alignedAllocator separate sycl Peter Boyle 2020-06-22 20:21:53 -04:00
  • 6c5fa8dcd8 Aligned allocate on CPU put through this interface Peter Boyle 2020-06-20 14:34:29 -04:00
  • 0d2f913a1a String.h for linux Peter Boyle 2020-06-20 09:37:31 -04:00
  • 5b117865b2 Merge pull request #6 from paboyle/sycl Christoph Lehner 2020-06-20 09:44:44 +02:00
  • 1a74816c25 Hopeefully fixed Peter Boyle 2020-06-19 17:50:52 -04:00
  • 73de335256 Merge branch 'develop' into sycl Peter Boyle 2020-06-19 17:44:16 -04:00
  • 228fd450ce Typo fix (excusee - my keyboard is starting to break) Peter Boyle 2020-06-19 17:36:05 -04:00
  • b949cf6b12 PeekLocal needs a view to keep thread safe. ALLOCATION_CACHEE reenable Peter Boyle 2020-06-19 17:13:27 -04:00
  • 11bc1aeadc TThread count defaultt to fastest Peter Boyle 2020-06-19 14:30:35 -04:00
  • 66005929af Set up the cache size on all ranks Peter Boyle 2020-06-19 12:50:54 -04:00
  • 05bbc49a99 Edge case in GetShmDim check Christoph Lehner 2020-06-19 12:01:23 -04:00
  • ff7c847735 Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl Peter Boyle 2020-06-19 01:22:16 -04:00
  • 1aa988b2af Comms overlap fix UVM case Peter Boyle 2020-06-19 01:21:14 -04:00
  • edf17708a8 Range improvement Peter Boyle 2020-06-18 22:41:06 -04:00
  • 81a8209749 ConvertType for blockInnerProduct Christoph Lehner 2020-06-18 11:53:21 -04:00
  • a87e45ba25 SVE readme update nmeyer-ur 2020-06-18 11:23:08 +02:00
  • 465856331a switch back to serialized; wrong results on single too nmeyer-ur 2020-06-15 15:39:39 +02:00
  • cc958aa9ed switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap nmeyer-ur 2020-06-15 14:21:38 +02:00
  • f46f029dbb Merge pull request #292 from lehner/feature/gpt-sycl Peter Boyle 2020-06-14 13:43:27 -04:00
  • 3dccd7aa2c Catch edge case in SharedMemoryMPI::GetShmDims; Change default units to consistent MB in init args; Want last element not past last element in MemoryManagerCache.cc Christoph Lehner 2020-06-14 13:26:01 -04:00
  • a25e4b3d0c pred 32/64 for float/double instead of 8 in VLA patch nmeyer-ur 2020-06-13 14:44:37 +02:00
  • d1210ca12a switch to double/float instead of float64_t/float32_t in VLA patch nmeyer-ur 2020-06-13 13:59:32 +02:00
  • 36ea0e222a type traits for ComplexF/D in VLA patch; cosmetics in VLS intrinsics nmeyer-ur 2020-06-13 13:42:35 +02:00
  • 65e6e7da6f Merge pull request #291 from lehner/feature/gpt-sycl Peter Boyle 2020-06-12 20:42:32 -04:00
  • b5e87e8d97 summit compile fixes Christoph Lehner 2020-06-12 18:16:12 -04:00
  • 5f5807d60a cleanup Christoph Lehner 2020-06-12 14:48:23 -04:00
  • 92281ec22d add 3 op Mult for VLA nmeyer-ur 2020-06-12 18:49:05 +02:00
  • 87266ce099 comment out fcmla in vector types: need also MultAddReal nmeyer-ur 2020-06-12 18:37:19 +02:00
  • 2a23f133e8 reenable fcmla for VLA nmeyer-ur 2020-06-12 17:30:38 +02:00
  • 8dbf790f62 correct tbl2 for sp nmeyer-ur 2020-06-12 17:12:34 +02:00
  • 2402b4940e vec_imm in float nmeyer-ur 2020-06-12 15:17:38 +02:00
  • 2111052fbe apply VLA patch for memcpy reduction suggested by Arm, CAS-162542-D6W7Z7 nmeyer-ur 2020-06-12 14:49:19 +02:00
  • 7974acff54 merged sycl to feature-gpt Christoph Lehner 2020-06-12 06:49:38 -04:00
  • f0d17d2b49 Added Baryon3pt code Raoul Hodgson 2020-06-12 11:35:52 +01:00
  • 244c003a1b Updated Baryon code Raoul Hodgson 2020-06-12 11:00:25 +01:00
  • 0174f5f742 look for librt when using shm=shmopen Antonin Portelli 2020-06-11 16:50:43 +01:00
  • 32b2b59be4 Offload Peter Boyle 2020-06-10 20:36:26 -04:00
  • 86bb0cc24b Keep on GPU Peter Boyle 2020-06-10 20:00:00 -04:00
  • 84c19587e7 Offload Peter Boyle 2020-06-10 19:59:31 -04:00
  • 237ce92540 Offload loops Peter Boyle 2020-06-10 19:59:11 -04:00
  • a7ffc61e82 acceleratorSIMTlane() Peter Boyle 2020-06-10 19:58:33 -04:00
  • fd97f64612 Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl Peter Boyle 2020-06-10 12:58:13 -04:00
  • 8720aecb80 Offload more loops Peter Boyle 2020-06-10 12:57:55 -04:00