mirror of https://github.com/paboyle/Grid.git synced 2025-09-19 17:51:04 +01:00

Commit Graph

  • 4d1aa134b5 Use normal reduction, configure flag to force deterministic Peter Boyle 2024-10-15 14:32:11 +00:00
  • 5ec879860a Odd rounding issue - bears looking into Peter Boyle 2024-10-15 14:30:54 +00:00
  • f617468e04 Update Lattice_base.h Peter Boyle 2024-10-11 10:39:16 -04:00
  • b728af903c Fast axpy norm under CFLAG Peter Boyle 2024-10-11 03:23:09 +00:00
  • 54f1999030 axpy_norm_fast -- wasn't using the deterministic MPI sum causing issues Peter Boyle 2024-10-11 03:22:18 +00:00
  • fd58f0b669 Return ok Peter Boyle 2024-10-11 03:21:21 +00:00
  • c5c67b706e cl::sycl -> SYCL Peter Boyle 2024-10-10 22:04:12 +00:00
  • be7a543e2c Revert barriers -- these were not the problem Peter Boyle 2024-10-10 22:03:29 +00:00
  • 68f112d576 New software moves cl::sycl Peter Boyle 2024-10-10 22:03:04 +00:00
  • ec1395a304 Better flight logging Peter Boyle 2024-10-10 22:01:57 +00:00
  • beb0e474ee Use deterministic own brand reduction Peter Boyle 2024-10-10 22:01:24 +00:00
  • 2b5fdcbbc5 New software version Peter Boyle 2024-10-10 21:59:02 +00:00
  • 295127d456 Deterministic homebrew reduction Peter Boyle 2024-10-10 21:58:26 +00:00
  • 7dcfb13694 New software stack Peter Boyle 2024-10-10 21:57:35 +00:00
  • ee4046fe92 Added a dimension ordered column sum based reduction for scalar. Removes dependence on MPI_Allreduce and allows for work around on systems where this is bollox. Peter Boyle 2024-09-27 09:26:03 -04:00
  • 2a9cfeb9ea New files Peter Boyle 2024-09-26 14:23:29 -04:00
  • 1147b8ea40 Cheby poly setup Peter Boyle 2024-09-26 14:20:32 -04:00
  • 3f9119b39d Remove vectors used for the power spectrum table in paper Peter Boyle 2024-09-26 14:19:41 -04:00
  • 35e8225abd Verbose control Peter Boyle 2024-09-26 14:18:35 -04:00
  • bdbfbb7a14 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-09-26 14:05:45 -04:00
  • f7d4be8d96 Calculate bytes correctly Peter Boyle 2024-09-26 14:04:44 -04:00
  • 9fa8bd6438 Configure for AOT on Aurora latest software Peter Boyle 2024-09-23 11:25:44 +00:00
  • 02c8178f16 Almost working on Aurora Peter Boyle 2024-09-23 09:43:50 +00:00
  • e637fbacae Verbose remove Peter Boyle 2024-09-23 09:42:43 +00:00
  • bffd30abec Optimise lie algebra project [feature/fthmc-optimise] Peter Boyle 2024-09-19 15:48:09 -04:00
  • 066544281f Deprecate UVM Peter Boyle 2024-09-17 13:34:27 +00:00
  • 11be10d2c0 Aurora testing Peter Boyle 2024-09-10 18:11:52 +00:00
  • 160969a758 UVM tester, doesn't turn up anything Peter Boyle 2024-09-10 18:09:42 +00:00
  • 622f78ebea SYCL updates -- operator = giving trouble on Aurora. SYCL reduction is failing intermittently with SVM interface - returns zero, expect non-zero. Think I need to remove ALL dependence on SVM. Peter Boyle 2024-09-04 13:53:48 +00:00
  • aa67a5b095 Rename Peter Boyle 2024-08-27 19:54:01 +00:00
  • af9ea0864c Blas fix Peter Boyle 2024-08-27 19:53:09 +00:00
  • 4e2a6d87c4 Gemm batched fix Peter Boyle 2024-08-27 19:24:05 +00:00
  • a465ecece9 Aurora Peter Boyle 2024-08-27 19:17:25 +00:00
  • 575eb72182 Converges on 16^3 Peter Boyle 2024-08-20 18:10:08 +00:00
  • 3a973914d6 Compile on frontier Peter Boyle 2024-08-27 14:55:42 -04:00
  • f568c07bbd Improved the BLAS benchmark Peter Boyle 2024-08-27 14:53:12 -04:00
  • 2c9878fc3a Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-08-27 12:05:46 -04:00
  • 27b1b1b005 Checkerboard available for offloading pickCheckerboard Peter Boyle 2024-08-27 12:04:09 -04:00
  • 130d7ab077 Verbose changes Peter Boyle 2024-08-27 12:03:28 -04:00
  • 29f6b8a74a Setup Peter Boyle 2024-08-27 12:02:49 -04:00
  • 9779aaea33 16^3 optimise Peter Boyle 2024-08-27 11:38:35 -04:00
  • ec25604a67 Fastest solver for mrhs multigrid Peter Boyle 2024-08-27 11:32:34 -04:00
  • 3668e81c5e Extract slice working on checkerboard field for Block Lanczos Peter Boyle 2024-08-27 11:31:30 -04:00
  • d66b2423cb Move slice operations to GPU for BlockCG Peter Boyle 2024-08-27 11:28:47 -04:00
  • 15cc78f0b6 peek/poke local site on checkerboard arrays Peter Boyle 2024-08-27 11:23:42 -04:00
  • 06db4ddea2 Fast init on GPU Peter Boyle 2024-08-27 11:22:33 -04:00
  • 6cfb90e99f Support needed for accelerator resident set/pick Checkerboard Peter Boyle 2024-08-27 11:19:00 -04:00
  • d8be95a2a3 Don't early terminate power method to get more accurate top EV Peter Boyle 2024-08-27 11:17:37 -04:00
  • f82702872d Normal residual Peter Boyle 2024-08-27 11:16:44 -04:00
  • 3752c49ef0 Add option to record the CG polynomial Peter Boyle 2024-08-27 11:14:35 -04:00
  • fe65fa4988 MulMatrix Peter Boyle 2024-08-27 11:13:18 -04:00
  • 1fe4c205a3 Adef Peter Boyle 2024-08-27 11:11:47 -04:00
  • d4dc5e0f43 BlockCG linalg acceleration with BLAS Peter Boyle 2024-08-27 11:08:33 -04:00
  • 77944437ce Functor initialisation Peter Boyle 2024-08-27 11:01:02 -04:00
  • c164bff758 MMdag Peter Boyle 2024-08-27 11:00:36 -04:00
  • aa2e3d954a MMdag operator Peter Boyle 2024-08-27 10:59:29 -04:00
  • de62b04728 Block CG linalg acceleration Peter Boyle 2024-08-27 10:58:54 -04:00
  • d0bdb50f24 Analyse power spectrum Peter Boyle 2024-08-27 10:58:19 -04:00
  • da919949f9 Clean up the accelerator pick/set checkerboard Peter Boyle 2024-08-23 12:34:41 -04:00
  • b12b4fdaff Attempt at operating on half checkerboard Peter Boyle 2024-08-23 11:05:09 -04:00
  • a8fecbc609 BlockCG linalg via BLAS Peter Boyle 2024-08-21 16:08:16 -04:00
  • 557fa483ff Blas benchmark committed stand alone Peter Boyle 2024-08-20 16:18:43 +00:00
  • fc15d55df6 Mallinfo Peter Boyle 2024-08-20 14:33:09 +00:00
  • 53573d7d94 Better benchmark Peter Boyle 2024-08-20 14:31:57 +00:00
  • bb3c177000 Better benchmarking Peter Boyle 2024-08-20 14:31:41 +00:00
  • a3322b470f Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-08-20 14:30:52 +00:00
  • f8f408e7a9 BLAS everywhere Peter Boyle 2024-07-25 18:09:02 +00:00
  • baac1127d0 Later intel compiler happiness Peter Boyle 2024-07-25 18:06:05 +00:00
  • 6f1328160c Remove SVM use Peter Boyle 2024-07-25 18:05:40 +00:00
  • 04cf902791 Mallinfo and ASAN hooks Peter Boyle 2024-07-25 18:04:56 +00:00
  • 7a5b1c1a19 Try Catch convenience macro Peter Boyle 2024-07-25 18:03:41 +00:00
  • 18d2d7da4a Eigen implementation and SYCL implementation Peter Boyle 2024-07-25 18:02:56 +00:00
  • b461184797 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-07-23 09:53:58 -04:00
  • 4563b39305 New Frontier config Peter Boyle 2024-07-23 09:53:08 -04:00
  • c9d5674d5b Final for paper Peter Boyle 2024-07-22 15:26:45 -04:00
  • 486412635a 8^4 test for PETSc Peter Boyle 2024-07-22 15:25:17 -04:00
  • 8b23a1546a Force compile temporarily Peter Boyle 2024-07-22 15:24:56 -04:00
  • a901e4e369 Regressed performance for paper Peter Boyle 2024-07-22 15:24:04 -04:00
  • 804d9367d4 Regressed performance Peter Boyle 2024-07-22 15:23:25 -04:00
  • 41d8adca95 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-07-11 15:38:45 +00:00
  • 059e8e5bb0 New compile option Peter Boyle 2024-07-11 15:37:30 +00:00
  • b3ee8ded96 Respect command line Peter Boyle 2024-07-11 15:34:48 +00:00
  • cf3584ad15 Convenient to monitor memory across an HMC trajectory Peter Boyle 2024-07-11 15:30:32 +00:00
  • a66973163f Device vector not UVM Peter Boyle 2024-07-11 15:24:11 +00:00
  • 4502a8c8a1 libc malloc heap info dump on Linux Peter Boyle 2024-07-11 15:22:18 +00:00
  • 9c902e4c2d Batched blas, but not working yet on OneAPI Peter Boyle 2024-07-11 15:19:49 +00:00
  • f3eb36adcf Namespace addition Peter Boyle 2024-07-11 15:19:19 +00:00
  • 7c246606c1 Schur additional case Peter Boyle 2024-07-10 22:04:32 +00:00
  • 172c75029e Redblack additional case Peter Boyle 2024-07-10 22:03:59 +00:00
  • 6ae52da571 LLVM leak sanitizer Peter Boyle 2024-07-08 15:59:18 +00:00
  • 4ee9c68053 Updated compile environment Peter Boyle 2024-07-08 15:57:57 +00:00
  • a15b4378a3 Sanitizer preservation of options Peter Boyle 2024-07-08 15:57:45 +00:00
  • 89fdd7f8dd AOT compilation Peter Boyle 2024-07-05 17:47:56 +00:00
  • c328be24b7 Sanitizer compile options Peter Boyle 2024-07-05 17:46:43 +00:00
  • a73dc6dbf4 Display linux heap info Peter Boyle 2024-06-28 16:05:17 +00:00
  • eee2a2657f Try catch exception wrappers Peter Boyle 2024-06-28 16:02:29 +00:00
  • 12b8be7cb9 Best so far on 96^3 350 Evecs converged on 4^4 block Peter Boyle 2024-06-18 16:31:37 -04:00
  • 63c223ea5d Verbose Peter Boyle 2024-06-18 03:22:01 +00:00
  • 2877fb4a2c More verbose if alloc failure Peter Boyle 2024-06-18 03:21:03 +00:00
  • d299c86633 Std::asin,acos Peter Boyle 2024-06-11 16:41:23 -04:00