mirror of https://github.com/paboyle/Grid.git synced 2026-04-19 18:21:02 +01:00
Commit Graph

7941 Commits

Author SHA1 Message Date
paboyle 5cc4f3241d Meson field test 2024-10-18 15:42:30 +00:00
paboyle a78a61d76f Update configure 2024-10-15 14:38:45 +00:00
paboyle 2eff3f34ed Alternate reduction; default to Grid's own but make a configure flag --enable-reduction=grid|mpi 2024-10-15 14:36:06 +00:00
paboyle 03687c1d62 Final version of test, closer to original again 2024-10-15 14:35:17 +00:00
paboyle febfe4e77f Make my own reduction a configure flag 2024-10-15 14:32:35 +00:00
paboyle 4d1aa134b5 Use normal reduction, configure flag to force deterministic 2024-10-15 14:32:11 +00:00
paboyle 5ec879860a Odd rounding issue - bears looking into 2024-10-15 14:30:54 +00:00
paboyle b728af903c Fast axpy norm under CFLAG 2024-10-11 03:23:09 +00:00
paboyle 54f1999030 axpy_norm_fast -- wasn't using the deterministic MPI sum causing issues 2024-10-11 03:22:18 +00:00
paboyle fd58f0b669 Return ok 2024-10-11 03:21:21 +00:00
paboyle c5c67b706e cl::sycl -> SYCL 2024-10-10 22:04:12 +00:00
paboyle be7a543e2c Revert barriers -- these were not the problem 2024-10-10 22:03:29 +00:00
paboyle 68f112d576 New software moves cl::sycl 2024-10-10 22:03:04 +00:00
paboyle ec1395a304 Better flight logging 2024-10-10 22:01:57 +00:00
paboyle beb0e474ee Use deterministic own brand reduction 2024-10-10 22:01:24 +00:00
paboyle 2b5fdcbbc5 New software version 2024-10-10 21:59:02 +00:00
paboyle 295127d456 Deterministic homebrew reduction 2024-10-10 21:58:26 +00:00
paboyle 7dcfb13694 New software stack 2024-10-10 21:57:35 +00:00
paboyle 9fa8bd6438 Configure for AOT on Aurora latest software 2024-09-23 11:25:44 +00:00
paboyle 02c8178f16 Almost working on Aurora 2024-09-23 09:43:50 +00:00
paboyle e637fbacae Verbose remove 2024-09-23 09:42:43 +00:00
paboyle 066544281f Deprecate UVM 2024-09-17 13:34:27 +00:00
paboyle 11be10d2c0 Aurora testing 2024-09-10 18:11:52 +00:00
paboyle 160969a758 UVM tester, doesn't turn up anything 2024-09-10 18:09:42 +00:00
paboyle 622f78ebea SYCL updates -- operator = giving trouble on Aurora. SYCL reduction is failing intermittently with SVM interface - returns zero, expect non-zero. Think I need to remove ALL dependence on SVM. 2024-09-04 13:53:48 +00:00
Peter Boyle aa67a5b095 Rename 2024-08-27 19:54:01 +00:00
Peter Boyle af9ea0864c Blas fix 2024-08-27 19:53:09 +00:00
Peter Boyle 4e2a6d87c4 Gemm batched fix 2024-08-27 19:24:05 +00:00
Peter Boyle a465ecece9 Aurora 2024-08-27 19:20:43 +00:00
Peter Boyle 575eb72182 Converges on 16^3 2024-08-27 19:20:38 +00:00
Peter Boyle 3a973914d6 Compile on frontier 2024-08-27 14:55:42 -04:00
Peter Boyle f568c07bbd Improved the BLAS benchmark 2024-08-27 14:53:54 -04:00
Peter Boyle 2c9878fc3a Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-08-27 12:05:46 -04:00
Peter Boyle 27b1b1b005 Checkerboard available for offloading pickCheckerboard 2024-08-27 12:04:09 -04:00
Peter Boyle 130d7ab077 Verbose changes 2024-08-27 12:03:28 -04:00
Peter Boyle 29f6b8a74a Setup 2024-08-27 12:02:49 -04:00
Peter Boyle 9779aaea33 16^3 optimise 2024-08-27 11:38:35 -04:00
Peter Boyle ec25604a67 Fastest solver for mrhs multigrid 2024-08-27 11:32:34 -04:00
Peter Boyle 3668e81c5e Extract slice working on checkerboard field for Block Lanczos 2024-08-27 11:31:30 -04:00
Peter Boyle d66b2423cb Move slice operations to GPU for BlockCG 2024-08-27 11:28:47 -04:00
Peter Boyle 15cc78f0b6 peek/poke local site on checkerboard arrays 2024-08-27 11:23:42 -04:00
Peter Boyle 06db4ddea2 Fast init on GPU 2024-08-27 11:22:33 -04:00
Peter Boyle 6cfb90e99f Support needed for accelerator resident set/pick Checkerboard 2024-08-27 11:19:00 -04:00
Peter Boyle d8be95a2a3 Don't early terminate power method to get more accurate top EV 2024-08-27 11:17:37 -04:00
Peter Boyle f82702872d Normal residual 2024-08-27 11:16:44 -04:00
Peter Boyle 3752c49ef0 Add option to record the CG polynomial 2024-08-27 11:14:35 -04:00
Peter Boyle fe65fa4988 MulMatrix 2024-08-27 11:13:18 -04:00
Peter Boyle 1fe4c205a3 Adef 2024-08-27 11:11:47 -04:00
Peter Boyle d4dc5e0f43 BlockCG linalg acceleration with BLAS 2024-08-27 11:08:33 -04:00
Peter Boyle 77944437ce Functor initialisation 2024-08-27 11:01:02 -04:00