1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-05 00:47:06 +01:00

Commit Graph

  • c74d11e3d7 PVdagM MG Peter Boyle 2025-02-01 11:04:13 -05:00
  • 84cab5e6e7 no comms and log cleanup Christoph Lehner 2025-02-01 16:37:21 +01:00
  • c4fc972fec Merge branch 'feature/deprecate-uvm' into develop Peter Boyle 2025-01-31 16:32:36 +00:00
  • 8cf809e231 Best results on Aurora so far feature/deprecate-uvm Peter Boyle 2025-01-31 16:14:45 +00:00
  • 94019a922e Significantly better performance on Aurora without using pipeline mode Peter Boyle 2025-01-30 16:36:46 +00:00
  • 4f17c8d081
    Merge branch 'paboyle:develop' into bugfix/nvtx Mashy Green 2025-01-29 13:10:12 +00:00
  • aaab753982 Reverting to older version of nvtx for Tursa support Mashy Green 2025-01-29 12:57:38 +00:00
  • d6b2727f86 Pipeline mode getting better -- 2 nodes @ 10TF/s per node on Aurora Peter Boyle 2025-01-29 09:22:21 +00:00
  • 74a4f43946 Optional host buffer bounce for no CUDA aware MPI Peter Boyle 2025-01-28 15:22:46 +00:00
  • 1caf8b0f86 Rename Peter Boyle 2025-01-28 15:22:37 +00:00
  • 570b72a47b Bugfix. Sorry! Chulwoo Jung 2025-01-21 15:37:39 -05:00
  • a5798a89ed Merge branch 'develop' into specflow Chulwoo Jung 2025-01-21 12:13:24 -05:00
  • 3f3661a86f Heading towards PVdagM multigrid Peter Boyle 2025-01-17 14:33:35 +00:00
  • f7e2f9a401 Checking in spectral flow and DWF/Mobius kernel eigenvalue measurement Chulwoo Jung 2025-01-16 20:47:33 +00:00
  • 2848a9b558 DWF Kernel lanczos working(?) Chulwoo Jung 2025-01-16 01:29:56 +00:00
  • d4868991af Fixed wrong lib for NVTX in configure.ac and updated to nvtx3 Mashy Green 2025-01-10 14:53:19 +00:00
  • e99d42404e Removing the regresion test files that were also in this branch for a clean PR Mashy Green 2024-12-16 16:31:22 +00:00
  • 3ba019c747 Cleaning up and aligning variable naming between action deriv versions Mashy Green 2024-12-03 15:23:00 +00:00
  • 47429218bb patched version + modifications to deriv -> staple in qcd/gauge Mashy Green 2024-11-27 16:29:22 +00:00
  • 8fe429346f Dslash testing for reproduce Peter Boyle 2024-11-11 23:11:11 +00:00
  • 5a4f9bf2e3 Force the ROCM version Peter Boyle 2024-10-29 18:12:31 -04:00
  • b91fc1b6b4 Merge branch 'feature/boosted' into feature/deprecate-uvm Peter Boyle 2024-10-28 16:53:09 -04:00
  • eafc150034 Test fft asserts Peter Boyle 2024-10-23 16:46:26 -04:00
  • 2877f1a268 Verbose reduce Peter Boyle 2024-10-23 15:14:16 -04:00
  • 1e893af775 GPU happy Peter Boyle 2024-10-23 14:52:15 -04:00
  • d9f430a575 Happy GPU Peter Boyle 2024-10-23 14:51:16 -04:00
  • 63abe87f36 Memory manager verbose improvements that were useful to track an error Peter Boyle 2024-10-23 14:49:13 -04:00
  • 368d649c8a feature/deprecate-uvm happier -- preallocate device resident neigbour table Peter Boyle 2024-10-23 14:47:55 -04:00
  • 5603464f39 Fix in partial fraction import/export physical and make the GPU happier on the deprecate-uvm -- don't use static vectors, make member of class Peter Boyle 2024-10-23 14:45:58 -04:00
  • 655c79f39e Suppress warning on partial override Peter Boyle 2024-10-23 14:44:41 -04:00
  • 565b231c03 Nvcc happy Peter Boyle 2024-10-23 14:44:17 -04:00
  • 62a9f180fa NVCC happy Peter Boyle 2024-10-23 14:44:04 -04:00
  • 5ae77876a8 Meson field and Aslash field on GPU; some compiler warning removed Peter Boyle 2024-10-18 19:08:06 -04:00
  • 4ed2c2c74f Config command Peter Boyle 2024-10-18 13:58:33 -04:00
  • 955da582b6 Working on NVCC Peter Boyle 2024-10-18 13:58:03 -04:00
  • 11b07b950d Vanilla linux compile, assuming spack prerequisites Peter Boyle 2024-10-18 13:57:40 -04:00
  • 8f70cfeda9 Clean up Peter Boyle 2024-10-18 13:56:53 -04:00
  • ce64271048 Remove the copying version Peter Boyle 2024-10-18 13:56:24 -04:00
  • 5cc4f3241d Meson field test Peter Boyle 2024-10-18 15:42:30 +00:00
  • 6815e138b4 Boosted fermion attempt feature/boosted Peter Boyle 2024-10-17 18:36:32 +01:00
  • a78a61d76f Update configure Peter Boyle 2024-10-15 14:38:45 +00:00
  • 2eff3f34ed Alternate reduction; default to grids own but make a configure flag --enable-reduction=grid|mpi Peter Boyle 2024-10-15 14:36:06 +00:00
  • 03687c1d62 Final version of test, closer to original again Peter Boyle 2024-10-15 14:35:17 +00:00
  • febfe4e77f Make my own reduction a configure flag Peter Boyle 2024-10-15 14:32:35 +00:00
  • 4d1aa134b5 Use normal reduction, configure flag to force deterministic Peter Boyle 2024-10-15 14:32:11 +00:00
  • 5ec879860a Odd rounding issue - bears looking into Peter Boyle 2024-10-15 14:30:54 +00:00
  • f617468e04
    Update Lattice_base.h Peter Boyle 2024-10-11 10:39:16 -04:00
  • b728af903c Fast axpy norm under CFLAG Peter Boyle 2024-10-11 03:23:09 +00:00
  • 54f1999030 axpy_norm_fast -- wasn't using the determinstic MPI sum causing issues Peter Boyle 2024-10-11 03:22:18 +00:00
  • fd58f0b669 Return ok Peter Boyle 2024-10-11 03:21:21 +00:00
  • c5c67b706e cl::sycl -> SYCL Peter Boyle 2024-10-10 22:04:12 +00:00
  • be7a543e2c Revert barriers -- these were not the problem Peter Boyle 2024-10-10 22:03:29 +00:00
  • 68f112d576 New software moves cl::sycl Peter Boyle 2024-10-10 22:03:04 +00:00
  • ec1395a304 Better flight logging Peter Boyle 2024-10-10 22:01:57 +00:00
  • beb0e474ee Use deterministic own brand reduction Peter Boyle 2024-10-10 22:01:24 +00:00
  • 2b5fdcbbc5 New software version Peter Boyle 2024-10-10 21:59:02 +00:00
  • 295127d456 Deterministic homebrew reduction Peter Boyle 2024-10-10 21:58:26 +00:00
  • 7dcfb13694 New software stack Peter Boyle 2024-10-10 21:57:35 +00:00
  • ee4046fe92 Added a dimension ordered column sum based reduction for scalar. Removes dependence on MPI_Allreduce and allows for work around on systems where this is bollox. Peter Boyle 2024-09-27 09:26:03 -04:00
  • 2a9cfeb9ea New files Peter Boyle 2024-09-26 14:23:29 -04:00
  • 1147b8ea40 Cheby poly setup Peter Boyle 2024-09-26 14:20:32 -04:00
  • 3f9119b39d Remove vectors used for the power spectrum table in paper Peter Boyle 2024-09-26 14:19:41 -04:00
  • 35e8225abd Verbose control Peter Boyle 2024-09-26 14:18:35 -04:00
  • bdbfbb7a14 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-09-26 14:05:45 -04:00
  • f7d4be8d96 Calculate bytes correctly Peter Boyle 2024-09-26 14:04:44 -04:00
  • 9fa8bd6438 Configure for AOT on Aurora latest software Peter Boyle 2024-09-23 11:25:44 +00:00
  • 02c8178f16 Almost working on Aurora Peter Boyle 2024-09-23 09:43:50 +00:00
  • e637fbacae Verbose remove Peter Boyle 2024-09-23 09:42:43 +00:00
  • bffd30abec Optimise lie algebra project feature/fthmc-optimise Peter Boyle 2024-09-19 15:48:09 -04:00
  • 066544281f Deprecate UVM Peter Boyle 2024-09-17 13:34:27 +00:00
  • 11be10d2c0 Aurora testing Peter Boyle 2024-09-10 18:11:52 +00:00
  • 160969a758 UVM tester, doesn't turn up anything Peter Boyle 2024-09-10 18:09:42 +00:00
  • 622f78ebea SYCL updates -- operator = giving trouble on Aurora. SYCL reduction is failing intermittently with SVM interface - returns zero, expect non-zero. Think I need to remove ALL dependence on SVM. Peter Boyle 2024-09-04 13:53:48 +00:00
  • aa67a5b095 Rename Peter Boyle 2024-08-27 19:54:01 +00:00
  • af9ea0864c Blas fix Peter Boyle 2024-08-27 19:53:09 +00:00
  • 4e2a6d87c4 Gemm batched fix Peter Boyle 2024-08-27 19:24:05 +00:00
  • a465ecece9 Aurora Peter Boyle 2024-08-27 19:17:25 +00:00
  • 575eb72182 Converges on 16^3 Peter Boyle 2024-08-20 18:10:08 +00:00
  • 3a973914d6 Compile on frontier Peter Boyle 2024-08-27 14:55:42 -04:00
  • f568c07bbd Improved the BLAS benchmark Peter Boyle 2024-08-27 14:53:12 -04:00
  • 2c9878fc3a Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-08-27 12:05:46 -04:00
  • 27b1b1b005 Checkerboard available for offloading pickCheckerboard Peter Boyle 2024-08-27 12:04:09 -04:00
  • 130d7ab077 Verbose changes Peter Boyle 2024-08-27 12:03:28 -04:00
  • 29f6b8a74a Setup Peter Boyle 2024-08-27 12:02:49 -04:00
  • 9779aaea33 16^3 optimise Peter Boyle 2024-08-27 11:38:35 -04:00
  • ec25604a67 Fastest solver for mrhs multigrid Peter Boyle 2024-08-27 11:32:34 -04:00
  • 3668e81c5e Extract slice working on checkerboard field for Block Lanczos Peter Boyle 2024-08-27 11:31:30 -04:00
  • d66b2423cb Move slice operations to GPU for BlockCG Peter Boyle 2024-08-27 11:28:47 -04:00
  • 15cc78f0b6 peek/poke local site on checkerboard arrays Peter Boyle 2024-08-27 11:23:42 -04:00
  • 06db4ddea2 Fast init on GPU Peter Boyle 2024-08-27 11:22:33 -04:00
  • 6cfb90e99f Support needed for accelerator resident set/pick Checkerboard Peter Boyle 2024-08-27 11:19:00 -04:00
  • d8be95a2a3 Don't early terminate power method to get more accurate top EV Peter Boyle 2024-08-27 11:17:37 -04:00
  • f82702872d Normal residual Peter Boyle 2024-08-27 11:16:44 -04:00
  • 3752c49ef0 Add option to record the CG polynomial Peter Boyle 2024-08-27 11:14:35 -04:00
  • fe65fa4988 MulMatrix Peter Boyle 2024-08-27 11:13:18 -04:00
  • 1fe4c205a3 Adef Peter Boyle 2024-08-27 11:11:47 -04:00
  • d4dc5e0f43 BlockCG linalg acceleratoin with BLAS Peter Boyle 2024-08-27 11:08:33 -04:00
  • 77944437ce Functor initialisation Peter Boyle 2024-08-27 11:01:02 -04:00
  • c164bff758 MMdag Peter Boyle 2024-08-27 11:00:36 -04:00
  • aa2e3d954a MMdag operator Peter Boyle 2024-08-27 10:59:29 -04:00