1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-12-14 01:34:40 +00:00

Commit Graph

  • 3f3661a86f Heading towards PVdagM multigrid Peter Boyle 2025-01-17 14:33:35 +00:00
  • f7e2f9a401 Checking in spectral flow and DWF/Mobius kernel eigenvalue measurement Chulwoo Jung 2025-01-16 20:47:33 +00:00
  • 2848a9b558 DWF Kernel lanczos working(?) Chulwoo Jung 2025-01-16 01:29:56 +00:00
  • d4868991af Fixed wrong lib for NVTX in configure.ac and updated to nvtx3 Mashy Green 2025-01-10 14:53:19 +00:00
  • e99d42404e Removing the regresion test files that were also in this branch for a clean PR Mashy Green 2024-12-16 16:31:22 +00:00
  • 3ba019c747 Cleaning up and aligning variable naming between action deriv versions Mashy Green 2024-12-03 15:23:00 +00:00
  • 47429218bb patched version + modifications to deriv -> staple in qcd/gauge Mashy Green 2024-11-27 16:29:22 +00:00
  • 8fe429346f Dslash testing for reproduce Peter Boyle 2024-11-11 23:11:11 +00:00
  • 5a4f9bf2e3 Force the ROCM version Peter Boyle 2024-10-29 18:12:31 -04:00
  • b91fc1b6b4 Merge branch 'feature/boosted' into feature/deprecate-uvm Peter Boyle 2024-10-28 16:53:09 -04:00
  • eafc150034 Test fft asserts Peter Boyle 2024-10-23 16:46:26 -04:00
  • 2877f1a268 Verbose reduce Peter Boyle 2024-10-23 15:14:16 -04:00
  • 1e893af775 GPU happy Peter Boyle 2024-10-23 14:52:15 -04:00
  • d9f430a575 Happy GPU Peter Boyle 2024-10-23 14:51:16 -04:00
  • 63abe87f36 Memory manager verbose improvements that were useful to track an error Peter Boyle 2024-10-23 14:49:13 -04:00
  • 368d649c8a feature/deprecate-uvm happier -- preallocate device resident neigbour table Peter Boyle 2024-10-23 14:47:55 -04:00
  • 5603464f39 Fix in partial fraction import/export physical and make the GPU happier on the deprecate-uvm -- don't use static vectors, make member of class Peter Boyle 2024-10-23 14:45:58 -04:00
  • 655c79f39e Suppress warning on partial override Peter Boyle 2024-10-23 14:44:41 -04:00
  • 565b231c03 Nvcc happy Peter Boyle 2024-10-23 14:44:17 -04:00
  • 62a9f180fa NVCC happy Peter Boyle 2024-10-23 14:44:04 -04:00
  • 5ae77876a8 Meson field and Aslash field on GPU; some compiler warning removed Peter Boyle 2024-10-18 19:08:06 -04:00
  • 4ed2c2c74f Config command Peter Boyle 2024-10-18 13:58:33 -04:00
  • 955da582b6 Working on NVCC Peter Boyle 2024-10-18 13:58:03 -04:00
  • 11b07b950d Vanilla linux compile, assuming spack prerequisites Peter Boyle 2024-10-18 13:57:40 -04:00
  • 8f70cfeda9 Clean up Peter Boyle 2024-10-18 13:56:53 -04:00
  • ce64271048 Remove the copying version Peter Boyle 2024-10-18 13:56:24 -04:00
  • 5cc4f3241d Meson field test Peter Boyle 2024-10-18 15:42:30 +00:00
  • 6815e138b4 Boosted fermion attempt feature/boosted Peter Boyle 2024-10-17 18:36:32 +01:00
  • a78a61d76f Update configure Peter Boyle 2024-10-15 14:38:45 +00:00
  • 2eff3f34ed Alternate reduction; default to grids own but make a configure flag --enable-reduction=grid|mpi Peter Boyle 2024-10-15 14:36:06 +00:00
  • 03687c1d62 Final version of test, closer to original again Peter Boyle 2024-10-15 14:35:17 +00:00
  • febfe4e77f Make my own reduction a configure flag Peter Boyle 2024-10-15 14:32:35 +00:00
  • 4d1aa134b5 Use normal reduction, configure flag to force deterministic Peter Boyle 2024-10-15 14:32:11 +00:00
  • 5ec879860a Odd rounding issue - bears looking into Peter Boyle 2024-10-15 14:30:54 +00:00
  • f617468e04 Update Lattice_base.h Peter Boyle 2024-10-11 10:39:16 -04:00
  • b728af903c Fast axpy norm under CFLAG Peter Boyle 2024-10-11 03:23:09 +00:00
  • 54f1999030 axpy_norm_fast -- wasn't using the determinstic MPI sum causing issues Peter Boyle 2024-10-11 03:22:18 +00:00
  • fd58f0b669 Return ok Peter Boyle 2024-10-11 03:21:21 +00:00
  • c5c67b706e cl::sycl -> SYCL Peter Boyle 2024-10-10 22:04:12 +00:00
  • be7a543e2c Revert barriers -- these were not the problem Peter Boyle 2024-10-10 22:03:29 +00:00
  • 68f112d576 New software moves cl::sycl Peter Boyle 2024-10-10 22:03:04 +00:00
  • ec1395a304 Better flight logging Peter Boyle 2024-10-10 22:01:57 +00:00
  • beb0e474ee Use deterministic own brand reduction Peter Boyle 2024-10-10 22:01:24 +00:00
  • 2b5fdcbbc5 New software version Peter Boyle 2024-10-10 21:59:02 +00:00
  • 295127d456 Deterministic homebrew reduction Peter Boyle 2024-10-10 21:58:26 +00:00
  • 7dcfb13694 New software stack Peter Boyle 2024-10-10 21:57:35 +00:00
  • ee4046fe92 Added a dimension ordered column sum based reduction for scalar. Removes dependence on MPI_Allreduce and allows for work around on systems where this is bollox. Peter Boyle 2024-09-27 09:26:03 -04:00
  • 2a9cfeb9ea New files Peter Boyle 2024-09-26 14:23:29 -04:00
  • 1147b8ea40 Cheby poly setup Peter Boyle 2024-09-26 14:20:32 -04:00
  • 3f9119b39d Remove vectors used for the power spectrum table in paper Peter Boyle 2024-09-26 14:19:41 -04:00
  • 35e8225abd Verbose control Peter Boyle 2024-09-26 14:18:35 -04:00
  • bdbfbb7a14 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-09-26 14:05:45 -04:00
  • f7d4be8d96 Calculate bytes correctly Peter Boyle 2024-09-26 14:04:44 -04:00
  • 9fa8bd6438 Configure for AOT on Aurora latest software Peter Boyle 2024-09-23 11:25:44 +00:00
  • 02c8178f16 Almost working on Aurora Peter Boyle 2024-09-23 09:43:50 +00:00
  • e637fbacae Verbose remove Peter Boyle 2024-09-23 09:42:43 +00:00
  • bffd30abec Optimise lie algebra project feature/fthmc-optimise Peter Boyle 2024-09-19 15:48:09 -04:00
  • 066544281f Deprecate UVM Peter Boyle 2024-09-17 13:34:27 +00:00
  • 11be10d2c0 Aurora testing Peter Boyle 2024-09-10 18:11:52 +00:00
  • 160969a758 UVM tester, doesn't turn up anything Peter Boyle 2024-09-10 18:09:42 +00:00
  • 622f78ebea SYCL updates -- operator = giving trouble on Aurora. SYCL reduction is failing intermittently with SVM interface - returns zero, expect non-zero. Think I need to remove ALL dependence on SVM. Peter Boyle 2024-09-04 13:53:48 +00:00
  • aa67a5b095 Rename Peter Boyle 2024-08-27 19:54:01 +00:00
  • af9ea0864c Blas fix Peter Boyle 2024-08-27 19:53:09 +00:00
  • 4e2a6d87c4 Gemm batched fix Peter Boyle 2024-08-27 19:24:05 +00:00
  • a465ecece9 Aurora Peter Boyle 2024-08-27 19:17:25 +00:00
  • 575eb72182 Converges on 16^3 Peter Boyle 2024-08-20 18:10:08 +00:00
  • 3a973914d6 Compile on frontier Peter Boyle 2024-08-27 14:55:42 -04:00
  • f568c07bbd Improved the BLAS benchmark Peter Boyle 2024-08-27 14:53:12 -04:00
  • 2c9878fc3a Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-08-27 12:05:46 -04:00
  • 27b1b1b005 Checkerboard available for offloading pickCheckerboard Peter Boyle 2024-08-27 12:04:09 -04:00
  • 130d7ab077 Verbose changes Peter Boyle 2024-08-27 12:03:28 -04:00
  • 29f6b8a74a Setup Peter Boyle 2024-08-27 12:02:49 -04:00
  • 9779aaea33 16^3 optimise Peter Boyle 2024-08-27 11:38:35 -04:00
  • ec25604a67 Fastest solver for mrhs multigrid Peter Boyle 2024-08-27 11:32:34 -04:00
  • 3668e81c5e Extract slice working on checkerboard field for Block Lanczos Peter Boyle 2024-08-27 11:31:30 -04:00
  • d66b2423cb Move slice operations to GPU for BlockCG Peter Boyle 2024-08-27 11:28:47 -04:00
  • 15cc78f0b6 peek/poke local site on checkerboard arrays Peter Boyle 2024-08-27 11:23:42 -04:00
  • 06db4ddea2 Fast init on GPU Peter Boyle 2024-08-27 11:22:33 -04:00
  • 6cfb90e99f Support needed for accelerator resident set/pick Checkerboard Peter Boyle 2024-08-27 11:19:00 -04:00
  • d8be95a2a3 Don't early terminate power method to get more accurate top EV Peter Boyle 2024-08-27 11:17:37 -04:00
  • f82702872d Normal residual Peter Boyle 2024-08-27 11:16:44 -04:00
  • 3752c49ef0 Add option to record the CG polynomial Peter Boyle 2024-08-27 11:14:35 -04:00
  • fe65fa4988 MulMatrix Peter Boyle 2024-08-27 11:13:18 -04:00
  • 1fe4c205a3 Adef Peter Boyle 2024-08-27 11:11:47 -04:00
  • d4dc5e0f43 BlockCG linalg acceleratoin with BLAS Peter Boyle 2024-08-27 11:08:33 -04:00
  • 77944437ce Functor initialisation Peter Boyle 2024-08-27 11:01:02 -04:00
  • c164bff758 MMdag Peter Boyle 2024-08-27 11:00:36 -04:00
  • aa2e3d954a MMdag operator Peter Boyle 2024-08-27 10:59:29 -04:00
  • de62b04728 Block CG linalg acceleration Peter Boyle 2024-08-27 10:58:54 -04:00
  • d0bdb50f24 Analyse power spectrum Peter Boyle 2024-08-27 10:58:19 -04:00
  • da919949f9 Clean up the accelerator pick/set checkerboard Peter Boyle 2024-08-23 12:34:41 -04:00
  • b12b4fdaff Attempt at operating on half checkerboard Peter Boyle 2024-08-23 11:05:09 -04:00
  • a8fecbc609 BlockCG linalg via BLAS Peter Boyle 2024-08-21 16:08:16 -04:00
  • 557fa483ff Blas benchmark committed stand alone Peter Boyle 2024-08-20 16:18:43 +00:00
  • fc15d55df6 Mallinfo Peter Boyle 2024-08-20 14:33:09 +00:00
  • 53573d7d94 Better benchmark Peter Boyle 2024-08-20 14:31:57 +00:00
  • bb3c177000 Better benchmarking Peter Boyle 2024-08-20 14:31:41 +00:00
  • a3322b470f Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-08-20 14:30:52 +00:00
  • f8f408e7a9 BLAS everywhere Peter Boyle 2024-07-25 18:09:02 +00:00
  • baac1127d0 Later intel compiler happiness Peter Boyle 2024-07-25 18:06:05 +00:00