1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-04-19 18:21:02 +01:00
Commit Graph

7750 Commits

Author SHA1 Message Date
Peter Boyle da890dc293 Verbose changes 2024-04-01 14:18:00 -04:00
Peter Boyle 93d0a1e73a HISQ view call 2024-04-01 14:16:47 -04:00
Peter Boyle f0a8c7d045 Playing with chebyshevs 2024-04-01 14:16:11 -04:00
Peter Boyle db8793777c Logging/verbose 2024-04-01 14:15:41 -04:00
Peter Boyle c745484e65 9.5x speed up version 2024-04-01 14:14:30 -04:00
Peter Boyle cc04dc42dc Merge branch 'develop' into feature/scidac-wp1 2024-03-06 14:55:21 -05:00
Peter Boyle 070b61f08f Simplifying the MultiRHS solver to make it do SRHS *and* MRHS 2024-03-06 14:04:33 -05:00
Peter Boyle 7e5bd46dd3 Booster update 2024-03-06 19:03:45 +01:00
Peter Boyle 228bbb9d81 Benchmark results 2024-03-06 19:03:35 +01:00
paboyle b812a7b4c6 Staggered launch script 2024-03-06 01:32:40 +00:00
paboyle 891a366f73 Repro CG script 2024-03-06 01:22:55 +00:00
paboyle 10116b3be8 Force device copyable and tell SYCL to shut it. 2024-03-06 01:13:27 +00:00
paboyle a46a0f0882 force device copyable and don't take crap from SYCL 2024-03-06 01:12:49 +00:00
paboyle a26a8a38f4 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-03-06 00:05:00 +00:00
paboyle 7435315d50 More blasted shell variables 2024-03-06 00:03:59 +00:00
paboyle 9b5f741e85 Reproducing CG can be more useful now 2024-03-06 00:03:16 +00:00
paboyle 517822fdd2 SPR HBM benchmarking right and also PVC batched GEMM 2024-03-06 00:02:27 +00:00
paboyle 1b93a9be88 Print out the hostname 2024-03-06 00:01:58 +00:00
paboyle 783a66b348 Deterministic reduction please 2024-03-06 00:01:37 +00:00
paboyle 976c3e9b59 Hack for flight logging CG inner products.
Can be made to work, but could put in some more serious infrastructure
for repro testing and blame attribution (Britney test) if necessary
2024-03-05 23:59:57 +00:00
paboyle f8ca971dae Use of a bare PRECISION macro is not namespace safe and collides with
SYCL
2024-03-05 23:59:13 +00:00
paboyle 21bc8c24df OneMKL batched blas starting 2024-03-05 23:58:20 +00:00
paboyle 30228214f7 SYCL conflict with Eigen 2024-03-05 23:56:10 +00:00
Peter Boyle 2ae980ae43 Update sourceme.sh 2024-03-05 13:39:18 -05:00
Peter Boyle 6153dec2e4 Update setup.sh 2024-03-05 13:38:32 -05:00
Peter Boyle c805f86343 USQCD benchmark 2024-03-01 00:05:04 -05:00
Peter Boyle 04ca065281 Only one rank opens 2024-02-29 20:09:11 -05:00
Peter Boyle 88d8fa43d7 Benchmark development 2024-02-29 20:01:44 -05:00
Peter Boyle 3c49762875 Propagate in the blas routine 2024-02-29 15:33:06 -05:00
Peter Boyle 436bf1d9d3 Merge pull request #455 from clarkedavida/hisq_fat_links
Hisq fat links
2024-02-29 15:29:39 -05:00
david clarke f70df6e195 changed NO_SHIFT and BACKWARD_CONST from define to enum 2024-02-29 12:29:30 -07:00
Peter Boyle fce3852dff Merge pull request #451 from paboyle/feature/eigen-3.4.0-update
updating Eigen to 3.4.0
2024-02-28 18:03:37 -05:00
Peter Boyle ee1b8bbdbd Merge pull request #454 from edbennett/adjoint-broke
fix HMC for non-fundamental representations
2024-02-28 14:05:27 -05:00
Peter Boyle 3f1636637d Merge pull request #453 from dbollweg/feature/sliceSum_gpu
Feature/slice sum gpu
2024-02-28 14:04:43 -05:00
Peter Boyle 2e570f5300 Merge pull request #457 from lehner/feature/gpt
Import GPT-related updates
2024-02-28 13:59:04 -05:00
Christoph Lehner 9f89486df5 remove unnecessary code path 2024-02-28 19:56:23 +01:00
Christoph Lehner 22b43b86cb Make GPT test suite work with SYCL 2024-02-28 12:57:17 +01:00
dbollweg 3c9012676a CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case. 2024-02-27 12:41:45 -05:00
Peter Boyle ee3b3c4c56 relocate deflation support 2024-02-27 11:52:23 -05:00
Peter Boyle 462d706a63 Move to a blas directory 2024-02-27 11:51:04 -05:00
Peter Boyle ee0d460c8e Blas based block project & deflate for multiRHS 2024-02-27 11:41:44 -05:00
Peter Boyle cd15abe9d1 Mrhs prep 2024-02-27 11:41:13 -05:00
Peter Boyle 9f40467e24 Warning squash 2024-02-27 11:40:36 -05:00
Peter Boyle d0b6593823 More verbose on checksum 2024-02-27 11:40:14 -05:00
Peter Boyle 79fc821d8d reorg headers 2024-02-27 11:39:37 -05:00
Peter Boyle d7fdb9a7e6 Reorg headers 2024-02-27 11:39:06 -05:00
Peter Boyle b74de51c18 Reorder headers 2024-02-27 11:38:52 -05:00
Dennis Bollweg b507fe209c Added SpinColourMatrix case to sliceSum Test 2024-02-27 11:28:32 -05:00
Dennis Bollweg 6cd2d8fcd5 Replace cuda/hip memcpy with Grid functions 2024-02-26 09:55:07 -05:00
david clarke b02d022993 fixed race condition (thx michael) 2024-02-23 17:14:28 -07:00