1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-04-19 10:11:02 +01:00
Commit Graph

7576 Commits

Author SHA1 Message Date
paboyle b812a7b4c6 Staggered launch script 2024-03-06 01:32:40 +00:00
paboyle 891a366f73 Repro CG script 2024-03-06 01:22:55 +00:00
paboyle 10116b3be8 Force device copyable and tell SYCL to shut it. 2024-03-06 01:13:27 +00:00
paboyle a46a0f0882 force device copyable and don't take crap from SYCL 2024-03-06 01:12:49 +00:00
paboyle a26a8a38f4 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-03-06 00:05:00 +00:00
paboyle 7435315d50 More blasted shell variables 2024-03-06 00:03:59 +00:00
paboyle 9b5f741e85 Reproducing CG can be more useful now 2024-03-06 00:03:16 +00:00
paboyle 517822fdd2 SPR HBM benchmarking right and also PVC batched GEMM 2024-03-06 00:02:27 +00:00
paboyle 1b93a9be88 Print out the hostname 2024-03-06 00:01:58 +00:00
paboyle 783a66b348 Deterministic reduction please 2024-03-06 00:01:37 +00:00
paboyle 976c3e9b59 Hack for flight logging CG inner products.
Can be made to work, but could put in some more serious infrastructure
for repro testing and blame attribution (Britney test) if necessary
2024-03-05 23:59:57 +00:00
paboyle f8ca971dae Use of a bare PRECISION macro is not namespace safe and collides with
SYCL
2024-03-05 23:59:13 +00:00
paboyle 21bc8c24df OneMKL batched blas starting 2024-03-05 23:58:20 +00:00
paboyle 30228214f7 SYCL conflict with Eigen 2024-03-05 23:56:10 +00:00
Peter Boyle 2ae980ae43 Update sourceme.sh 2024-03-05 13:39:18 -05:00
Peter Boyle 6153dec2e4 Update setup.sh 2024-03-05 13:38:32 -05:00
Peter Boyle c805f86343 USQCD benchmark 2024-03-01 00:05:04 -05:00
Peter Boyle 04ca065281 Only one rank opens 2024-02-29 20:09:11 -05:00
Peter Boyle 88d8fa43d7 Benchmark development 2024-02-29 20:01:44 -05:00
Peter Boyle 3c49762875 Propagate in the blas routine 2024-02-29 15:33:06 -05:00
Peter Boyle 436bf1d9d3 Merge pull request #455 from clarkedavida/hisq_fat_links
Hisq fat links
2024-02-29 15:29:39 -05:00
david clarke f70df6e195 changed NO_SHIFT and BACKWARD_CONST from define to enum 2024-02-29 12:29:30 -07:00
Peter Boyle fce3852dff Merge pull request #451 from paboyle/feature/eigen-3.4.0-update
updating Eigen to 3.4.0
2024-02-28 18:03:37 -05:00
Peter Boyle ee1b8bbdbd Merge pull request #454 from edbennett/adjoint-broke
fix HMC for non-fundamental representations
2024-02-28 14:05:27 -05:00
Peter Boyle 3f1636637d Merge pull request #453 from dbollweg/feature/sliceSum_gpu
Feature/slice sum gpu
2024-02-28 14:04:43 -05:00
Peter Boyle 2e570f5300 Merge pull request #457 from lehner/feature/gpt
Import GPT-related updates
2024-02-28 13:59:04 -05:00
Christoph Lehner 9f89486df5 remove unnecessary code path 2024-02-28 19:56:23 +01:00
Christoph Lehner 22b43b86cb Make GPT test suite work with SYCL 2024-02-28 12:57:17 +01:00
dbollweg 3c9012676a CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case. 2024-02-27 12:41:45 -05:00
Dennis Bollweg b507fe209c Added SpinColourMatrix case to sliceSum Test 2024-02-27 11:28:32 -05:00
Dennis Bollweg 6cd2d8fcd5 Replace cuda/hip memcpy with Grid functions 2024-02-26 09:55:07 -05:00
david clarke b02d022993 fixed race condition (thx michael) 2024-02-23 17:14:28 -07:00
david clarke 94581e3c7a accelerator_for is broken 2024-02-23 15:58:33 -07:00
david clarke 88b52cc045 Merge branch 'develop' into hisq_fat_links 2024-02-23 14:47:15 -07:00
dbollweg 0a816b5509 Merge branch 'feature/sliceSum_gpu' of https://github.com/dbollweg/Grid into feature/sliceSum_gpu 2024-02-22 21:43:06 -05:00
dbollweg 1c8b807c2e free malloc'd memory 2024-02-22 21:42:44 -05:00
Christoph Lehner 66391f84f2 Merge branch 'feature/gpt' of ../Grid into develop 2024-02-21 19:05:00 +01:00
edbennett 97f7a9ecb3 fix HMC for non-fundamental representations 2024-02-21 08:27:55 +00:00
Dennis Bollweg 15878f7613 sliceSumReduction_cub_large now also faster than CPU on Frontier 2024-02-16 13:55:21 -05:00
dbollweg e0d5e3c6c7 Merge branch 'paboyle:develop' into feature/sliceSum_gpu 2024-02-16 13:16:37 -05:00
dbollweg 6f3455900e Adding sliceSumReduction_cub_small/large since hipcub cannot deal with arb. large vobjs 2024-02-16 13:15:02 -05:00
david clarke 56827d6ad6 accelerator_inline bug 2024-02-14 13:56:57 -07:00
paboyle 73c0b29535 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-02-13 20:19:32 +00:00
paboyle 303b83cdb8 Scaling benchmarks, verbosity and MPICH aware in acceleratorInit()
For some reason Dirichlet benchmark fails on several nodes; need to
debug this.
2024-02-13 19:48:03 +00:00
paboyle 5ef4da3f29 Silence verbose 2024-02-13 19:47:36 +00:00
paboyle 1502860004 Benchmark scripts 2024-02-13 19:47:02 +00:00
paboyle 585efc6f3f More benchmark scripts 2024-02-13 19:40:49 +00:00
portelli 62055e04dd missing semicolon generates error with some compilers 2024-02-13 18:18:27 +01:00
portelli e4a641b64e removing old Eigen tensor patch 2024-02-13 10:37:14 +01:00
portelli 8849f187f1 updating Eigen to 3.4.0 2024-02-13 10:30:22 +01:00