Peter Boyle
|
2c8c3be9ee
|
Adef2Mrhs
|
2024-04-05 00:57:13 -04:00 |
|
Peter Boyle
|
5b79d51c22
|
Improvements
|
2024-04-01 14:18:40 -04:00 |
|
Peter Boyle
|
da890dc293
|
Verbose changes
|
2024-04-01 14:18:00 -04:00 |
|
Peter Boyle
|
93d0a1e73a
|
HISQ view call
|
2024-04-01 14:16:47 -04:00 |
|
Peter Boyle
|
f0a8c7d045
|
Playing with chebyshevs
|
2024-04-01 14:16:11 -04:00 |
|
Peter Boyle
|
db8793777c
|
Logging/verbose
|
2024-04-01 14:15:41 -04:00 |
|
Peter Boyle
|
c745484e65
|
9.5x speed up version
|
2024-04-01 14:14:30 -04:00 |
|
Peter Boyle
|
cc04dc42dc
|
Merge branch 'develop' into feature/scidac-wp1
|
2024-03-06 14:55:21 -05:00 |
|
Peter Boyle
|
070b61f08f
|
Simplifying the MultiRHS solver to make it do SRHS *and* MRHS
|
2024-03-06 14:04:33 -05:00 |
|
Peter Boyle
|
7e5bd46dd3
|
Booster update
|
2024-03-06 19:03:45 +01:00 |
|
Peter Boyle
|
228bbb9d81
|
Benchmark results
|
2024-03-06 19:03:35 +01:00 |
|
|
b812a7b4c6
|
Staggered launch script
|
2024-03-06 01:32:40 +00:00 |
|
|
891a366f73
|
Repro CG script
|
2024-03-06 01:22:55 +00:00 |
|
|
10116b3be8
|
Force device copyable and tell SYCL to shut it.
|
2024-03-06 01:13:27 +00:00 |
|
|
a46a0f0882
|
force device copyable and don't take crap from SYCL
|
2024-03-06 01:12:49 +00:00 |
|
|
a26a8a38f4
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2024-03-06 00:05:00 +00:00 |
|
|
7435315d50
|
More blasted shell variables
|
2024-03-06 00:03:59 +00:00 |
|
|
9b5f741e85
|
Reproducing CG can be more useful now
|
2024-03-06 00:03:16 +00:00 |
|
|
517822fdd2
|
SPR HBM benchmarking right and also PVC batched GEMM
|
2024-03-06 00:02:27 +00:00 |
|
|
1b93a9be88
|
Print out the hostname
|
2024-03-06 00:01:58 +00:00 |
|
|
783a66b348
|
Deterministic reduction please
|
2024-03-06 00:01:37 +00:00 |
|
|
976c3e9b59
|
Hack for flight logging CG inner products.
Can be made to work, but could put in some more serious infrastructure
for repro testing and blame attribution (Britney test) if necessary
|
2024-03-05 23:59:57 +00:00 |
|
|
f8ca971dae
|
Use of a bare PRECISION macro is not namespace safe and collides with
SYCL
|
2024-03-05 23:59:13 +00:00 |
|
|
21bc8c24df
|
OneMKL batched blas starting
|
2024-03-05 23:58:20 +00:00 |
|
|
30228214f7
|
SYCL conflict with Eigen
|
2024-03-05 23:56:10 +00:00 |
|
Peter Boyle
|
2ae980ae43
|
Update sourceme.sh
|
2024-03-05 13:39:18 -05:00 |
|
Peter Boyle
|
6153dec2e4
|
Update setup.sh
|
2024-03-05 13:38:32 -05:00 |
|
Peter Boyle
|
c805f86343
|
USQCD benchmark
|
2024-03-01 00:05:04 -05:00 |
|
Peter Boyle
|
04ca065281
|
Only one rank opens
|
2024-02-29 20:09:11 -05:00 |
|
Peter Boyle
|
88d8fa43d7
|
Benchmark development
|
2024-02-29 20:01:44 -05:00 |
|
Peter Boyle
|
3c49762875
|
Propagate in the blas routine
|
2024-02-29 15:33:06 -05:00 |
|
Peter Boyle
|
436bf1d9d3
|
Merge pull request #455 from clarkedavida/hisq_fat_links
Hisq fat links
|
2024-02-29 15:29:39 -05:00 |
|
david clarke
|
f70df6e195
|
changed NO_SHIFT and BACKWARD_CONST from define to enum
|
2024-02-29 12:29:30 -07:00 |
|
Peter Boyle
|
fce3852dff
|
Merge pull request #451 from paboyle/feature/eigen-3.4.0-update
updating Eigen to 3.4.0
|
2024-02-28 18:03:37 -05:00 |
|
Peter Boyle
|
ee1b8bbdbd
|
Merge pull request #454 from edbennett/adjoint-broke
fix HMC for non-fundamental representations
|
2024-02-28 14:05:27 -05:00 |
|
Peter Boyle
|
3f1636637d
|
Merge pull request #453 from dbollweg/feature/sliceSum_gpu
Feature/slice sum gpu
|
2024-02-28 14:04:43 -05:00 |
|
Peter Boyle
|
2e570f5300
|
Merge pull request #457 from lehner/feature/gpt
Import GPT-related updates
|
2024-02-28 13:59:04 -05:00 |
|
Christoph Lehner
|
9f89486df5
|
remove unnecessary code path
|
2024-02-28 19:56:23 +01:00 |
|
Christoph Lehner
|
22b43b86cb
|
Make GPT test suite work with SYCL
|
2024-02-28 12:57:17 +01:00 |
|
dbollweg
|
3c9012676a
|
CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case.
|
2024-02-27 12:41:45 -05:00 |
|
Peter Boyle
|
ee3b3c4c56
|
relocate deflation support
|
2024-02-27 11:52:23 -05:00 |
|
Peter Boyle
|
462d706a63
|
Move to a blas directory
|
2024-02-27 11:51:04 -05:00 |
|
Peter Boyle
|
ee0d460c8e
|
Blas based block project & deflate for multiRHS
|
2024-02-27 11:41:44 -05:00 |
|
Peter Boyle
|
cd15abe9d1
|
Mrhs prep
|
2024-02-27 11:41:13 -05:00 |
|
Peter Boyle
|
9f40467e24
|
Warning squash
|
2024-02-27 11:40:36 -05:00 |
|
Peter Boyle
|
d0b6593823
|
More verbose on checksum
|
2024-02-27 11:40:14 -05:00 |
|
Peter Boyle
|
79fc821d8d
|
reorg headers
|
2024-02-27 11:39:37 -05:00 |
|
Peter Boyle
|
d7fdb9a7e6
|
Reorg headers
|
2024-02-27 11:39:06 -05:00 |
|
Peter Boyle
|
b74de51c18
|
Reorder headers
|
2024-02-27 11:38:52 -05:00 |
|
Dennis Bollweg
|
b507fe209c
|
Added SpinColourMatrix case to sliceSum Test
|
2024-02-27 11:28:32 -05:00 |
|