Peter Boyle
|
5a4f9bf2e3
|
Force the ROCM version
|
2024-10-29 18:12:31 -04:00 |
|
Peter Boyle
|
f617468e04
|
Update Lattice_base.h
|
2024-10-11 10:39:16 -04:00 |
|
Peter Boyle
|
ee4046fe92
|
Added a dimension ordered column sum based reduction for scalar.
Removes dependence on MPI_Allreduce and allows for work around on
systems where this is bollox.
|
2024-09-27 09:26:03 -04:00 |
|
Peter Boyle
|
2a9cfeb9ea
|
New files
|
2024-09-26 14:23:29 -04:00 |
|
Peter Boyle
|
1147b8ea40
|
Cheby poly setup
|
2024-09-26 14:20:32 -04:00 |
|
Peter Boyle
|
3f9119b39d
|
Remove vectors used for the power spectrum table in paper
|
2024-09-26 14:19:41 -04:00 |
|
Peter Boyle
|
35e8225abd
|
Verbose control
|
2024-09-26 14:18:35 -04:00 |
|
Peter Boyle
|
bdbfbb7a14
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2024-09-26 14:05:45 -04:00 |
|
Peter Boyle
|
f7d4be8d96
|
Calculate bytes correctly
|
2024-09-26 14:04:44 -04:00 |
|
Peter Boyle
|
aa67a5b095
|
Rename
|
2024-08-27 19:54:01 +00:00 |
|
Peter Boyle
|
af9ea0864c
|
Blas fix
|
2024-08-27 19:53:09 +00:00 |
|
Peter Boyle
|
4e2a6d87c4
|
Gemm batched fix
|
2024-08-27 19:24:05 +00:00 |
|
Peter Boyle
|
a465ecece9
|
Aurora
|
2024-08-27 19:20:43 +00:00 |
|
Peter Boyle
|
575eb72182
|
Converges on 16^3
|
2024-08-27 19:20:38 +00:00 |
|
Peter Boyle
|
3a973914d6
|
Compile on frontier
|
2024-08-27 14:55:42 -04:00 |
|
Peter Boyle
|
f568c07bbd
|
Improved the BLAS benchmark
|
2024-08-27 14:53:54 -04:00 |
|
Peter Boyle
|
2c9878fc3a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2024-08-27 12:05:46 -04:00 |
|
Peter Boyle
|
27b1b1b005
|
Checkerboard available for offloading pickCheckerboard
|
2024-08-27 12:04:09 -04:00 |
|
Peter Boyle
|
130d7ab077
|
Verbose changes
|
2024-08-27 12:03:28 -04:00 |
|
Peter Boyle
|
29f6b8a74a
|
Setup
|
2024-08-27 12:02:49 -04:00 |
|
Peter Boyle
|
9779aaea33
|
16^3 optimise
|
2024-08-27 11:38:35 -04:00 |
|
Peter Boyle
|
ec25604a67
|
Fastest solver for mrhs multigrid
|
2024-08-27 11:32:34 -04:00 |
|
Peter Boyle
|
3668e81c5e
|
Extract slice working on checkerboard field for Block Lanczos
|
2024-08-27 11:31:30 -04:00 |
|
Peter Boyle
|
d66b2423cb
|
Move slice operations to GPU for BlockCG
|
2024-08-27 11:28:47 -04:00 |
|
Peter Boyle
|
15cc78f0b6
|
peek/poke local site on checkerboard arrays
|
2024-08-27 11:23:42 -04:00 |
|
Peter Boyle
|
06db4ddea2
|
Fast init on GPU
|
2024-08-27 11:22:33 -04:00 |
|
Peter Boyle
|
6cfb90e99f
|
Support needed for accelerator resident set/pick Checkerboard
|
2024-08-27 11:19:00 -04:00 |
|
Peter Boyle
|
d8be95a2a3
|
Don't early terminate power method to get more accurate top EV
|
2024-08-27 11:17:37 -04:00 |
|
Peter Boyle
|
f82702872d
|
Normal residual
|
2024-08-27 11:16:44 -04:00 |
|
Peter Boyle
|
3752c49ef0
|
Add option to record the CG polynomial
|
2024-08-27 11:14:35 -04:00 |
|
Peter Boyle
|
fe65fa4988
|
MulMatrix
|
2024-08-27 11:13:18 -04:00 |
|
Peter Boyle
|
1fe4c205a3
|
Adef
|
2024-08-27 11:11:47 -04:00 |
|
Peter Boyle
|
d4dc5e0f43
|
BlockCG linalg acceleratoin with BLAS
|
2024-08-27 11:08:33 -04:00 |
|
Peter Boyle
|
77944437ce
|
Functor initialisation
|
2024-08-27 11:01:02 -04:00 |
|
Peter Boyle
|
c164bff758
|
MMdag
|
2024-08-27 11:00:36 -04:00 |
|
Peter Boyle
|
aa2e3d954a
|
MMdag operator
|
2024-08-27 10:59:29 -04:00 |
|
Peter Boyle
|
de62b04728
|
Block CG linalg acceleration
|
2024-08-27 10:58:54 -04:00 |
|
Peter Boyle
|
d0bdb50f24
|
Analyse power spectrum
|
2024-08-27 10:58:19 -04:00 |
|
Peter Boyle
|
a8fecbc609
|
BlockCG linalg via BLAS
|
2024-08-21 16:08:16 -04:00 |
|
Peter Boyle
|
557fa483ff
|
Blas benchmark committed stand alone
|
2024-08-20 16:18:43 +00:00 |
|
Peter Boyle
|
fc15d55df6
|
Mallinfo
|
2024-08-20 14:33:09 +00:00 |
|
Peter Boyle
|
53573d7d94
|
Better benchmark
|
2024-08-20 14:31:57 +00:00 |
|
Peter Boyle
|
bb3c177000
|
Better benchmarking
|
2024-08-20 14:31:41 +00:00 |
|
Peter Boyle
|
a3322b470f
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2024-08-20 14:30:52 +00:00 |
|
Peter Boyle
|
f8f408e7a9
|
BLAS everywhere
|
2024-07-25 18:09:02 +00:00 |
|
Peter Boyle
|
baac1127d0
|
Later intel compiler happiness
|
2024-07-25 18:06:05 +00:00 |
|
Peter Boyle
|
6f1328160c
|
Remove SVM use
|
2024-07-25 18:05:40 +00:00 |
|
Peter Boyle
|
04cf902791
|
Mallinfo and ASAN hooks
|
2024-07-25 18:04:56 +00:00 |
|
Peter Boyle
|
7a5b1c1a19
|
Try Catch convenience macro
|
2024-07-25 18:03:41 +00:00 |
|
Peter Boyle
|
18d2d7da4a
|
Eigen implementation and SYCL implementation
|
2024-07-25 18:02:56 +00:00 |
|