|
da890dc293
|
Verbose changes
|
2024-04-01 14:18:00 -04:00 |
|
|
070b61f08f
|
Simplifying the MultiRHS solver to make it do SRHS *and* MRHS
|
2024-03-06 14:04:33 -05:00 |
|
|
3d13fd56c5
|
Precompute phases, save memory in hermitian
|
2024-01-22 17:43:35 -05:00 |
|
|
42ae36bc28
|
Working
|
2024-01-17 16:39:14 -05:00 |
|
|
defd814750
|
Speed up the coarsened matrix matrix evaluation.
It is limited by the block project step.
Could be sped up with calls to Batched GEMM and a data layout change.
|
2023-12-22 18:07:03 -05:00 |
|
|
0cce97a4fe
|
verbosity only
|
2023-12-20 21:30:10 -05:00 |
|
|
639cc6f73a
|
Better support for multiRHS coarse space
Still to do: restrict the domain of the last loop to the interior of the padded cell (expect about 4.5x on the test volume on Crusher)
|
2023-11-23 18:16:26 -05:00 |
|
|
8bece1f861
|
Faster to transpose the matrix and apply with column major order
|
2023-11-15 17:58:38 -05:00 |
|
|
38b87de53f
|
This works around a stacksize limit on AMD GPU
|
2023-10-24 10:56:07 -04:00 |
|
|
4341d96bde
|
Massively sped up coarse grid mult, comms
Saves 3 ms (60% of the time!) spent on cudaMalloc
|
2023-10-20 19:27:13 -04:00 |
|
|
e064f17346
|
Faster halo exchange
|
2023-10-20 19:27:13 -04:00 |
|
|
7cc3435ba8
|
Improved general coarsened matrix
|
2023-10-20 19:27:13 -04:00 |
|
|
f5dcea9dbf
|
Updates for Frontier
|
2023-10-20 19:27:12 -04:00 |
|
|
6a3bc9865e
|
Verbose change
|
2023-10-06 21:02:04 -04:00 |
|
|
eacebfad74
|
Reorganise multigrid into multiple headers
|
2023-10-06 10:46:21 -04:00 |
|