Peter Boyle
|
38b87de53f
|
This works around a stacksize limit on AMD GPU
|
2023-10-24 10:56:07 -04:00 |
|
Peter Boyle
|
aa5047a9e4
|
Faster blockProject blockPromote
|
2023-10-24 10:49:55 -04:00 |
|
Peter Boyle
|
24b6ee0df9
|
M4 file
|
2023-10-24 10:36:48 -04:00 |
|
Peter Boyle
|
1e79cc9cbe
|
Avoid compiler error
|
2023-10-24 10:36:09 -04:00 |
|
Peter Boyle
|
b3925df9c3
|
Verbose on CPU-GPU xfer, remove performance by default
|
2023-10-24 10:25:01 -04:00 |
|
Peter Boyle
|
351795ac3a
|
Better messaging
|
2023-10-20 19:33:04 -04:00 |
|
Peter Boyle
|
9c9c42d0df
|
Tests on frontier with real speed up . 3.5x on 16^3 at mq=0.01
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
b6ad1bafc7
|
Normal memory SendToRecvFrom asynchronous for use in general stencil
code
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
a5ca40f446
|
Better verbose -- track CPU GPU motion under --log Memory, others go to
debug output stream
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
9ab54c5565
|
Overlap comms & data copy/buffer assembly in Ghost zone exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
4341d96bde
|
Massively sped up coarse grid mult, comms
Save 3ms spend (60% of time !) on cudaMalloc !!
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
5fac47a26d
|
Faster halo exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
e064f17346
|
Faster halo exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
afe10ba2a2
|
More digits
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
7cc3435ba8
|
Imporved General coarsened matrix
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
541772313c
|
Verbosity
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
3747494a09
|
Notify delet public
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
f2b98d0dcc
|
Const safety
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
80471bf762
|
Alternate implementation involving face operations
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
a06f63c110
|
Improved I/O and non-lexico option exposed to SciDAC format
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
0ae4478cd9
|
Checkpoint the subspace and ldop
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
ae4e705e09
|
Use random vec as easier for debug
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
f5dcea9dbf
|
Updates for Frontier
|
2023-10-20 19:27:12 -04:00 |
|
Peter Boyle
|
2207309f8a
|
Spack rules
|
2023-10-16 18:38:24 -04:00 |
|
Peter Boyle
|
2111e7ab5f
|
Run at physical mass
|
2023-10-06 21:20:21 -04:00 |
|
Peter Boyle
|
d29abfdcaf
|
Transfer code to Frontier now
|
2023-10-06 21:03:34 -04:00 |
|
Peter Boyle
|
a751c42cc5
|
Checkpoint restore the setup
|
2023-10-06 21:03:08 -04:00 |
|
Peter Boyle
|
6a3bc9865e
|
Verbose change
|
2023-10-06 21:02:04 -04:00 |
|
Peter Boyle
|
4d5f7e4377
|
Verbose change
|
2023-10-06 21:01:37 -04:00 |
|
Peter Boyle
|
78b117fb78
|
Comment fix
|
2023-10-06 21:01:15 -04:00 |
|
Peter Boyle
|
ded63a1319
|
Verbose change/pretty print
|
2023-10-06 21:00:53 -04:00 |
|
Peter Boyle
|
df3e4d1e9c
|
Return fix
|
2023-10-06 21:00:21 -04:00 |
|
Peter Boyle
|
b58fd80379
|
I/O for coarse op and reorganise multigrid headers
|
2023-10-06 13:43:46 -04:00 |
|
Peter Boyle
|
7f6e0f57d0
|
No IO in file
|
2023-10-06 13:39:53 -04:00 |
|
Peter Boyle
|
cae27678d8
|
gpermute
|
2023-10-06 13:39:19 -04:00 |
|
Peter Boyle
|
48ff655bad
|
Slightly less verbose
|
2023-10-06 10:47:52 -04:00 |
|
Peter Boyle
|
2525ad4623
|
Slight clean up
|
2023-10-06 10:47:32 -04:00 |
|
Peter Boyle
|
e7020017c5
|
Reorganise multigrid
|
2023-10-06 10:47:12 -04:00 |
|
Peter Boyle
|
eacebfad74
|
Reorganise multigrid into multiple headers
|
2023-10-06 10:46:21 -04:00 |
|
Peter Boyle
|
3bc2da5321
|
Merge branch 'feature/scidac-wp1' of https://github.com/paboyle/Grid into feature/scidac-wp1
|
2023-10-05 16:57:59 -04:00 |
|
Peter Boyle
|
2d710d6bfd
|
Optimised parameters for 16^3
|
2023-10-05 16:56:55 -04:00 |
|
Peter Boyle
|
6532b7f32b
|
Eliminate older inefficient coarsening implementation
|
2023-10-05 16:56:15 -04:00 |
|
Peter Boyle
|
7b41b92d99
|
Only need to bad non-local dimensions
|
2023-10-05 16:55:48 -04:00 |
|
Peter Boyle
|
dd557af84b
|
ADEF1 and ADEF2 2 level CG
|
2023-10-05 16:55:19 -04:00 |
|
Peter Boyle
|
59b9d0e030
|
coalesceRead the blockSum
|
2023-10-05 16:54:48 -04:00 |
|
Peter Boyle
|
b82eee4733
|
Hermitian dealing with
|
2023-10-05 16:54:14 -04:00 |
|
Peter Boyle
|
6a87487544
|
Running on Frontier, fix RNG big volume y2k, affecting 5D RNG
|
2023-10-05 16:50:59 -04:00 |
|
Peter Boyle
|
fcf5023845
|
Running on Frontier
|
2023-10-05 16:50:59 -04:00 |
|
Peter Boyle
|
c8adad6d8b
|
First runs on Summit. PopulateAdag needs work
|
2023-10-05 16:50:54 -04:00 |
|
Peter Boyle
|
737d3ffb98
|
ADEF1 and 1 hop projection
|
2023-10-03 14:22:18 -04:00 |
|