|
93251bfba0
|
GET not put for better ordering in the downstream dependent kernels -- I
know when I'm done, so we can move a barrier / handshake between ranks
intranode to a point off critical path
|
2025-02-12 14:50:21 +00:00 |
|
|
0baaddbe98
|
Pipeline mode commit on Aurora. 5+ TF/s on 16^3x32 per tile at 384
nodes.
More concurrency/fine grained scheduling is possible.
|
2025-02-04 19:27:26 +00:00 |
|
|
c4fc972fec
|
Merge branch 'feature/deprecate-uvm' into develop
|
2025-01-31 16:32:36 +00:00 |
|
|
94019a922e
|
Significantly better performance on Aurora without using pipeline mode
|
2025-01-30 16:36:46 +00:00 |
|
|
d6b2727f86
|
Pipeline mode getting better -- 2 nodes @ 10TF/s per node on Aurora
|
2025-01-29 09:22:21 +00:00 |
|
|
295127d456
|
Deterministic homebrew reduction
|
2024-10-10 21:58:26 +00:00 |
|
|
ee4046fe92
|
Added a dimension ordered column sum based reduction for scalar.
Removes dependence on MPI_Allreduce and allows for work around on
systems where this is bollox.
|
2024-09-27 09:26:03 -04:00 |
|
|
b6ad1bafc7
|
Normal memory SendToRecvFrom asynchronous for use in general stencil
code
|
2023-10-20 19:27:13 -04:00 |
|
|
a11c12e2e7
|
Modifications for partial dirichlet BCs
|
2022-11-15 16:20:01 -05:00 |
|
|
551a5f8dc8
|
RRII gpu option
|
2022-10-11 14:44:55 -04:00 |
|
|
f7217d12d2
|
World barrier for clock synch
|
2022-07-11 13:45:31 -04:00 |
|
|
aab3bcb46f
|
Dirichlet first cut - wrong answers on dagger multiply.
Struggling to get a compute node so changing systems
|
2022-02-22 19:58:33 +00:00 |
|
|
109507888b
|
Option to force use of MPI over Nvlink
|
2021-09-21 00:53:25 +02:00 |
|
|
8195890640
|
Force MPI over NVLINK
|
2021-09-14 05:00:17 +01:00 |
|
|
cd99edcc5f
|
maxLocalNorm2()
|
2021-02-04 18:25:49 -05:00 |
|
|
c48909590b
|
MPI asynch call removal
|
2020-09-17 20:47:32 +01:00 |
|
|
856d168e41
|
global sum over vectors of uint64_t
|
2020-03-29 07:56:05 -04:00 |
|
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
|
fb7d021b9d
|
Hadrons: moving Hadrons to root directory, build system improvements
|
2018-08-28 15:00:40 +01:00 |
|