Peter Boyle
|
a11c12e2e7
|
Modifications for partial dirichlet BCs
|
2022-11-15 16:20:01 -05:00 |
|
Peter Boyle
|
1177b8f661
|
Merge branch 'develop' into feature/dirichlet
|
2022-08-31 19:05:57 -04:00 |
|
Peter Boyle
|
06d9ce1a02
|
Synch ranks on node here for GPU - GPU memcopy
|
2022-08-04 13:35:56 -04:00 |
|
Peter Boyle
|
8137cc7049
|
Allways concurrent comms
|
2022-07-28 12:01:51 -04:00 |
|
Peter Boyle
|
2ab1af5754
|
Ensure no synchronize and not optoin dependent
|
2022-07-19 09:51:06 -07:00 |
|
Peter Boyle
|
f7217d12d2
|
World barrier for clock synch
|
2022-07-11 13:45:31 -04:00 |
|
Peter Boyle
|
7eb29cf529
|
MPI fix
|
2022-05-28 15:51:34 -07:00 |
|
Peter Boyle
|
3f31afa4fc
|
Clean up verbose
|
2022-05-24 18:18:51 -07:00 |
|
Peter Boyle
|
aab3bcb46f
|
Dirichlet first cut - wrong answers on dagger multiply.
Struggling to get a compute node so changing systems
|
2022-02-22 19:58:33 +00:00 |
|
Peter Boyle
|
135808dcfa
|
Less verbose
|
2021-12-07 16:24:24 -05:00 |
|
Peter Boyle
|
2bf3b4d576
|
Update to reduce memory footpring in benchmark test
|
2021-12-07 09:02:02 -08:00 |
|
Peter Boyle
|
16c2a99965
|
Overlap cudamemcpy - didn't set up stream right
|
2021-10-11 13:31:26 -07:00 |
|
Peter Boyle
|
c0d56a1c04
|
Perlmutter tune up
|
2021-09-22 06:02:34 -07:00 |
|
Peter Boyle
|
ca9816bfbb
|
Typo
|
2021-09-21 04:12:04 +02:00 |
|
Peter Boyle
|
109507888b
|
Option to force use of MPI over Nvlink
|
2021-09-21 00:53:25 +02:00 |
|
Peter Boyle
|
8195890640
|
Force MPI over NVLINK
|
2021-09-14 05:00:17 +01:00 |
|
Peter Boyle
|
cd99edcc5f
|
maxLocalNorm2()
|
2021-02-04 18:25:49 -05:00 |
|
Peter Boyle
|
d05ce01809
|
TOFU behaviour now optional THREAD_MULTIPLE or THREAD_SERIALIZED
|
2020-11-13 03:52:19 +01:00 |
|
Peter Boyle
|
a8309638d4
|
UVM check in MPI calls
|
2020-09-03 20:29:26 -04:00 |
|
Peter Boyle
|
0c3095e173
|
Comms buffers to device memory
|
2020-09-03 15:45:35 -04:00 |
|
Christoph Lehner
|
197612bc7a
|
fast cpu basisRotate and other small cleanups
|
2020-07-30 07:08:54 -04:00 |
|
nmeyer-ur
|
8726e94ea7
|
merge upstream develop
|
2020-07-07 20:26:47 +02:00 |
|
nmeyer-ur
|
1635c263ee
|
disable TOFU by default
|
2020-06-30 19:27:08 +02:00 |
|
nmeyer-ur
|
465856331a
|
switch back to serialized; wrong results on single too
|
2020-06-15 15:39:39 +02:00 |
|
nmeyer-ur
|
cc958aa9ed
|
switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap
|
2020-06-15 14:21:38 +02:00 |
|
nmeyer-ur
|
4fedd8d29f
|
switch to MPI_THREAD_SERIALIZED instead of SINGLE
|
2020-05-27 14:08:34 +02:00 |
|
nmeyer-ur
|
9a86059761
|
symmetrize VLA and fixed size build messages
|
2020-05-20 20:05:42 +02:00 |
|
nmeyer-ur
|
b780b7b7a0
|
guard prevents multiple TOFU messages
|
2020-05-20 19:20:59 +02:00 |
|
nmeyer-ur
|
fc2e9850d3
|
temporarily enable TOFU by default when using A64FX or A64FXFIXEDSIZE
|
2020-05-11 13:25:02 +02:00 |
|
nmeyer-ur
|
ffaaed679e
|
MPI_THREAD_SINGLE hack for Fugaku, enabled by -DTOFU
|
2020-05-11 13:21:39 +02:00 |
|
Christoph Lehner
|
856d168e41
|
global sum over vectors of uint64_t
|
2020-03-29 07:56:05 -04:00 |
|
Peter Boyle
|
fa9cd50c5b
|
Merge branch 'develop' into feature/gpu-port
|
2019-07-16 11:55:17 +01:00 |
|
Peter Boyle
|
780a67844e
|
Simple checks
|
2019-04-17 12:07:17 +01:00 |
|
|
f80c548365
|
quieter initialisation
|
2019-02-10 20:47:35 +00:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
Peter Boyle
|
839605c45c
|
Verbose reduce
|
2018-11-07 23:38:46 +00:00 |
|
|
fb7d021b9d
|
Hadrons: moving Hadrons to root directory, build system improvements
|
2018-08-28 15:00:40 +01:00 |
|