Peter Boyle
7c686d29c9
Tracing replaces self timing
2022-08-31 17:10:17 -04:00
Peter Boyle
e8a0a1e75d
Tracing replaces self timing hooks
2022-08-31 17:09:47 -04:00
Peter Boyle
730be89abf
Remove timing hooks as tracing replaces
2022-08-31 17:08:44 -04:00
Peter Boyle
f991ad7d5c
Remove timing hooks as tracing replaces
2022-08-31 17:08:18 -04:00
Peter Boyle
b3f33f82f7
Decrease self timing hooks, use nvtx / roctx type tracing hooks instead
2022-08-31 17:06:47 -04:00
Peter Boyle
a34a6e059f
Logging improvement. Sinitial will be used to improve RHMC terms
2022-08-31 17:06:08 -04:00
Peter Boyle
1333319941
Tracing
2022-08-31 17:00:25 -04:00
Peter Boyle
9295ed8d20
Print full memory range
2022-08-31 16:59:51 -04:00
Peter Boyle
19cc7653fb
Tracing
2022-08-31 16:57:51 -04:00
Peter Boyle
5752538661
Tracing
2022-08-31 16:57:32 -04:00
Peter Boyle
ca40a1b00b
Tracing
2022-08-31 16:54:55 -04:00
Peter Boyle
659fac9dfb
Tracing hook
2022-08-31 16:54:25 -04:00
Peter Boyle
4dc3d6fce0
Buy into Nvidia/Rocm etc... tracing.
2022-08-31 16:53:19 -04:00
Gurtej Kanwar
60dfb49afa
Remove FP16 tests when FP16 is disabled
2022-08-21 17:29:55 +02:00
Gurtej Kanwar
554c238359
Update OpenSSL digest to use high-level methods
This avoids deprecation warnings when compiling against OpenSSL 3.0
but should still be backwards compatible. It is the recommended way
to use the digest API going forward.
2022-08-21 17:28:57 +02:00
Gurtej Kanwar
f922adf05e
Fix Photon ComplexField type
2022-08-21 16:16:18 +02:00
Peter Boyle
95b640cb6b
10TF/s on 32^3 x 64 on single node
2022-08-04 15:43:52 -04:00
Peter Boyle
2cb5bedc15
Copy stream HIP improvements
2022-08-04 15:24:03 -04:00
Peter Boyle
806b02bddf
Simplify dead code
2022-08-04 15:23:13 -04:00
Peter Boyle
de40395773
More timing. Think I should start to use nvtx and rocmtx ??
2022-08-04 13:37:16 -04:00
Peter Boyle
7ba4788715
Fix
2022-08-04 13:36:44 -04:00
Peter Boyle
06d9ce1a02
Synch ranks on node here for GPU - GPU memcopy
2022-08-04 13:35:56 -04:00
Peter Boyle
75bb6b2b40
Move barrier into the StencilSend begin routine
2022-08-04 13:35:26 -04:00
Peter Boyle
74f10c2dc0
Move barrier into Stencil Send
2022-08-04 13:34:11 -04:00
Peter Boyle
188d2c7a4d
PVC default, ignore ATS
2022-08-02 08:38:53 -07:00
Peter Boyle
17d7177105
Files for SYCL
2022-08-02 08:33:39 -07:00
Peter Boyle
bb0a0da47a
Non-blocking caution due to SYCL
2022-08-02 08:09:43 -07:00
Peter Boyle
84110166e4
Fix the fence
2022-08-02 08:00:43 -07:00
Peter Boyle
d32b923b6c
Fencing on a stream in SYCL is needed. Didn't know that ... gulp
2022-08-02 07:58:04 -07:00
Peter Boyle
a93d5459d4
Better mpi request completion
2022-07-28 12:18:35 -04:00
Peter Boyle
9c21add0c6
High res timer replaces gettimeofday
2022-07-28 12:14:03 -04:00
Peter Boyle
639aab6563
High res timer instead of gettimeofday
2022-07-28 12:13:35 -04:00
Peter Boyle
8137cc7049
Always concurrent comms
2022-07-28 12:01:51 -04:00
Peter Boyle
60e63dca1d
Add memory logging channel
2022-07-28 11:39:15 -04:00
Peter Boyle
486409574e
Expanded cache to avoid any allocs in HMC
2022-07-28 11:38:34 -04:00
Peter Boyle
a913b8be12
Dslash self timing. Might want to not have this
2022-07-28 11:37:55 -04:00
Peter Boyle
2239751850
Better logging
2022-07-28 11:37:36 -04:00
Peter Boyle
9b20f1449c
Better timing
2022-07-28 11:37:12 -04:00
Peter Boyle
b99453083d
Updated timing
2022-07-28 11:37:02 -04:00
Peter Boyle
2ab1af5754
Ensure no synchronize and not option dependent
2022-07-19 09:51:06 -07:00
Peter Boyle
5f8892bf03
Mistake pointed out by Camilo
2022-07-19 09:31:51 -07:00
Peter Boyle
f14e7e51e7
Grid accelerator
2022-07-12 10:56:22 -07:00
Peter Boyle
943fbb914d
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-07-11 13:48:42 -04:00
Peter Boyle
ca4603580d
Verbose
2022-07-11 13:48:35 -04:00
Peter Boyle
f73db8f1f3
Synch clocks
2022-07-11 13:47:39 -04:00
Peter Boyle
f7217d12d2
World barrier for clock synch
2022-07-11 13:45:31 -04:00
Peter Boyle
fab50c57d9
More logging
2022-07-11 18:42:27 +01:00
Peter Boyle
3440534fbf
MixedPrec support
2022-07-10 21:35:18 +01:00
Peter Boyle
177b1a7ec6
Mixed prec
2022-07-10 21:34:10 +01:00
Peter Boyle
58182fe345
Different approach to default dirichlet params
2022-07-10 21:32:58 +01:00