1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-04-19 10:11:02 +01:00
Commit Graph

8232 Commits

Author SHA1 Message Date
Peter Boyle e8a0a1e75d Tracing replaces self timing hooks 2022-08-31 17:09:47 -04:00
Peter Boyle 730be89abf Remove timing hooks as tracing replaces 2022-08-31 17:08:44 -04:00
Peter Boyle f991ad7d5c Remove timing hooks as tracing replaces 2022-08-31 17:08:18 -04:00
Peter Boyle b3f33f82f7 Decrease self timing hooks, use nvtx / roctx type tracing hooks instead 2022-08-31 17:06:47 -04:00
Peter Boyle a34a6e059f Logging improvement. Sinitial will be used to improve RHMC terms 2022-08-31 17:06:08 -04:00
Peter Boyle 1333319941 Tracing 2022-08-31 17:00:25 -04:00
Peter Boyle 9295ed8d20 Print full memory range 2022-08-31 16:59:51 -04:00
Peter Boyle 19cc7653fb Tracing 2022-08-31 16:57:51 -04:00
Peter Boyle 5752538661 Tracing 2022-08-31 16:57:32 -04:00
Peter Boyle ca40a1b00b Tracing 2022-08-31 16:54:55 -04:00
Peter Boyle 659fac9dfb Tracing hook 2022-08-31 16:54:25 -04:00
Peter Boyle 4dc3d6fce0 Buy into Nvidia/Rocm etc... tracing. 2022-08-31 16:53:19 -04:00
Gurtej Kanwar 60dfb49afa Remove FP16 tests when FP16 is disabled 2022-08-21 17:29:55 +02:00
Gurtej Kanwar 554c238359 Update OpenSSL digest to use high-level methods
This avoids deprecation warnings when compiling against OpenSSL 3.0
but should still be backwards compatible. It is the recommended way
to use the digest API going forward.
2022-08-21 17:28:57 +02:00
Gurtej Kanwar f922adf05e Fix Photon ComplexField type 2022-08-21 16:16:18 +02:00
Peter Boyle 95b640cb6b 10TF/s on 32^3 x 64 on single node 2022-08-04 15:43:52 -04:00
Peter Boyle 2cb5bedc15 Copy stream HIP improvements 2022-08-04 15:24:03 -04:00
Peter Boyle 806b02bddf Simplify dead code 2022-08-04 15:23:13 -04:00
Peter Boyle de40395773 More timing. Think I should start to use nvtx and rocmtx ?? 2022-08-04 13:37:16 -04:00
Peter Boyle 7ba4788715 Fix 2022-08-04 13:36:44 -04:00
Peter Boyle 06d9ce1a02 Synch ranks on node here for GPU - GPU memcopy 2022-08-04 13:35:56 -04:00
Peter Boyle 75bb6b2b40 Move barrier into the StencilSend begin routine 2022-08-04 13:35:26 -04:00
Peter Boyle 74f10c2dc0 Move barrier into Stencil Send 2022-08-04 13:34:11 -04:00
Peter Boyle 188d2c7a4d PVC default, ignore ATS 2022-08-02 08:38:53 -07:00
Peter Boyle 17d7177105 Files for SYCL 2022-08-02 08:33:39 -07:00
Peter Boyle bb0a0da47a inon blocking caution due to SYCL 2022-08-02 08:09:43 -07:00
Peter Boyle 84110166e4 Fix the fence 2022-08-02 08:00:43 -07:00
Peter Boyle d32b923b6c Fencing on a stream in SYCL is needed. Didn't know that ... gulp 2022-08-02 07:58:04 -07:00
Peter Boyle a93d5459d4 Better mpi request completion 2022-07-28 12:18:35 -04:00
Peter Boyle 9c21add0c6 High res timer replaces getttimeofday 2022-07-28 12:14:03 -04:00
Peter Boyle 639aab6563 High res timer instead of gettimeofday 2022-07-28 12:13:35 -04:00
Peter Boyle 8137cc7049 Allways concurrent comms 2022-07-28 12:01:51 -04:00
Peter Boyle 60e63dca1d Add memory logging channel 2022-07-28 11:39:15 -04:00
Peter Boyle 486409574e Expanded cach to avoid any allocs in HMC 2022-07-28 11:38:34 -04:00
Peter Boyle a913b8be12 Dslash self timing. Might want to not have this 2022-07-28 11:37:55 -04:00
Peter Boyle 2239751850 Better logging 2022-07-28 11:37:36 -04:00
Peter Boyle 9b20f1449c Better timing 2022-07-28 11:37:12 -04:00
Peter Boyle b99453083d Updated timing 2022-07-28 11:37:02 -04:00
Peter Boyle 2ab1af5754 Ensure no synchronize and not optoin dependent 2022-07-19 09:51:06 -07:00
Peter Boyle 5f8892bf03 Mistake pointed out by Camilo 2022-07-19 09:31:51 -07:00
Peter Boyle f14e7e51e7 Grid accelerator 2022-07-12 10:56:22 -07:00
Peter Boyle 943fbb914d Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet 2022-07-11 13:48:42 -04:00
Peter Boyle ca4603580d Verbose 2022-07-11 13:48:35 -04:00
Peter Boyle f73db8f1f3 Synch clocks 2022-07-11 13:47:39 -04:00
Peter Boyle f7217d12d2 World barrier for clock synch 2022-07-11 13:45:31 -04:00
Peter Boyle fab50c57d9 More loggin 2022-07-11 18:42:27 +01:00
Peter Boyle 3440534fbf MixedPrec support 2022-07-10 21:35:18 +01:00
Peter Boyle 177b1a7ec6 Mixed prec 2022-07-10 21:34:10 +01:00
Peter Boyle 58182fe345 Different approach to default dirichlet params 2022-07-10 21:32:58 +01:00
Peter Boyle 1f907d330d Different default params for dirichlet 2022-07-10 21:31:48 +01:00