1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-09-20 09:15:38 +01:00
Commit Graph

6719 Commits

Author SHA1 Message Date
Peter Boyle
c7baeb5bae Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2021-09-14 08:31:11 -07:00
Peter Boyle
402d80e197 Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop 2021-09-14 16:16:06 +01:00
Peter Boyle
86e33c8ab2 Significant GPU perf speed up finished 2021-09-14 16:14:23 +01:00
Peter Boyle
5dae6a6dac Deprecate half prec comms 2021-09-14 15:06:59 +01:00
Peter Boyle
361bb8a101 Remove half prec comms 2021-09-14 15:06:29 +01:00
Peter Boyle
7efdb3cd2b Remove half prec comms 2021-09-14 15:06:06 +01:00
Peter Boyle
65ef4ec29f Move tables to device memory 2021-09-14 15:05:01 +01:00
Peter Boyle
d5835c0222 Switch to coalesced stencil face gather 2021-09-14 15:04:14 +01:00
Peter Boyle
a7b943b33e Remove half prec comms 2021-09-14 05:05:33 +01:00
Peter Boyle
7440cde92f No half prec comms; coalesced access on GPU 2021-09-14 05:04:56 +01:00
Peter Boyle
0fc662bb24 Dirac cuda 11.4 happy ; force host for functions accessing mult table
ET runs these on host BEFORE lodging result in AST for kernel
2021-09-14 05:00:44 +01:00
Peter Boyle
8195890640 Force MPI over NVLINK 2021-09-14 05:00:17 +01:00
Peter Boyle
4c88104a73 Fix compile warns 2021-09-11 23:08:05 +01:00
Peter Boyle
73b944c152 Drop half prec comms for now. 2021-09-11 23:07:18 +01:00
Peter Boyle
d1b0b7f5c6 Half prec comms dropping 2021-09-11 23:05:40 +01:00
Peter Boyle
381d8797d0 Drop half prec comms for now 2021-09-11 23:05:02 +01:00
11ee8a1061 Merge remote-tracking branch 'upstream/develop' into develop 2021-09-02 16:57:42 +01:00
Peter Boyle
b06526bc1e Comment update 2021-08-30 21:15:39 -04:00
Peter Boyle
3044419111 Some sample code 2021-08-30 20:32:11 -04:00
Peter Boyle
bcfa9cf068 Improvement of output 2021-08-28 08:08:15 -07:00
Peter Boyle
114920b8de Some example clean up 2021-08-25 12:24:17 +01:00
Peter Boyle
0d588b95f4 Bug fix to Example_Laplacian test 2021-08-23 23:14:26 +01:00
Peter Boyle
5b3c530aa7 Return value 2021-08-23 15:30:45 +01:00
Peter Boyle
c6a5499c8b Fail on non-apple 2021-08-22 18:40:55 +01:00
Peter Boyle
ec9c3fe77a Remove the file 2021-08-22 18:28:39 +01:00
Peter Boyle
6135ad530e Extra examples / solutions 2021-08-22 18:25:07 +01:00
Peter Boyle
40098424c7 Examples 2021-08-22 14:17:12 +01:00
Peter Boyle
7163b31a26 Examples 2021-08-20 01:15:23 +01:00
Peter Boyle
ffbdd91e0e Apple happiness 2021-08-20 01:15:00 +01:00
Peter Boyle
5d29e175d8 Typo fix 2021-08-10 18:25:43 +01:00
Peter Boyle
417dbfa257 Fix 2021-08-10 08:55:35 -07:00
peterx.a.boyle
1eda4d8e0b Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2021-08-10 05:41:18 -07:00
peterx.a.boyle
50181f16e5 Level 0 IPC set up 2021-08-10 05:35:15 -07:00
Peter Boyle
75030637cc Improved comms benchmark, same as benchmark_comms_host_device 2021-08-10 05:16:30 -07:00
Peter Boyle
fe5aaf7677 Make comms benchmark same as Benchmark_comms_host_device 2021-08-09 04:06:30 -07:00
Peter Boyle
80ac2a73ca Check is wrong (HtoD / DtoH) 2021-08-05 18:33:20 -04:00
d75a66a3e6 test done 2021-07-06 11:42:36 +01:00
fcc4374d7b i/o done 2021-07-05 14:52:00 +01:00
67c3c16fe5 working test 2021-07-05 14:41:52 +01:00
25e9be50b5 created test file 2021-07-02 15:51:19 +01:00
428b8ba907 Updated from upstream and added halo benchmark 2021-06-29 01:05:12 +01:00
Ed Bennett
323cf6c038 make message consistent with configure script 2021-06-23 17:00:43 +01:00
Peter Boyle
29a22ae603 Simpler SYCL setup 2021-06-22 17:57:20 +00:00
Peter Boyle
403bff1a47 Force reqd subgroup size fo SYCL 2021-06-22 17:56:10 +00:00
Christoph Lehner
c50f27e68b Make FFT play nice with split grid 2021-06-20 11:34:38 +02:00
Peter Georg
80afacec5b nvcc: Add -fopenmp to LDFLAGS 2021-06-17 13:05:13 +02:00
Peter Boyle
6cd9224dd7 SYCL comms buffer allocate 2021-06-16 17:10:55 +00:00
Peter Boyle
4bf8196ff1 Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop 2021-06-15 21:45:36 +00:00
Peter Boyle
4c5440fb06 const happy for sycl 2021-06-15 21:45:07 +00:00
a269a3d919
Merge pull request #358 from mmphys/feature/serialisation-test
Add a ragged std::vector to the serialisation test
2021-06-09 10:16:25 +01:00