Peter Boyle
|
86e33c8ab2
|
Significant GPU perf speed up finished
|
2021-09-14 16:14:23 +01:00 |
|
Peter Boyle
|
5dae6a6dac
|
Deprecate half prec comms
|
2021-09-14 15:06:59 +01:00 |
|
Peter Boyle
|
361bb8a101
|
Remove half prec comms
|
2021-09-14 15:06:29 +01:00 |
|
Peter Boyle
|
7efdb3cd2b
|
Remove half prec comms
|
2021-09-14 15:06:06 +01:00 |
|
Peter Boyle
|
65ef4ec29f
|
Move tables to device memory
|
2021-09-14 15:05:01 +01:00 |
|
Peter Boyle
|
d5835c0222
|
Switch to coalesced stencil face gather
|
2021-09-14 15:04:14 +01:00 |
|
Peter Boyle
|
a7b943b33e
|
Remove half prec comms
|
2021-09-14 05:05:33 +01:00 |
|
Peter Boyle
|
7440cde92f
|
No half prec comms; coalesced access on GPU
|
2021-09-14 05:04:56 +01:00 |
|
Peter Boyle
|
0fc662bb24
|
Dirac cuda 11.4 happy ; force host for functions accessing mult table
ET runs these on host BEFORE lodging result in AST for kernel
|
2021-09-14 05:00:44 +01:00 |
|
Peter Boyle
|
8195890640
|
Force MPI over NVLINK
|
2021-09-14 05:00:17 +01:00 |
|
Peter Boyle
|
4c88104a73
|
Fix compile warns
|
2021-09-11 23:08:05 +01:00 |
|
Peter Boyle
|
73b944c152
|
Drop half prec comms for now.
|
2021-09-11 23:07:18 +01:00 |
|
Peter Boyle
|
d1b0b7f5c6
|
Half prec comms dropping
|
2021-09-11 23:05:40 +01:00 |
|
Peter Boyle
|
381d8797d0
|
Drop half prec comms for now
|
2021-09-11 23:05:02 +01:00 |
|
|
11ee8a1061
|
Merge remote-tracking branch 'upstream/develop' into develop
|
2021-09-02 16:57:42 +01:00 |
|
Peter Boyle
|
b06526bc1e
|
Comment update
|
2021-08-30 21:15:39 -04:00 |
|
Peter Boyle
|
3044419111
|
Some sample code
|
2021-08-30 20:32:11 -04:00 |
|
Peter Boyle
|
bcfa9cf068
|
Improvement of output
|
2021-08-28 08:08:15 -07:00 |
|
Peter Boyle
|
114920b8de
|
Some example clean up
|
2021-08-25 12:24:17 +01:00 |
|
Peter Boyle
|
0d588b95f4
|
Bug fix to Example_Laplacian test
|
2021-08-23 23:14:26 +01:00 |
|
Peter Boyle
|
5b3c530aa7
|
Return value
|
2021-08-23 15:30:45 +01:00 |
|
Peter Boyle
|
c6a5499c8b
|
Fail on non-apple
|
2021-08-22 18:40:55 +01:00 |
|
Peter Boyle
|
ec9c3fe77a
|
Remove the file
|
2021-08-22 18:28:39 +01:00 |
|
Peter Boyle
|
6135ad530e
|
Extra examples / solutions
|
2021-08-22 18:25:07 +01:00 |
|
Peter Boyle
|
40098424c7
|
Examples
|
2021-08-22 14:17:12 +01:00 |
|
Peter Boyle
|
7163b31a26
|
Examples
|
2021-08-20 01:15:23 +01:00 |
|
Peter Boyle
|
ffbdd91e0e
|
Apple happiness
|
2021-08-20 01:15:00 +01:00 |
|
Peter Boyle
|
5d29e175d8
|
Typo fix
|
2021-08-10 18:25:43 +01:00 |
|
Peter Boyle
|
417dbfa257
|
Fix
|
2021-08-10 08:55:35 -07:00 |
|
peterx.a.boyle
|
1eda4d8e0b
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-08-10 05:41:18 -07:00 |
|
peterx.a.boyle
|
50181f16e5
|
Level 0 IPC set up
|
2021-08-10 05:35:15 -07:00 |
|
Peter Boyle
|
75030637cc
|
Improved comms benchmark, same as benchmark_comms_host_device
|
2021-08-10 05:16:30 -07:00 |
|
Peter Boyle
|
fe5aaf7677
|
Make comms benchmark same as Benchmark_comms_host_device
|
2021-08-09 04:06:30 -07:00 |
|
Peter Boyle
|
80ac2a73ca
|
Check is wrong (HtoD / DtoH)
|
2021-08-05 18:33:20 -04:00 |
|
Andrew Yong
|
770680669d
|
Whitespace removal.
|
2021-08-04 09:21:59 +01:00 |
|
Andrew Yong
|
0cdfc5cf22
|
Merge remote-tracking branch 'upstream/develop' into develop
|
2021-07-30 14:40:55 +01:00 |
|
|
d75a66a3e6
|
test done
|
2021-07-06 11:42:36 +01:00 |
|
|
fcc4374d7b
|
i/o done
|
2021-07-05 14:52:00 +01:00 |
|
|
67c3c16fe5
|
working test
|
2021-07-05 14:41:52 +01:00 |
|
|
25e9be50b5
|
created test file
|
2021-07-02 15:51:19 +01:00 |
|
|
428b8ba907
|
Updated from upstream and added halo benchmark
|
2021-06-29 01:05:12 +01:00 |
|
Ed Bennett
|
323cf6c038
|
make message consistent with configure script
|
2021-06-23 17:00:43 +01:00 |
|
Peter Boyle
|
29a22ae603
|
Simpler SYCL setup
|
2021-06-22 17:57:20 +00:00 |
|
Peter Boyle
|
403bff1a47
|
Force reqd subgroup size fo SYCL
|
2021-06-22 17:56:10 +00:00 |
|
Christoph Lehner
|
c50f27e68b
|
Make FFT play nice with split grid
|
2021-06-20 11:34:38 +02:00 |
|
Peter Georg
|
80afacec5b
|
nvcc: Add -fopenmp to LDFLAGS
|
2021-06-17 13:05:13 +02:00 |
|
Peter Boyle
|
6cd9224dd7
|
SYCL comms buffer allocate
|
2021-06-16 17:10:55 +00:00 |
|
Peter Boyle
|
4bf8196ff1
|
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
|
2021-06-15 21:45:36 +00:00 |
|
Peter Boyle
|
4c5440fb06
|
const happy for sycl
|
2021-06-15 21:45:07 +00:00 |
|
|
a269a3d919
|
Merge pull request #358 from mmphys/feature/serialisation-test
Add a ragged std::vector to the serialisation test
|
2021-06-09 10:16:25 +01:00 |
|