066544281f
Deprecate UVM
2024-09-17 13:34:27 +00:00
c9c4576237
Improved frontier cshift
2023-10-13 18:21:56 +03:00
7d62f1d6d2
Populate the Cshift_table in the GPU
...
Cshift is allocated in Unified memory and used
in the LambdaApply kernels but also populated
from the host. This creates a lot of Unified HtoD
and DtoH mem operations and has a negative effect
in performance. With this commit we populate the
Cshift table in the device with the
populate_Cshift_table() kernel.
2023-01-11 21:26:25 +00:00
6a429ee6d3
2d loop hits Nvidia 16bit limit on large local vols
2021-03-09 04:31:10 +01:00
50b808ab33
Configure option between host and device
2020-11-12 22:28:12 +01:00
6dbd117aa5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-09-03 20:30:49 -04:00
85b1c5df39
A never hit case that is not 100% confident is asserted for safety
2020-09-03 15:48:16 -04:00
5791021dcd
Speed up Cshift more with coalesced
2020-09-01 15:28:15 -04:00
69db4816f7
fix variable capture in Scatter_plane_merge on accelerators
2020-08-06 16:57:16 +02:00
1a4c8c3387
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
2020-06-05 18:52:35 -04:00
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile tiime option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
68541606ab
Thread loop changes. Soon try these with accelerator loops and benchmark
2019-06-15 07:46:42 +01:00
4a96c067ae
Remove warnings from NVCC
2019-01-01 13:43:09 +00:00
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
fb7d021b9d
Hadrons: moving Hadrons to root directory, build system improvements
2018-08-28 15:00:40 +01:00