3dbfce5223
Tests clean build on HIP
2022-11-16 20:15:51 -05:00
bb0a0da47a
inon blocking caution due to SYCL
2022-08-02 08:09:43 -07:00
a4ce6e42c7
Warning free compile on make all and make tests under nvcc
2021-10-27 00:27:03 +01:00
2bb374daea
hip-friendly
2021-03-19 11:33:23 +01:00
281ac5fc12
Red black support on coars
2021-01-14 20:48:08 -05:00
4d2dc7ba03
Enable even-odd for CoarsenedMatrix
2020-09-11 20:32:02 +02:00
cf3535d16e
Expose more functions in CMat
2020-08-27 14:06:48 +02:00
b2087f14c4
Fix CoarsenedMatrix regarding illegal memory accesses
...
Need a reference to geom since the lambda copies the this pointer which points to host memory, see
- https://docs.nvidia.com/cuda/cuda-c-programming-guide/#star-this-capture
- https://devblogs.nvidia.com/new-compiler-features-cuda-8/
2020-08-24 17:46:47 +02:00
dd1ba266b2
Fix mapping between dir + disp and point in CMat
2020-08-24 17:46:46 +02:00
1292d59563
Add a typedef + broaden interface of CMat
2020-08-24 17:46:45 +02:00
b5e87e8d97
summit compile fixes
2020-06-12 18:16:12 -04:00
cdf0a04fc5
Merge branch 'develop' into sycl
2020-06-09 04:00:12 -04:00
1a4c8c3387
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
2020-06-05 18:52:35 -04:00
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile tiime option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
82f71643a4
Remove the norm in MdagM
2020-05-12 17:55:53 -04:00
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
e583035614
Change to interface to minise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
8f223962ff
Thread loop changed
2019-06-15 07:43:42 +01:00
c5e081d69c
Re-Merge branch 'develop' into feature/gpu-port
...
Pull in Regensburg MultiGrid pull request
2019-01-03 01:50:16 +00:00
8a1be021d3
Merge branch 'feature/wilsonmg' of https://github.com/DanielRichtmann/Grid into DanielRichtmann-feature/wilsonmg
2019-01-02 14:39:59 +00:00
35ed1defac
Passes make check now single and double compile
2018-12-19 11:09:32 +00:00
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
fb7d021b9d
Hadrons: moving Hadrons to root directory, build system improvements
2018-08-28 15:00:40 +01:00