33097681b9
FTHMC compiled and merged to develop
2023-10-14 00:42:55 +03:00
1177b8f661
Merge branch 'develop' into feature/dirichlet
2022-08-31 19:05:57 -04:00
2cb5bedc15
Copy stream HIP improvements
2022-08-04 15:24:03 -04:00
d32b923b6c
Fencing on a stream in SYCL is needed. Didn't know that ... gulp
2022-08-02 07:58:04 -07:00
f14e7e51e7
Grid accelerator
2022-07-12 10:56:22 -07:00
bd99fd608c
Introduce a non-default stream for compute operatoins
2022-07-01 09:42:53 -04:00
136d843ce7
Crusher updates
2022-05-25 12:36:09 -04:00
76cde73705
HIP improvements on messaging and intranode hipMemCopyAsynch
2021-11-22 20:44:39 -05:00
cfe9e870d3
Stream
2021-10-15 20:46:44 +01:00
8ed0b57b09
Memory verbose and tracking, shrink default cache
...
Print PCI device IDs on node 0
2021-10-05 11:41:03 -04:00
814d5abc7e
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-21 04:05:51 +02:00
1fb6aaf150
Device 2 Device with cudaMemcpy
2021-09-21 01:03:07 +02:00
ea7126496d
Merge pull request #361 from edbennett/fix-setdevice-message
...
make message about setdevice consistent with configure script
2021-09-16 10:23:37 -04:00
50181f16e5
Level 0 IPC set up
2021-08-10 05:35:15 -07:00
323cf6c038
make message consistent with configure script
2021-06-23 17:00:43 +01:00
4d1ea15c79
More verbosity. The 16bit limit on Grid.y, Grid.z is annoying
2021-03-09 04:29:37 +01:00
eda9ab487b
MADWF 5d source option for hadrons - look at Grid of source
...
Abort on GPU error
2021-02-08 10:47:22 -05:00
4ea8d128c2
Merge pull request #18 from paboyle/develop
...
Sync
2020-11-20 15:36:50 +01:00
6e313575be
Use of default GPU is behaviour, not a system property. Move Summit specific to configure.ac
2020-11-13 03:50:16 +01:00
80fd6ab407
Merge pull request #17 from paboyle/develop
...
sync upstream
2020-10-06 09:01:39 +02:00
81441e98f4
HIP runs sensible
2020-09-16 03:35:03 +01:00
4677c40195
HIP improvements
2020-09-16 00:32:27 +01:00
2a75516330
state MPI/SLURM message only on world_rank zero
2020-08-26 12:34:17 -04:00
1efe30d6cc
SLurm stop nodes using same GPU
2020-08-21 02:02:53 +02:00
11bc1aeadc
TThread count defaultt to fastest
2020-06-19 14:30:35 -04:00
66005929af
Set up the cache size on all ranks
2020-06-19 12:50:54 -04:00
2b1e259441
Decode of SYCL devices fix
2020-06-04 17:16:55 -07:00
f39c2a240b
Priintinig and device memory size detection
2020-06-04 14:58:03 -04:00
e93e12b6a4
More verbose SYCL setup
2020-06-03 09:12:11 -04:00
22c5168d70
Sycl happier
2020-05-25 08:35:56 -07:00
32be2b13d3
Updates for HiP
2020-05-24 14:00:55 -04:00
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00