mirror of https://github.com/paboyle/Grid.git synced 2025-06-18 07:47:06 +01:00
Commit Graph

119 Commits

Author SHA1 Message Date
72641211cd Merge branch 'paboyle:develop' into feature/omp-offload 2023-12-14 15:31:39 -05:00
f516acda5f fixed conflicts; su3 working 2023-12-04 17:20:17 -05:00
7a7aa61d52 cleaned up 2023-12-04 16:37:28 -05:00
867abeaf8e removed print flags 2023-12-04 15:12:03 -05:00
33097681b9 FTHMC compiled and merged to develop 2023-10-14 00:42:55 +03:00
9626a2c7c0 Asynch handling 2023-10-13 18:21:56 +03:00
e5bc51779a edited readme amd stack err 2023-09-08 21:31:08 -04:00
1bda8c47fa fixed conflicts after merging pabyle develop 2023-07-03 12:35:37 -04:00
2100cc6497 fixed conflicts after merging pabyle develop 2023-07-03 11:46:37 -04:00
cb277ae516 added file line traces 2023-07-03 11:18:13 -04:00
b4f2ca81ff Copy queue and compute queue same as better concurrency 2023-04-11 12:18:21 -07:00
da503fef0e Name change on barrier routine 2023-04-11 12:14:04 -07:00
3671ace5a1 added omp allocators and dev copies 2023-04-04 12:10:03 -04:00
4a382fad3f Use distinct SYCL queue for copies 2023-04-04 07:41:41 -07:00
af64c1c6b6 Had managed to drop the accelerator_barrier() in the Wilson Compressor gather 2023-03-30 17:34:44 -04:00
866f48391a Temporary fix for develop incorrect results 2023-03-30 17:10:13 -04:00
496d04cd85 Weaken the Fence 2023-03-29 18:58:51 -04:00
b5b759df73 Merge branch 'develop' into feature/dirichlet 2023-03-21 16:05:46 -04:00
861e5d7f4c SYCL version update. Why do they keep making incompatible changes 2023-03-14 12:10:02 -07:00
f6661ce29b Merged openmp offload implementation with develop 2022-12-13 18:32:55 -05:00
c33a3b3b40 Fixed --accelerator-threads input to omp target thread_limit() 2022-12-13 15:13:11 -08:00
03508448f8 Remove verbose 2022-10-04 11:12:15 -07:00
1177b8f661 Merge branch 'develop' into feature/dirichlet 2022-08-31 19:05:57 -04:00
95b640cb6b 10TF/s on 32^3 x 64 on single node 2022-08-04 15:43:52 -04:00
2cb5bedc15 Copy stream HIP improvements 2022-08-04 15:24:03 -04:00
188d2c7a4d PVC default, ignore ATS 2022-08-02 08:38:53 -07:00
84110166e4 Fix the fence 2022-08-02 08:00:43 -07:00
d32b923b6c Fencing on a stream in SYCL is needed. Didn't know that ... gulp 2022-08-02 07:58:04 -07:00
5f8892bf03 Mistake pointed out by Camilo 2022-07-19 09:31:51 -07:00
f14e7e51e7 Grid accelerator 2022-07-12 10:56:22 -07:00
3544965f54 Stream doesn't work 2022-07-07 17:49:20 +01:00
bd99fd608c Introduce a non-default stream for compute operatoins 2022-07-01 09:42:53 -04:00
136d843ce7 Crusher updates 2022-05-25 12:36:09 -04:00
5012adfebf Merge branch 'develop' into feature/dirichlet 2022-04-05 16:26:19 -04:00
92a83a9eb3 Performance improve for Tesseract 2022-03-16 17:14:36 +00:00
5340e50427 HMC running with new formulation 2022-03-01 17:10:25 -05:00
e16fc5b2e4 Threaded intranode comms transfer - ideally between NUMA domains 2022-03-01 11:17:24 -05:00
86f4e17928 Changing thread block order and adding launch_bounds 2022-02-07 11:29:37 -06:00
7f7d06d963 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2021-12-07 09:06:42 -08:00
2bf3b4d576 Update to reduce memory footpring in benchmark test 2021-12-07 09:02:02 -08:00
6ceb556684 Intranode asynch hipMemCopy 2021-11-22 20:45:12 -05:00
76cde73705 HIP improvements on messaging and intranode hipMemCopyAsynch 2021-11-22 20:44:39 -05:00
16c2a99965 Overlap cudamemcpy - didn't set up stream right 2021-10-11 13:31:26 -07:00
ab6ea29913 Print removal 2021-10-05 20:13:25 -04:00
8ed0b57b09 Memory verbose and tracking, shrink default cache; Print PCI device IDs on node 0 2021-10-05 11:41:03 -04:00
c2f8ba194e Working simple OpenMP offloading with cudaMallocManaged; cshift not working 2021-09-29 15:23:13 -07:00
712b326e40 Added OpenMP target offloading support 2021-09-27 19:00:18 -04:00
3206f69478 SYCL happy 2021-09-21 18:01:35 -07:00
8eb1232683 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2021-09-21 09:25:07 -07:00
b3b033d343 Clean 2021-09-21 09:18:54 -07:00