Peter Boyle
|
3aff64dddb
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2023-04-11 12:19:15 -07:00 |
|
Peter Boyle
|
b4f2ca81ff
|
Copy queue and compute queue same as better concurrency
|
2023-04-11 12:18:21 -07:00 |
|
Peter Boyle
|
d1dea5f840
|
New driver
|
2023-04-11 12:16:52 -07:00 |
|
Peter Boyle
|
54f8b84d16
|
Fence
|
2023-04-11 12:16:08 -07:00 |
|
Peter Boyle
|
da503fef0e
|
Name change on barrier routine
|
2023-04-11 12:14:04 -07:00 |
|
Peter Boyle
|
4a6802098a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2023-04-07 15:43:28 -04:00 |
|
Peter Boyle
|
f9b41a84d2
|
Trajectory runs to completion on Crusher within wall clock time
|
2023-04-07 15:42:45 -04:00 |
|
|
4072408b6f
|
Update README.md
|
2023-04-07 11:45:28 +01:00 |
|
|
bd76b47fbf
|
Update CI badge in README
|
2023-04-07 11:44:48 +01:00 |
|
|
18ce23aa75
|
Fix NEON SIMD
|
2023-04-06 11:30:48 +01:00 |
|
Peter Boyle
|
ffa7fe0cc2
|
Merge branch 'feature/dirichlet' into develop
|
2023-04-04 23:13:52 -04:00 |
|
Peter Boyle
|
6b979f0a69
|
Dirichlet improvements that I failed to commit
|
2023-04-04 23:13:17 -04:00 |
|
Peter Boyle
|
86dac5ff4f
|
Better printing
|
2023-04-04 07:42:19 -07:00 |
|
Peter Boyle
|
4a382fad3f
|
Use distinct SYCL queue for copies
|
2023-04-04 07:41:41 -07:00 |
|
Peter Boyle
|
cc753670d9
|
Barrier elimination, surface list build
|
2023-04-04 07:39:14 -07:00 |
|
Peter Boyle
|
cc9d88ea1c
|
Fence changes and EXT kernel loop cout reduction
|
2023-04-04 07:37:23 -07:00 |
|
Peter Boyle
|
b281b0166e
|
Put the barrier in the subroutine
|
2023-04-04 07:36:03 -07:00 |
|
Peter Boyle
|
6a21f694ff
|
Apply barrier in Gather kernel sequence.
Could place before comms, or in Gather, but decided to insist Gather means Gather is done
|
2023-04-04 07:33:24 -07:00 |
|
Peter Boyle
|
fc4db5e963
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2023-04-03 18:26:11 -04:00 |
|
Peter Boyle
|
6252ffaf76
|
No unified
|
2023-04-03 18:25:22 -04:00 |
|
Peter Boyle
|
af64c1c6b6
|
Had managed to drop the accelerator_barrier() in the Wilson Compressor gather
|
2023-03-30 17:34:44 -04:00 |
|
Peter Boyle
|
866f48391a
|
Temporary fix for develop incorrect results
|
2023-03-30 17:10:13 -04:00 |
|
Peter Boyle
|
a4df527d74
|
Merge pull request #428 from mmphys/bugfix/comm_none
Fixes for --enable-comms=none
|
2023-03-30 08:38:14 -04:00 |
|
Michael Marshall
|
5764d21161
|
Fixes for --enable-comms=none
|
2023-03-30 10:15:28 +01:00 |
|
Peter Boyle
|
496d04cd85
|
Weaken the Fence
|
2023-03-29 18:58:51 -04:00 |
|
Peter Boyle
|
10e6d7c6ce
|
Merge branch 'feature/dirichlet' into develop
|
2023-03-29 16:26:47 -04:00 |
|
Peter Boyle
|
c42e25e5b8
|
Dirichlet remove
|
2023-03-29 16:25:52 -04:00 |
|
Peter Boyle
|
a00ae981e0
|
Fence propagation from SYCL
|
2023-03-29 15:00:40 -04:00 |
|
Peter Boyle
|
58e020b62a
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2023-03-29 14:37:40 -04:00 |
|
Peter Boyle
|
a7e1aceeca
|
Compile fix on Nvidia
|
2023-03-29 14:36:50 -04:00 |
|
Peter Boyle
|
7212432f43
|
More careful fencing
|
2023-03-28 20:10:22 -07:00 |
|
Peter Boyle
|
4a261fab30
|
Changes premerge to develop
|
2023-03-28 20:04:21 -07:00 |
|
Peter Boyle
|
6af97069b9
|
Preparing for close of feature/dirichlet
Initial code change review complete
|
2023-03-28 13:39:44 -07:00 |
|
Peter Boyle
|
5068413cdb
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2023-03-28 08:35:38 -07:00 |
|
Peter Boyle
|
71c6960eea
|
Commet
|
2023-03-28 08:34:24 -07:00 |
|
Peter Boyle
|
ddf6d5c9e3
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2023-03-28 11:33:05 -04:00 |
|
Peter Boyle
|
900e01f49b
|
Temporary
|
2023-03-27 21:35:06 -07:00 |
|
Peter Boyle
|
2376156fbc
|
Merge branch 'develop' into feature/dirichlet
|
2023-03-27 21:33:50 -07:00 |
|
Peter Boyle
|
3f2fd49db4
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2023-03-27 17:29:54 -07:00 |
|
Peter Boyle
|
0efa107cb6
|
Script update
|
2023-03-27 17:29:43 -07:00 |
|
Peter Boyle
|
8feedb4f6f
|
Include files moved
|
2023-03-27 17:29:21 -07:00 |
|
Peter Boyle
|
05e562e3d7
|
Move the copy synch out to stencil and do one per call instead of one per packet
|
2023-03-27 17:28:38 -07:00 |
|
Peter Boyle
|
dd3bbb8fa2
|
MOve the synchronise out to the stencil so one call instead of one call per packet
|
2023-03-27 17:27:45 -07:00 |
|
Peter Boyle
|
2fbcf13c46
|
SYCL fix
|
2023-03-27 14:25:14 -07:00 |
|
Peter Boyle
|
4ea48ef0c4
|
Merge pull request #419 from lehner/feature/gpt
Separate rankSum from sum
|
2023-03-24 15:42:16 -04:00 |
|
Peter Boyle
|
5c85774ee3
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2023-03-24 15:40:57 -04:00 |
|
Peter Boyle
|
d8a9a745d8
|
stream synchronise
|
2023-03-24 15:40:30 -04:00 |
|
Peter Boyle
|
dcf172da3b
|
Merge pull request #415 from paboyle/feature/block_lanczos22
Feature/block lanczos22
|
2023-03-24 12:08:16 -04:00 |
|
Peter Boyle
|
d57ed25071
|
Merge branch 'feature/dirichlet' into feature/block_lanczos22
|
2023-03-24 12:08:09 -04:00 |
|
Peter Boyle
|
546be724e7
|
Merge pull request #421 from UniOfLeicester/feature/accel_Copy_plane
Populate the Cshift_table in the GPU
|
2023-03-24 12:04:06 -04:00 |
|