Peter Boyle
|
bb8b6d9d73
|
Fix
|
2025-04-29 18:04:04 -04:00 |
|
Peter Boyle
|
677b4cc5b0
|
Make all tests compile
|
2025-04-24 20:33:26 -04:00 |
|
Peter Boyle
|
df6120e5f6
|
CPU compile oops fix
|
2025-04-24 14:50:06 -04:00 |
|
Peter Boyle
|
21de6f7da8
|
Merge pull request #477 from lehner/feature/wilson-clover-5d
Feature/wilson clover 5d
|
2025-04-24 14:44:48 -04:00 |
|
Peter Boyle
|
ab3de50d5e
|
Merge pull request #473 from UCL-ARC/gauge_action_deriv
WilsonGagueAction deriv
|
2025-04-24 14:39:10 -04:00 |
|
Peter Boyle
|
6a1c64fbdd
|
Merge pull request #470 from paboyle/specflow
Spectral flow, DWF/Mobius kernel measurement
|
2025-04-24 14:34:33 -04:00 |
|
Peter Boyle
|
233150d93f
|
Bug fix for no accelerator aware MPI, thanks Shuhei for finding it.
|
2025-04-24 11:40:46 -04:00 |
|
Chulwoo Jung
|
cee4c8ce8c
|
Merge branch 'develop' of https://github.com/paboyle/Grid into specflow
|
2025-04-18 19:55:36 +00:00 |
|
Christoph Lehner
|
96bf814d8c
|
Add checkerboarding to 5D compact clover
|
2025-04-10 23:05:39 +02:00 |
|
Christoph Lehner
|
7ddc422788
|
CompactWilsonClover5D
|
2025-04-10 23:05:29 +02:00 |
|
Peter Boyle
|
e652fc2825
|
Shared Memory test reenabled on every Grid object creation.
Const improvements in Accelerator.h
|
2025-04-07 11:51:40 -04:00 |
|
Peter Boyle
|
4f89f603ae
|
Changes to add back shared memory test on GPU
|
2025-04-04 18:40:15 -04:00 |
|
Peter Boyle
|
11dc2c5e1d
|
PVdagM initialise
|
2025-04-04 18:35:06 -04:00 |
|
Peter Boyle
|
3811d19298
|
Fence
|
2025-04-04 18:35:06 -04:00 |
|
Peter Boyle
|
83a3ab6b6f
|
Barrier -- not sure 100% this was needed
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
d66a9af6a3
|
No compile fix
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
adc90d3a86
|
NVLINK GET/PUT on cuda aware mpi
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
ebbd015c5c
|
Deprecate shared memory copy as direction matters on nvidia GPU
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
4ab73b36b2
|
Deprecate shared memory copy as direction matters on GPU
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
130e07a422
|
Non hermitian support
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
8f47bb367e
|
Shifted non herm
|
2025-04-04 18:35:05 -04:00 |
|
Peter Boyle
|
9eae8fca5d
|
Size outut
|
2025-04-04 18:35:05 -04:00 |
|
Mashy Green
|
e465fce201
|
Merge remote-tracking branch 'upstream/develop' into gauge_action_deriv
|
2025-03-24 10:12:42 +00:00 |
|
Christoph Lehner
|
fe66c7ca30
|
verbosity
|
2025-03-13 12:49:36 +00:00 |
|
Christoph Lehner
|
e9177e4af3
|
Blas compatibility
|
2025-03-13 08:48:23 +00:00 |
|
Christoph Lehner
|
d15a6c5933
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature-aurora
|
2025-03-13 07:29:55 +00:00 |
|
paboyle
|
25ab9325e7
|
Use hostVector but remove construct resize
|
2025-03-11 15:02:32 +00:00 |
|
paboyle
|
19f9378b98
|
Should work on Aurora nowb
|
2025-03-11 13:50:43 +00:00 |
|
Mashy Green
|
785bc7a14f
|
Adding staple zeroing fix
|
2025-03-10 12:29:04 +00:00 |
|
Mashy Green
|
1a1fe85428
|
Merge remote-tracking branch 'upstream' into gauge_action_deriv
|
2025-03-10 08:37:36 +00:00 |
|
Christoph Lehner
|
9ffd1ed4ce
|
Merged
|
2025-03-08 15:30:08 +00:00 |
|
Peter Boyle
|
3d014864e2
|
Makinig LLVM happy
|
2025-03-06 14:19:25 -05:00 |
|
paboyle
|
1d22841811
|
Working on aurora, GPT issue turned up is fixed
|
2025-03-06 03:20:18 +00:00 |
|
paboyle
|
6ae809ed40
|
Print not liked on GPT compile
|
2025-02-27 20:12:49 +00:00 |
|
Peter Boyle
|
311e2aab3f
|
Update Accelerator.h
|
2025-02-26 11:42:52 -05:00 |
|
paboyle
|
438dfbdb83
|
Only throw if there is a pending list entry in CommsComplete
|
2025-02-25 16:57:27 +00:00 |
|
paboyle
|
b2ce760cf4
|
Verbose issue with GPT
|
2025-02-25 16:55:23 +00:00 |
|
Mashy Green
|
717f647418
|
added the WilsonFlow patch from upstream PR #471
|
2025-02-24 08:41:31 +00:00 |
|
Mashy Green
|
98e7418187
|
Merge remote-tracking branch 'upstream/develop' into gauge_action_deriv
|
2025-02-24 08:33:05 +00:00 |
|
Mashy Green
|
d2dd8f54e2
|
Fixing after revering too much!
|
2025-02-17 17:32:27 +00:00 |
|
Mashy Green
|
7726ee4b16
|
Reverting whitespace changes
|
2025-02-17 17:16:28 +00:00 |
|
paboyle
|
ba9bbe0221
|
Bounce MPI through host
|
2025-02-12 19:34:59 +00:00 |
|
paboyle
|
4c3dd82d84
|
CSHIFT with bounce throuhgh Host memory on MPI packets
|
2025-02-12 19:09:53 +00:00 |
|
paboyle
|
44e911b5b7
|
Comment change
|
2025-02-12 17:37:55 +00:00 |
|
paboyle
|
a7a16df9d0
|
GET not put has kinder barrier sequence for NVLINK type access as when
GET is done, I can use it without barrier. Moves a barrier to a nicer
place, overlapped with DtoH DMA
|
2025-02-12 14:59:28 +00:00 |
|
paboyle
|
382e0abefd
|
Was issueing a double fence -- the gather also fences
|
2025-02-12 14:57:28 +00:00 |
|
paboyle
|
6fdefe5b90
|
Barrier sequencing if doing "GET" not "PUT" is different.
This is somewhat better timing for Barriers
|
2025-02-12 14:55:20 +00:00 |
|
paboyle
|
4788dd8e2e
|
More states in packet progression for GPU non aware MPI
|
2025-02-12 14:53:57 +00:00 |
|
paboyle
|
1cc5f221f3
|
GET not put ordering is better as I know when I've got all MY data
|
2025-02-12 14:53:05 +00:00 |
|
paboyle
|
93251bfba0
|
GET not put for better ordering in the downstream dependent kernels -- I
know when I'm done, so we can move a barrier / handshake between ranks
intranode to a point off critical path
|
2025-02-12 14:50:21 +00:00 |
|