9514035b87
refactor slicesum: slicesum uses GPU version by default now
2024-02-09 13:02:28 -05:00
ab2de131bd
work towards sliceSum for sycl backend
2024-02-06 13:24:45 -05:00
b8b9dc952d
Async memcpy's and cleanup
2024-02-01 17:55:35 -05:00
79a6ed32d8
Use accelerator_for2d and DeviceSegmentedRecude to avoid kernel launch latencies
2024-02-01 16:41:03 -05:00
caa5f97723
Add sliceSum gpu using cub/hipcub
2024-01-31 16:50:06 -05:00
c5f1420dea
Merge remote-tracking branch 'LupoA/develop' into LupoA-develop
2023-10-02 16:22:35 -04:00
018e6da872
Merge pull request #440 from giltirn/feature/paddedcellgauge
...
Feature/paddedcellgauge
2023-10-02 10:00:42 -04:00
b8a7004365
Partial fraction test
2023-08-14 15:17:03 -04:00
f7b79cdd45
Added test for ProjectSpn
2023-07-03 18:00:32 +01:00
b92428f05f
better test
2023-07-02 13:34:03 +01:00
34b11864b6
prettiest tests
2023-07-02 13:25:57 +01:00
f44dce390f
Implemented acclerator-optimized versions of localCopyRegion and insertSliceLocal to speed up padding
...
Fixed const correctness on PaddedCell methods
Fixed compile issues on Crusher
Added timing breakdowns for PaddedCell::Expand and the padded implementations of the staples, visible under --log Performance
Optimized kernel for StaplePadded
Test_iwasaki_action_newstaple now repeats the calculation 10 times and reports average timings
2023-06-27 14:58:10 -04:00
6f6844ccf1
Added new StapleAll and RectStapleAll functions that return the staples for all mu as an array
...
Modified plaq+rectangle gauge actions to use the above
Added a test code to confirm the above changes
2023-06-26 15:48:47 -04:00
4c6613d72c
Modified RectStapleDouble and RectStapleOptimised to use Gauge-BC respecting CshiftLink
...
Added test code tests/debug/Test_optimized_staple_gaugebc demonstrating equivalence of above to RectStapleUnoptimised for cconj gauge BCs
Removed optimized staple only being used for periodic gauge BCs; it is now always used
2023-06-26 10:20:23 -04:00
cff1f8d3b8
rm unused variables and formatting
2023-06-23 16:04:18 +01:00
f27d2083cd
adjustments in SUn and Sp2n impl
2023-06-23 15:34:08 +01:00
de30c4e22a
minor improvements
2023-06-23 10:49:41 +01:00
4241c7d4a3
Imported coalescedReadGeneralPermute GPU implementation from Christoph
...
Fixed bug in padded staple code where extract was being called on the result before the GPU view was closed
Fixed compile issue with pointer cast in padded staple code
Added timing summaries of padded staple code and timing breakdown of staple implementation to Test_padded_cell_staple
2023-06-21 16:01:01 -04:00
7b11075102
The user can now specify the implementation of Cshift used by the PaddedCell class through a virtual base class API. Implementations for default (regular Cshift) and for gauge links (which respects the gauge BCs)
...
Fixed const-correctness for PaddedCell and ConjugateGimpl::setDirections
Modified test code for padded-cell implementation of staple, rect-staple to use cconj BCs
2023-06-20 17:09:56 -04:00
abc658dca5
Added coalescedReadGeneralPermute CPU implementation based on Christoph's GPT code
...
In a test code, implemented a padded-cell version of the staple and rectangular-staple calculation
2023-06-20 16:14:25 -04:00
2372275b2c
Merge pull request #36 from LupoA/sp2n/gpu-bugfix
...
Sp2n/gpu bugfix [close #30 ]
2023-06-20 13:46:00 +01:00
5e539e2d54
Forgot some follow-ups on changed signature
2023-06-18 12:37:51 +01:00
621e612c30
Fix non-zero ret on device bug
2023-06-16 16:27:49 +01:00
8c3792721b
ClangFormat
2023-06-16 15:58:23 +01:00
c797cbe737
deal with post-merge trauma
2023-06-16 14:20:37 +01:00
e09dfbf1c2
definetely the right merge upstream/develop
2023-06-16 14:19:46 +01:00
f1c358b596
Additional tests
2023-06-15 10:43:04 -04:00
b0646ca187
Remove some unused variables
2023-06-15 15:09:09 +01:00
5465961e30
New test for FTHMC portion
2023-06-01 06:14:04 -04:00
fe88a0c12f
cleaner twoindex class, cleaner tests
2023-05-26 16:55:30 +01:00
de8daa3824
group is SUn by default
2023-05-26 13:44:41 +01:00
be1a4f5860
implement TwoIndexSymm for sp2n
2023-05-22 17:21:03 +01:00
5897b93dd4
debug tests, fix dimension
2023-05-22 13:42:21 +01:00
af091e0881
DimensionHelper for 2index irreps
2023-05-21 16:56:06 +01:00
a5125e23f4
Typo
2023-05-18 15:41:35 +01:00
932c783fbf
2AS for every Nc!
2023-05-17 20:22:05 +01:00
55f9cce577
Revert "Added automated HMC test for Nc=4"
...
This reverts commit eee27b8b30
.
2023-05-17 09:17:48 +01:00
b3533ca847
correct tests (failing)
2023-05-16 17:43:52 +01:00
fd2a637010
test 2index
2023-05-16 14:10:39 +01:00
eee27b8b30
Added automated HMC test for Nc=4
2023-05-15 18:37:33 +01:00
8522352aa3
ClangFormat
2023-05-15 18:36:05 +01:00
e8ad1fef53
Unify TwoIndex
2023-05-12 14:35:50 +01:00
9c8750f261
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-05-11 12:29:09 -04:00
91efd08179
Option for Qlat generator basis
2023-05-11 12:27:45 -04:00
1b8a834beb
Debug
2023-05-11 12:22:24 -04:00
aa9df63a05
rename group projections based on determinants
2023-05-10 14:50:52 +01:00
6e62f4f616
ClangFormat
2023-05-03 16:15:12 +01:00
6a7bdca53b
Take over additional algebra tests from Alessandro
2023-05-03 16:02:02 +01:00
c7fba9aace
Take over additional group tests from Alessandro
2023-05-03 16:01:48 +01:00
ac6c7cb8d6
Merge in Alessandro's changes [test fails]
2023-05-03 02:53:03 +01:00