Peter Boyle
436bf1d9d3
Merge pull request #455 from clarkedavida/hisq_fat_links
...
Hisq fat links
2024-02-29 15:29:39 -05:00
david clarke
f70df6e195
changed NO_SHIFT and BACKWARD_CONST from define to enum
2024-02-29 12:29:30 -07:00
Peter Boyle
ee1b8bbdbd
Merge pull request #454 from edbennett/adjoint-broke
...
fix HMC for non-fundamental representations
2024-02-28 14:05:27 -05:00
david clarke
b02d022993
fixed race condition (thx michael)
2024-02-23 17:14:28 -07:00
david clarke
94581e3c7a
accelerator_for is broken
2024-02-23 15:58:33 -07:00
david clarke
88b52cc045
Merge branch 'develop' into hisq_fat_links
2024-02-23 14:47:15 -07:00
Christoph Lehner
66391f84f2
Merge branch 'feature/gpt' of ../Grid into develop
2024-02-21 19:05:00 +01:00
Ed Bennett
97f7a9ecb3
fix HMC for non-fundamental representations
2024-02-21 08:27:55 +00:00
david clarke
56827d6ad6
accelerator_inline bug
2024-02-14 13:56:57 -07:00
62055e04dd
missing semicolon generates error with some compilers
2024-02-13 18:18:27 +01:00
david clarke
db420525b3
fix Simd::Nsimd typo
2024-02-12 15:03:53 -07:00
david clarke
2da09ae99b
acceleration compiles and doesn't break scalar mode
2024-02-06 18:40:13 -07:00
david clarke
a38fb0e04a
first effort toward accelerators
2024-02-06 18:24:55 -07:00
david clarke
0a6e2f42c5
small amount of cleanup
2024-02-06 16:32:07 -07:00
david clarke
4924b3209e
projectU3 yields a unitary matrix
2024-01-23 14:43:58 -07:00
david clarke
00f24f8765
already found some bugs in projection, still needs testing
2024-01-22 05:50:16 -07:00
david clarke
f5b3d582b0
first attempt at U3 projection
2024-01-22 02:49:40 -07:00
david clarke
c020b78e02
Merge branch 'develop' into hisq_fat_links
2024-01-21 20:21:08 -07:00
Peter Boyle
f48298ad4e
Bug fix
2023-12-11 20:57:02 -05:00
Peter Boyle
d1d9827263
Integrator logging update
2023-12-08 12:14:00 -05:00
david clarke
9cd4128833
fix naik bug
2023-11-03 14:11:38 -06:00
david clarke
c8b17c9526
Naik to CShift
2023-11-02 12:43:22 -06:00
david clarke
2ae2a81e85
attempt to fix Naik
2023-10-31 13:54:55 -06:00
david clarke
69c869d345
fixed stupid typo
2023-10-30 17:41:52 -06:00
david clarke
df9b958c40
naik now returns separately
2023-10-30 17:40:53 -06:00
david clarke
3d3376d1a3
LePage works, trying Naik
2023-10-27 16:26:31 -06:00
david clarke
21ed6ac0f4
added floating-point support
2023-10-20 13:54:26 -06:00
david clarke
7bb8ab7000
improve smearing templating
2023-10-20 08:41:02 -06:00
david clarke
2c824c2641
Merge branch 'develop' into hisq_fat_links
2023-10-17 16:03:59 -06:00
david clarke
391fd9cc6a
try lepage term
2023-10-17 14:57:15 -06:00
Peter Boyle
33097681b9
FTHMC compiled and merged to develop
2023-10-14 00:42:55 +03:00
Peter Boyle
ffc0639cb9
Running in HMC tests
2023-10-13 18:21:56 +03:00
david clarke
bf4369f72d
clean up HISQSmear with decltypes
2023-10-12 12:41:06 -06:00
david clarke
36600899e2
working 7-link; Grid_log; generalShift
2023-10-12 11:11:39 -06:00
david clarke
b9c70d156b
Merge branch 'develop' into hisq_fat_links
2023-10-10 22:44:17 -06:00
david clarke
eb89579fe7
Merge remote-tracking branch 'origin/develop' into develop
2023-10-10 22:43:51 -06:00
david clarke
0cfd13d18b
7-link working
2023-10-10 22:41:52 -06:00
Peter Boyle
d93eac7b1c
Performance regressed and is OK in icpx 2023.2
2023-10-03 15:53:14 +00:00
Peter Boyle
afc316f501
Rename headers
2023-10-02 16:25:11 -04:00
Peter Boyle
f14bfd5c1b
Relocate sub includes
2023-10-02 16:23:38 -04:00
Peter Boyle
c5f1420dea
Merge remote-tracking branch 'LupoA/develop' into LupoA-develop
2023-10-02 16:22:35 -04:00
Peter Boyle
018e6da872
Merge pull request #440 from giltirn/feature/paddedcellgauge
...
Feature/paddedcellgauge
2023-10-02 10:00:42 -04:00
david clarke
d247031c98
try 7-link
2023-09-16 23:18:16 -06:00
david clarke
99d879ea7f
5-link first attempt
2023-08-11 22:56:30 -06:00
Alessandro Lupo
075b9d22d0
adjoint rep implemented as 2indx symmetric
2023-07-02 13:58:31 +01:00
Alessandro Lupo
34b11864b6
prettiest tests
2023-07-02 13:25:57 +01:00
Christopher Kelly
1dfaa08afb
The stencils for the staple and rect-staple padded cell implementations are now created and stored by workspace classes that allow for reuse providing the grids remain consistent
...
The workspaces are now used by the plaq+rectangle gauge action resulting in a further 2x performance improvement as measured on a 16^4 local volume for 2 nodes (16 ranks) of Crusher
2023-06-28 15:11:24 -04:00
david clarke
9d263d9a7d
fix bug in HISQSmearing; move benchmark b/c i don't understand how makefiles work
2023-06-28 10:05:34 -06:00
david clarke
9015c229dc
add benchmark to see whether matrix multiplication is slower than read from object
2023-06-27 21:28:26 -06:00
Christopher Kelly
f44dce390f
Implemented acclerator-optimized versions of localCopyRegion and insertSliceLocal to speed up padding
...
Fixed const correctness on PaddedCell methods
Fixed compile issues on Crusher
Added timing breakdowns for PaddedCell::Expand and the padded implementations of the staples, visible under --log Performance
Optimized kernel for StaplePadded
Test_iwasaki_action_newstaple now repeats the calculation 10 times and reports average timings
2023-06-27 14:58:10 -04:00