Alessandro Lupo
b92428f05f
better test
2023-07-02 13:34:03 +01:00
Alessandro Lupo
34b11864b6
prettiest tests
2023-07-02 13:25:57 +01:00
Christopher Kelly
1dfaa08afb
The stencils for the staple and rect-staple padded cell implementations are now created and stored by workspace classes that allow for reuse providing the grids remain consistent
...
The workspaces are now used by the plaq+rectangle gauge action resulting in a further 2x performance improvement as measured on a 16^4 local volume for 2 nodes (16 ranks) of Crusher
2023-06-28 15:11:24 -04:00
david clarke
9d263d9a7d
fix bug in HISQSmearing; move benchmark b/c i don't understand how makefiles work
2023-06-28 10:05:34 -06:00
david clarke
9015c229dc
add benchmark to see whether matrix multiplication is slower than read from object
2023-06-27 21:28:26 -06:00
Christopher Kelly
f44dce390f
Implemented accelerator-optimized versions of localCopyRegion and insertSliceLocal to speed up padding
...
Fixed const correctness on PaddedCell methods
Fixed compile issues on Crusher
Added timing breakdowns for PaddedCell::Expand and the padded implementations of the staples, visible under --log Performance
Optimized kernel for StaplePadded
Test_iwasaki_action_newstaple now repeats the calculation 10 times and reports average timings
2023-06-27 14:58:10 -04:00
Christopher Kelly
bb71e9a96a
Added PaddedCell and GeneralisedLocalStencil header includes to standard base headers
...
Moved versions of the padded-cell implementations of staple and rect-staple from test code to WilsonLoops header
Added StapleAndRectStapleAll which is now called by the plaq+rectangle action class. Under the hood it uses the padded cell implementations with maximal reuse of the padded gauge links
2023-06-27 11:23:30 -04:00
78bae9417c
returning Nstop vectors even if not all meet true convergence criterion
2023-06-27 14:38:19 +01:00
dd170ead01
whitespace
2023-06-27 11:37:01 +01:00
014704856f
do one more iteration if not all vectors converged
2023-06-27 11:33:30 +01:00
david clarke
a7eabaad56
rudimentary appendShift convenience method, which allows the user to append an arbitrary shift in one line
2023-06-26 23:59:28 -06:00
david clarke
eeb4703b84
develop wrappers to make the stencils easier to construct
2023-06-26 17:45:35 -06:00
david clarke
a07421b3d3
Merge branch 'develop' into hisq_fat_links
2023-06-26 13:51:32 -06:00
david clarke
cda53b4068
Merge remote-tracking branch 'origin/develop' into develop
2023-06-26 13:51:06 -06:00
Christopher Kelly
6f6844ccf1
Added new StapleAll and RectStapleAll functions that return the staples for all mu as an array
...
Modified plaq+rectangle gauge actions to use the above
Added a test code to confirm the above changes
2023-06-26 15:48:47 -04:00
Christopher Kelly
4c6613d72c
Modified RectStapleDouble and RectStapleOptimised to use Gauge-BC respecting CshiftLink
...
Added test code tests/debug/Test_optimized_staple_gaugebc demonstrating equivalence of above to RectStapleUnoptimised for cconj gauge BCs
Removed the restriction that the optimized staple is used only for periodic gauge BCs; it is now always used
2023-06-26 10:20:23 -04:00
Peter Boyle
ee92e08edb
Merge pull request #435 from fjosw/fix/warnings_in_WilsonKernelsImplementation
...
Unused variable in WilsonKernelsImplementation
2023-06-23 11:47:19 -04:00
Peter Boyle
c1dcee9328
Merge pull request #437 from fjosw/fix/stencil_debug
...
Added GridLogDebug to BuildSurfaceList debug message
2023-06-23 11:47:00 -04:00
Alessandro Lupo
559257bbe9
better documentation and filelist names
2023-06-23 16:16:48 +01:00
Peter Boyle
6b150961fe
Better script
2023-06-23 18:09:25 +03:00
Alessandro Lupo
cff1f8d3b8
rm unused variables and formatting
2023-06-23 16:04:18 +01:00
Alessandro Lupo
f27d2083cd
adjustments in SUn and Sp2n impl
2023-06-23 15:34:08 +01:00
Christopher Kelly
36cc9c524f
Threaded the constructor of GeneralLocalStencil
2023-06-23 09:57:38 -04:00
Alessandro Lupo
2822487450
rm unnecessary line
2023-06-23 14:55:23 +01:00
Alessandro Lupo
e07fafe46a
minor adjustments to twoindex
2023-06-23 12:18:04 +01:00
Alessandro Lupo
063d290bd8
missing function
2023-06-23 11:11:20 +01:00
Alessandro Lupo
4e6194d92a
Avoid code duplication in ProjectSUn
2023-06-23 11:03:50 +01:00
Alessandro Lupo
de30c4e22a
minor improvements
2023-06-23 10:49:41 +01:00
david clarke
df99f227c1
include missing staple orientations; invert path direction, which was backwards
2023-06-22 14:57:10 -06:00
Peter Boyle
5bafcaedfa
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-06-22 19:59:45 +03:00
Peter Boyle
bfeceae708
FTHMC
2023-06-22 12:58:18 -04:00
Peter Boyle
eacb66591f
Config command
2023-06-22 19:56:40 +03:00
Peter Boyle
fadaa85626
Update
2023-06-22 19:56:27 +03:00
Peter Boyle
02a5b0d786
Updating run during testing
2023-06-22 19:52:46 +03:00
Peter Boyle
0e2141442a
Dennis says broken
2023-06-22 19:19:51 +03:00
Peter Boyle
769eb0eecb
Precision coverage
2023-06-22 19:19:20 +03:00
Christopher Kelly
4241c7d4a3
Imported coalescedReadGeneralPermute GPU implementation from Christoph
...
Fixed bug in padded staple code where extract was being called on the result before the GPU view was closed
Fixed compile issue with pointer cast in padded staple code
Added timing summaries of padded staple code and timing breakdown of staple implementation to Test_padded_cell_staple
2023-06-21 16:01:01 -04:00
david clarke
d536c67b9d
add HISQSmearing to Smearing.h
2023-06-20 16:04:48 -06:00
david clarke
f44f005dad
rename _lvl1 --> _linkTreatment
2023-06-20 15:48:27 -06:00
david clarke
26b2caf570
add template parameter to Smear_HISQ_fat for MILC interfacing
2023-06-20 15:37:54 -06:00
Christopher Kelly
7b11075102
The user can now specify the implementation of Cshift used by the PaddedCell class through a virtual base class API. Implementations are provided for the default (regular Cshift) and for gauge links (which respects the gauge BCs)
...
Fixed const-correctness for PaddedCell and ConjugateGimpl::setDirections
Modified test code for padded-cell implementation of staple, rect-staple to use cconj BCs
2023-06-20 17:09:56 -04:00
Christopher Kelly
abc658dca5
Added coalescedReadGeneralPermute CPU implementation based on Christoph's GPT code
...
In a test code, implemented a padded-cell version of the staple and rectangular-staple calculation
2023-06-20 16:14:25 -04:00
david clarke
8bb078db25
Merge branch 'develop' into hisq_fat_links
2023-06-20 13:05:00 -06:00
david clarke
b61ba40023
Merge remote-tracking branch 'origin/develop' into develop
2023-06-20 13:04:53 -06:00
Christoph Lehner
452bf2e907
Accelerator basisRotate also on HIP
2023-06-20 20:36:24 +03:00
Alessandro Lupo
2372275b2c
Merge pull request #36 from LupoA/sp2n/gpu-bugfix
...
Sp2n/gpu bugfix [close #30 ]
2023-06-20 13:46:00 +01:00
chillenzer
ef736e8aa4
Merge pull request #35 from LupoA/sp2n/enableSp
...
consistent enable sp config flag
2023-06-20 10:41:09 +00:00
Julian Lenz
5e539e2d54
Forgot some follow-ups on changed signature
2023-06-18 12:37:51 +01:00
Julian Lenz
96773f5254
Apparently forgot to remove one Lattice version
2023-06-18 12:21:39 +01:00
Alessandro Lupo
d80df09f3b
consistent enable sp config flag
2023-06-16 19:16:46 +01:00