Peter Boyle
d3496d2fe0
Merge pull request #397 from giltirn/feature/dirichlet-gparity-stage
...
Gparity HMC import round 2
2022-05-25 13:29:45 -04:00
Christopher Kelly
6121397587
Imported changes from feature/gparity_HMC branch:
...
Added storage of final true residual in mixed-prec CG and enhanced log output
Fixed const correctness of multi-shift constructor
Added a mixed precision variant of the multi-shift algorithm that uses a single precision operator and applies periodic reliable update to the residual
Added tests/solver/Test_dwf_multishift_mixedprec to test the above
Fixed local coherence lanczos using the (large!) max approx to the chebyshev eval as the scale from which to judge the quality of convergence, resulting a test that always passes
Added a method to local coherence lanczos class that returns the fine eval/evec pair
Added iterative log output to power method
Added optional disabling of the plaquette check in Nerscio to support loading old G-parity configs which have a factor of 2 error in the plaquette
G-parity Dirac op no longer allows GPBC in the time direction; instead we toggle between periodic and antiperiodic
Replaced thread_for G-parity 5D force insertion implementation with accelerator_for version capable of running on GPUs
Generalized tests/lanczos/Test_dwf_lanczos to support regular DWF as well as Gparity, with the action chosen by a command line option
Modified tests/forces/Test_dwf_gpforce,Test_gpdwf_force,Test_gpwilson_force to use GPBC a spatial direction rather than the t-direction, and antiperiodic BCs for time direction
tests/core/Test_gparity now supports using APBC in time direction using command line toggle
2022-05-09 16:27:57 -04:00
Peter Boyle
0417b96896
Merge pull request #391 from giltirn/feature/dirichlet-gparity-stage
...
First stage of import
2022-05-03 08:50:18 -04:00
Christopher Kelly
81fe4c937e
Hopefully fix link errors on Intel compilers due to having no function body for MomentumFilterBase::apply_phase
2022-04-12 09:51:59 -04:00
Christopher Kelly
f77f3a6598
Imported G-parity flavor algebra + tester from feature/gparity_HMC branch
2022-04-06 10:21:04 -04:00
Peter Boyle
239afb18fb
Merge branch 'feature/dirichlet' into feature/dirichlet-gparity
2022-04-05 16:49:32 -04:00
Peter Boyle
ef820a26cd
Bcopy on crusher compile
2022-04-05 16:49:02 -04:00
Peter Boyle
65abe4d0d3
Merge branch 'feature/dirichlet' into feature/dirichlet-gparity
2022-04-05 16:26:54 -04:00
Peter Boyle
5012adfebf
Merge branch 'develop' into feature/dirichlet
2022-04-05 16:26:19 -04:00
Peter Boyle
b808d48fa1
Tone down printing in integrator
2022-04-05 16:25:22 -04:00
Peter Boyle
83f818a99d
Updates for DDHMC
2022-04-05 16:24:34 -04:00
Peter Boyle
387397374a
Current run options
2022-03-23 16:35:11 -04:00
Peter Boyle
605cf401e1
Merge branch 'feature/sumd-npr' into develop
2022-03-16 22:43:12 +00:00
Peter Boyle
f99c3660d2
Merge branch 'feature/cpu-threaded-smp' into develop
2022-03-16 22:07:54 +00:00
Peter Boyle
92a83a9eb3
Performance improve for Tesseract
2022-03-16 17:14:36 +00:00
Peter Boyle
b615fa0f35
Merge pull request #388 from fjosw/feature/sumd-npr
...
Feature/sumd npr
2022-03-15 09:05:57 -04:00
Peter Boyle
bb5c16b97f
New scripts
2022-03-03 17:00:37 -05:00
Peter Boyle
0d80eeb545
small DDHMC update
2022-03-03 16:56:02 -05:00
d1decee4cc
Cleaned up unused variables in Lattice_reduction_gpu.h
2022-03-02 16:54:23 +00:00
d4ae71b880
sum_gpu_large and sum_gpu templates added.
2022-03-02 15:40:18 +00:00
Peter Boyle
b0f4eee78b
New files
2022-03-01 19:09:13 -05:00
Peter Boyle
5340e50427
HMC running with new formulation
2022-03-01 17:10:25 -05:00
Peter Boyle
e16fc5b2e4
Threaded intranode comms transfer - ideally between NUMA domains
2022-03-01 11:17:24 -05:00
Peter Boyle
694306f202
Configure for mac arm
2022-03-01 10:53:44 -05:00
Peter Boyle
9aac1e6d64
Merge branch 'develop' into feature/sumd-npr
2022-03-01 10:51:38 -05:00
Peter Boyle
3e882f555d
Large / small sumD options
2022-03-01 08:54:45 -05:00
Peter Boyle
0f1c5b08a1
Dirichlet filters running on AMD and now integrated in Fermion op
2022-02-23 19:29:28 -05:00
Peter Boyle
70988e43d2
Passes multinode dirichlet test with boundaries at
...
node boundary or at the single rank boundary
2022-02-23 01:42:14 -05:00
Peter Boyle
aab3bcb46f
Dirichlet first cut - wrong answers on dagger multiply.
...
Struggling to get a compute node so changing systems
2022-02-22 19:58:33 +00:00
Peter Boyle
da06d15f73
Merge branch 'feature/feature/staggered-comms' into develop
2022-02-17 04:58:50 +00:00
Peter Boyle
e8b1251b8c
Staggered fix finished
2022-02-17 04:51:13 +00:00
Peter Boyle
63dbaeefaa
Extra barrier prior to finalize just in case it fixes an issue on Tursa
2022-02-16 14:01:43 +00:00
Peter Boyle
e8c187b323
SyCL happier?
2022-02-15 11:24:38 -05:00
Peter Boyle
fad5a74a4b
Bug fix to detection case
2022-02-15 10:27:39 -05:00
Peter Boyle
e83f6a6ae9
Merge branch 'develop' into feature/feature/staggered-comms
2022-02-15 08:52:39 -05:00
Peter Boyle
0c1618197f
Faster intranode MPI works now
2022-02-15 08:52:07 -05:00
Peter Boyle
f49d5c2d22
Updated scripts for crusher
2022-02-14 17:55:16 -05:00
Peter Boyle
a3b022d469
Crusher compile
2022-02-14 15:09:08 -05:00
Peter Boyle
48772f0976
Merge pull request #384 from jdmaia/hip_launchbounds
...
Changing thread block order and adding launch_bounds
2022-02-14 11:08:28 -05:00
Peter Boyle
c322420580
Dont instantiate an Nc=3 and non-GP hardwired code for other implementations
2022-02-14 16:04:08 +00:00
Azusa Yamaguchi
6283d11d50
Add the comment line to tell the existance of copied data/buffer
2022-02-08 15:22:06 +00:00
Julio Maia
86f4e17928
Changing thread block order and adding launch_bounds
2022-02-07 11:29:37 -06:00
Peter Boyle
6616d5d090
Commit
2022-02-02 16:38:24 -05:00
Peter Boyle
215df671be
Merge pull request #382 from DanielRichtmann/feature/compact-clover
...
Compact Clover Fermions
2022-02-01 21:45:38 -05:00
Daniel Richtmann
1b6b12589f
Get splitting up into implementation and instantiation files correct
2022-02-02 00:51:11 +01:00
Daniel Richtmann
3082ab8252
Check in compact version of wilson clover fermions
2022-02-02 00:50:05 +01:00
Daniel Richtmann
add86cd7f4
Abandon ET for clover application, use construct similar to multLink
2022-02-01 23:09:06 +01:00
Daniel Richtmann
0b6fd20c54
Enable memory coalescing in clover term generation
2022-02-01 23:09:06 +01:00
Daniel Richtmann
e83423fee6
Refactor clover to align with other files and prepare for upcoming changes
2022-02-01 23:09:06 +01:00
Daniel Richtmann
b4f8e87982
Have Grid's cli interface understand floats
2022-02-01 23:09:06 +01:00