portelli/Grid - Grid - DiRAC Tursa git server

mirror of https://github.com/paboyle/Grid.git synced 2025-09-19 01:31:04 +01:00

Author	SHA1	Message	Date
Alessandro Lupo	d6ff644aab	Towards the day all tests compile	2023-03-14 10:43:25 +00:00
Julian Lenz	29586f6b5e	Deactivate some tests for Nc!=3	2023-03-13 08:17:14 +00:00
Christopher Kelly	e82cf1d311	Further prec-change improvements Mixed prec CG algorithm has been modified to precompute precision change workspaces As the original Test_dwf_mixedcg_prec has been coopted to do a performance stability and reproducibility test, requiring the single-prec CG to be run 200 times, I have created a new version of Test_dwf_mixedcg_prec in the solver subdirectory that just does the mixed vs double CG test	2023-02-23 09:45:29 -05:00
Christopher Kelly	1db58a8acc	Precision change improvements Added a new, much faster implementation of precision change that uses (optionally) a precomputed workspace containing pointer offsets that is device resident, such that all lattice copying occurs only on the device and no host<->device transfer is required, other than the pointer table. It also avoids the need to unpack and repack the fields using explicit lane copying. When this new precisionChange is called without a workspace, one will be computed on-the-fly; however it is still considerably faster than the original implementation. In the special case of using double2 and when the Grids are the same, calls to the new precisionChange will automatically use precisionChangeFast, such that there is a single API call for all precision changes. Reliable update and mixed-prec multishift have been modified to precompute precision change workspaces Renamed the original precisionChange as precisionChangeOrig Fixed incorrect pointer offset bug in copyLane Added a test and a benchmark for precisionChange Added a test for reliable update CG	2023-02-21 10:52:42 -05:00
Peter Boyle	ccd21f96ff	Plaquette agreeing and moving to final form (slowly) need to optimise	2023-02-01 22:57:44 -05:00
Peter Boyle	4b90cb8888	First cut passes combining padded cell with general stencil towards fast plaquette and staggered force	2023-02-01 22:14:10 -05:00
Peter Boyle	4ca1bf7cca	Added gauge invariance test	2022-12-21 07:23:16 -05:00
Peter Boyle	ede02b6883	Memory manager debug Felix case	2022-12-20 05:10:23 -05:00
Peter Boyle	d8c29f5fcf	Updated FFT test for PETSc	2022-12-18 12:05:00 -05:00
Peter Boyle	281f8101fe	Matt FFT test	2022-12-17 20:35:33 -05:00
Peter Boyle	472ed2dd5c	Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet	2022-12-17 20:17:09 -05:00
Peter Boyle	4f85672674	Simpler test for PETSc	2022-12-17 20:16:11 -05:00
Peter Boyle	5bb7ba92fa	Test for DDHMC force term	2022-12-13 08:15:11 -05:00
Chulwoo Jung	dc6a38f177	Minor cleanup	2022-11-30 17:13:12 -05:00
Chulwoo Jung	82c1ecf60f	Block lanczos added	2022-11-30 16:08:40 -05:00
Julian Lenz	505fa49983	Renamed SUn.h -> GaugeGroup.h	2022-11-30 17:09:48 +00:00
Julian Lenz	7bcf33def9	Removed Sp2n.h	2022-11-30 16:59:46 +00:00
Julian Lenz	fa71b46a41	Hide nsp	2022-11-30 14:44:23 +00:00
Julian Lenz	6e750ecb0e	Remove apparently forgotten file	2022-11-28 16:33:46 +00:00
Julian Lenz	1aa28b47ae	Add existing test to check	2022-11-25 17:40:40 +00:00
Julian Lenz	629cb2987a	Fix typo in Makefile.am	2022-11-25 17:40:21 +00:00
Alessandro Lupo	22064c7e4c	Fixing #11	2022-11-25 13:10:29 +00:00
Alessandro Lupo	2de03e5172	Revert "Revert "Fixing issue #11 : consistent use of ncolour and nsp"" This reverts commit `3af4929dda`.	2022-11-23 19:40:28 +00:00
Alessandro Lupo	3af4929dda	Revert "Fixing issue #11 : consistent use of ncolour and nsp" This reverts commit `1ba429345b`.	2022-11-23 19:34:59 +00:00
Alessandro Lupo	1ba429345b	Fixing issue #11 : consistent use of ncolour and nsp	2022-11-23 18:45:01 +00:00
Peter Boyle	3dbfce5223	Tests clean build on HIP	2022-11-16 20:15:51 -05:00
Peter Boyle	e51eaedc56	Making tests compile	2022-11-15 22:58:30 -05:00
Peter Boyle	a3927a8a27	Dirichlet	2022-11-02 20:22:27 -04:00
Peter Boyle	c82b164f6b	Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet	2022-10-04 17:41:48 -04:00
Christopher Kelly	66d001ec9e	Refactored Wilson flow class; previously the class implemented both iterative and adaptive smearing, but only the iterative method was accessible through the Smearing base class. The implementation of Smearing also forced a clunky need to pass iterative smearing parameters through the constructor but adaptive smearing parameters through the function call. Now there is a WilsonFlowBase class that implements common functionality, and separate WilsonFlow (iterative) and WilsonFlowAdaptive (adaptive) classes, both of which implement Smearing virtual functions. Modified the Wilson flow adaptive smearing step size update to implement the original Ramos definition of the distance, where previously it used the norm of a difference which scales with the volume and so would choose too coarse or too fine steps depending on the volume. This is based on Chulwoo's code. Added a test comparing adaptive (with tuneable tolerance) to iterative Wilson flow smearing on a random gauge configuration.	2022-10-03 10:59:38 -04:00
Christopher Kelly	19da647e3c	Added support for non-periodic gauge field implementations in the random gauge shift performed at the start of the HMC trajectory (The above required exposing the gauge implementation to the HMC class through the Integrator class) Made the random shift optional (default on) through a parameter in HMCparameters Modified ConjugateBC::CshiftLink such that it supports any shift in -L < shift < L rather than just +-1 Added a tester for the BC-respecting Cshift Fixed a missing system header include in SSE4 intrinsics wrapper Fixed sumD_cpu for single-prec types performing an incorrect conversion to a single-prec data type at the end, that fails to compile on some systems	2022-09-09 12:47:09 -04:00
Peter Boyle	1177b8f661	Merge branch 'develop' into feature/dirichlet	2022-08-31 19:05:57 -04:00
Peter Boyle	3c1c51f9aa	Merge branch 'feature/dirichlet-gparity' into feature/dirichlet	2022-08-31 18:25:34 -04:00
Peter Boyle	8cc3c522c3	Merge pull request #409 from giltirn/feature/dirichlet-gparity-stage Import round 5	2022-08-31 18:22:50 -04:00
Gurtej Kanwar	60dfb49afa	Remove FP16 tests when FP16 is disabled	2022-08-21 17:29:55 +02:00
Peter Boyle	9b20f1449c	Better timing	2022-07-28 11:37:12 -04:00
Christopher Kelly	33e4a0caee	Imported changes from feature/gparity_HMC branch: Rework of WilsonFlow class Fixed logic error in smear method where the step index was initialized to 1 rather than 0, resulting in the logged output value of tau being too large by epsilon Previously smear_adaptive would maintain the current value of tau as a class member variable whereas smear would compute it separately; now both methods maintain the current value internally and it is updated by the evolve_step routines. Both evolve methods are now const. smear_adaptive now also maintains the current value of epsilon internally, allowing it to be a const method and also allowing the same class instance to be reused without needing to be reset Replaced the fixed evaluation of the plaquette energy density and plaquette topological charge during the smearing with a highly flexible general strategy where the user can add arbitrary measurements as functional objects that are evaluated at an arbitrary frequency By default the same plaquette-based measurements are performed, but additional example functions are provided where the smearing is performed with different choices of measurement that are returned as an array for further processing Added a method to compute the energy density using the Cloverleaf approach which has smaller discretization errors Added a new tensor utility operation, copyLane, which allows for the copying of a single SIMD lane between two instances of the same tensor type but potentially different precisions To LocalCoherenceLanczos, added the option to compute the high/low eval of the fine operator on every restart to aid in tuning the Chebyshev Added Test_field_array_io which demonstrates and tests a single-file write of an arbitrary array of fields Added Test_evec_compression which generates evecs using Lanczos and attempts to compress them using the local coherence technique Added Test_compressed_lanczos_gparity which demonstrates the local coherence Lanczos for G-parity BCs Added HMC main programs for the 40ID and 48ID G-parity lattices	2022-07-01 14:12:12 -04:00
Peter Boyle	1f903d9296	Merge branch 'feature/dirichlet' into feature/dirichlet-gparity	2022-07-01 12:12:50 -04:00
Peter Boyle	53d01312b3	Rough flop counting, need to add M5D, M5Ddag, MooeeInv flops	2022-06-30 13:44:09 -04:00
Christopher Kelly	fd933420c6	Imported changes from feature/gparity_HMC branch: Added a bounds-check function for the RHMC with arbitrary power Added a pseudofermion action for the rational ratio with an arbitrary power and a mixed-precision variant of the same. The existing one-flavor rational ratio class now uses the general class under the hood To support testing of the two-flavor even-odd ratio pseudofermion, separated the functionality of generating the random field and performing the heatbath step, and added a method to obtain the pseudofermion field Added a new HMC runner start type: CheckpointStartReseed, which reseeds the RNG from scratch, allowing for the creation of new evolution streams from an existing checkpoint. Added log output of seeds used when the RNG is seeded. EOFA changes: To support mixed-precision inversion, generalized the class to maintain a separate solver for the L and R operators in the heatbath (separate solvers are already implemented for the other stages) To support mixed-precision, the action of setting the operator shift coefficients is now maintained in a virtual function. A derived class for mixed-precision solvers ensures the coefficients are applied to both the double and single-prec operators The \|\|^2 of the random source is now stored by the heatbath and compared to the initial action when it is computed. These should be equal but may differ if the rational bounds are not chosen correctly, hence serving as a useful and free test Fixed calculation of M_eofa (previously incomplete and #if'd out) Added functionality to compute M_eofa^-1 to complement the calculation of M_eofa (both are equally expensive!) To support testing, separated the functionality of generating the random field and performing the heatbath step, and added a method to obtain the pseudofermion field Added a test program which computes the G-parity force using the 1 and 2 flavor implementations and compares the result. Test supports DWF, EOFA and DSDR actions, chosen by a command line option. The Mobius EOFA force test now also checks the rational approximation used for the heatbath Added a test program for the mixed precision EOFA compared to the double-prec implementation, G-parity HMC test now applied GPBC in the y direction and not the t direction (GPBC in t are no longer supported) and checkpoints after every configuration Added a test program which computes the two-flavor G-parity action (via RHMC) with both the 1 and 2 flavor implementations and checks they agree Added a test program to check the implementation of M_eofa^{-1}	2022-06-22 10:27:48 -04:00
Peter Boyle	8208a6214f	Merge branch 'feature/dirichlet-gparity' into feature/dirichlet	2022-06-15 19:23:48 -04:00
Christopher Kelly	1ad54d049d	To PeriodicBC and ConjugateBC, added a new function "CshiftLink" which performs a boundary-aware C-shift of links or products of links. For the latter, the links crossing the global boundary are complex-conjugated. To the gauge implementations, added CshiftLink functions calling into the appropriate operation for the BC in a given direction. GaugeTransform, FourierAcceleratedGaugeFixer and WilsonLoops::FieldStrength no longer implicitly assume periodic boundary conditions; instead the shifted link is obtained using CshiftLink and is aware of the gauge implementation. Added an assert-check to ensure that the gauge fixing converges within the specified number of steps. Added functionality to compute the timeslice averaged plaquette Added functionality to compute the 5LI topological charge and timeslice topological charge Added a check of the properties of the charge conjugation matrix C=-gamma_2 gamma_4 to Test_gamma Fixed const correctness for Replicate Modified Test_fft_gfix to support either conjugate or periodic BCs, optionally disabling Fourier-accelerated gauge fixing, and tuning of alpha using cmdline options	2022-06-02 15:30:41 -04:00
Peter Boyle	18028f4309	Merge branch 'develop' into feature/dirichlet	2022-05-24 18:26:18 -07:00
Peter Boyle	b52e8ef65a	Dirichlet changes	2022-05-19 16:45:41 -07:00
Christopher Kelly	6121397587	Imported changes from feature/gparity_HMC branch: Added storage of final true residual in mixed-prec CG and enhanced log output Fixed const correctness of multi-shift constructor Added a mixed precision variant of the multi-shift algorithm that uses a single precision operator and applies periodic reliable update to the residual Added tests/solver/Test_dwf_multishift_mixedprec to test the above Fixed local coherence lanczos using the (large!) max approx to the chebyshev eval as the scale from which to judge the quality of convergence, resulting a test that always passes Added a method to local coherence lanczos class that returns the fine eval/evec pair Added iterative log output to power method Added optional disabling of the plaquette check in Nerscio to support loading old G-parity configs which have a factor of 2 error in the plaquette G-parity Dirac op no longer allows GPBC in the time direction; instead we toggle between periodic and antiperiodic Replaced thread_for G-parity 5D force insertion implementation with accelerator_for version capable of running on GPUs Generalized tests/lanczos/Test_dwf_lanczos to support regular DWF as well as Gparity, with the action chosen by a command line option Modified tests/forces/Test_dwf_gpforce,Test_gpdwf_force,Test_gpwilson_force to use GPBC a spatial direction rather than the t-direction, and antiperiodic BCs for time direction tests/core/Test_gparity now supports using APBC in time direction using command line toggle	2022-05-09 16:27:57 -04:00
Peter Boyle	79ea027c0b	Merge pull request #377 from RJHudspith/develop NERSC and ILDG for non-SU(3) configuration checkpoints	2022-05-03 08:55:48 -04:00
Christopher Kelly	f77f3a6598	Imported G-parity flavor algebra + tester from feature/gparity_HMC branch	2022-04-06 10:21:04 -04:00
Fabian Joswig	b8bc560b51	Test_wilson_conserved_current implemented, all 5d references removed.	2022-04-05 17:33:45 +01:00
Fabian Joswig	6bc2483d57	Merge branch 'feature/eclover' into feature/conserved_current_wilson	2022-04-05 15:26:49 +01:00
Fabian Joswig	82aecbf4cf	Test_wilson_conserved_current added	2022-04-05 15:26:39 +01:00

1 2 3 4 5 ...

1497 Commits