portelli/Grid - Grid - DiRAC Tursa git server

mirror of https://github.com/paboyle/Grid.git synced 2026-07-19 16:43:27 +01:00

Author	SHA1	Message	Date
Peter Boyle	be18ffe3b4	Further tuning and lanczos	2023-09-27 16:21:58 -04:00
Peter Boyle	3a86cce8c1	Compile	2023-09-27 16:19:18 -04:00
Peter Boyle	37884d369f	Coarse space is expensive, but gives a speed up in fine matrix multiplies now. Down to optimisation	2023-09-25 17:24:19 -04:00
Peter Boyle	9246e653cd	Basic non-local coarsening of operator test	2023-09-25 17:20:58 -04:00
Peter Boyle	b9dcad89e8	Test cases for coarsening with non-local stencil	2023-09-07 10:53:22 -04:00
Peter Boyle	2b43308208	First cut non-local coarsening	2023-08-25 17:38:07 -04:00
Christopher Kelly	f44dce390f	Implemented accelerator-optimized versions of localCopyRegion and insertSliceLocal to speed up padding. Fixed const correctness on PaddedCell methods. Fixed compile issues on Crusher. Added timing breakdowns for PaddedCell::Expand and the padded implementations of the staples, visible under --log Performance. Optimized kernel for StaplePadded. Test_iwasaki_action_newstaple now repeats the calculation 10 times and reports average timings.	2023-06-27 14:58:10 -04:00
Christopher Kelly	6f6844ccf1	Added new StapleAll and RectStapleAll functions that return the staples for all mu as an array. Modified plaq+rectangle gauge actions to use the above. Added a test code to confirm the above changes.	2023-06-26 15:48:47 -04:00
Christopher Kelly	4c6613d72c	Modified RectStapleDouble and RectStapleOptimised to use Gauge-BC respecting CshiftLink. Added test code tests/debug/Test_optimized_staple_gaugebc demonstrating equivalence of above to RectStapleUnoptimised for cconj gauge BCs. Removed optimized staple only being used for periodic gauge BCs; it is now always used.	2023-06-26 10:20:23 -04:00
Christopher Kelly	4241c7d4a3	Imported coalescedReadGeneralPermute GPU implementation from Christoph. Fixed bug in padded staple code where extract was being called on the result before the GPU view was closed. Fixed compile issue with pointer cast in padded staple code. Added timing summaries of padded staple code and timing breakdown of staple implementation to Test_padded_cell_staple.	2023-06-21 16:01:01 -04:00
Christopher Kelly	7b11075102	The user can now specify the implementation of Cshift used by the PaddedCell class through a virtual base class API. Implementations for default (regular Cshift) and for gauge links (which respects the gauge BCs). Fixed const-correctness for PaddedCell and ConjugateGimpl::setDirections. Modified test code for padded-cell implementation of staple, rect-staple to use cconj BCs.	2023-06-20 17:09:56 -04:00
Christopher Kelly	abc658dca5	Added coalescedReadGeneralPermute CPU implementation based on Christoph's GPT code. In a test code, implemented a padded-cell version of the staple and rectangular-staple calculation.	2023-06-20 16:14:25 -04:00
david clarke	c7bdf2c0e4	3-link test at least gives an answer	2023-05-21 04:33:20 -06:00
Peter Boyle	9c8750f261	Merge branch 'develop' of https://github.com/paboyle/Grid into develop	2023-05-11 12:29:09 -04:00
Peter Boyle	ccd21f96ff	Plaquette agreeing and moving to final form (slowly) need to optimise	2023-02-01 22:57:44 -05:00
Peter Boyle	4b90cb8888	First cut passes combining padded cell with general stencil towards fast plaquette and staggered force	2023-02-01 22:14:10 -05:00
Peter Boyle	3dbfce5223	Tests clean build on HIP	2022-11-16 20:15:51 -05:00
Peter Boyle	8cd4263974	Tests compile	2021-04-25 22:20:37 -04:00
Michael Marshall	2983b6fdf6	Optional (superficial) changes to make comparison with Hadrons WardIdentity module easier: use Schur solver; example of Hadrons random gauge init; logging updates; only solve reverse propagator if provided	2021-01-23 12:41:48 +00:00
Peter Boyle	11a5fd09d6	Hot config	2021-01-21 21:39:41 -05:00
Michael Marshall	873519e960	Enable existing conserved current code for CUDA (compiles OK for CUDA 10.1). Add option to Test_cayley_mres to load a configuration	2020-12-14 16:06:10 +00:00
Peter Boyle	d201277652	Expose Nc as a compile time configure option. Remove precision option	2020-10-07 13:07:00 -04:00
Peter Boyle	d982a5b6d5	Fix coarsened	2020-09-01 00:14:04 -04:00
Peter Boyle	1a4c8c3387	Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.	2020-06-05 18:52:35 -04:00
Peter Boyle	f999408e92	View location and access mode	2020-05-21 16:14:20 -04:00
Peter Boyle	29ae5615c0	Sequential fix	2020-04-29 03:05:15 -04:00
Peter Boyle	ed70cce542	Test for 5D DWF observables	2020-04-23 04:29:45 -04:00
Peter Boyle	462900b48d	Modified entire test directory to suit new GPU constructs for looping	2019-06-15 12:53:27 +01:00
Peter Boyle	bcbb5e9d26	Remove assembly tests	2019-06-15 07:57:05 +01:00
Peter Boyle	422764757d	Updates in tests to make all of Grid compile	2018-12-14 16:55:54 +00:00
Peter Boyle	b57a4d32aa	Merge branch 'develop' into feature/gpu-port	2018-12-13 05:11:34 +00:00
Peter Boyle	68c13045d6	Added a test for Felix and Michael to look at	2018-11-07 23:40:15 +00:00
Peter Boyle	24c07694bc	Mixed precision now supported in MADWF	2018-10-14 00:22:52 +01:00
Peter Boyle	f0229025e2	MADWF working across a range of actions	2018-10-13 19:55:03 +01:00
Peter Boyle	49f25e08e8	PauliVillars based 4D -> 5D reconstruction with Fourier Accelerated PV inverse by Christoph. Differs from the one by Rudy in BFM since it vectorises the twisted 4D solves in pairs.	2018-10-11 12:35:32 +01:00
paboyle	285deab432	Coordinate handling GPU friendly. Avoid std::vector	2018-02-24 22:19:28 +00:00
paboyle	dd8f2a64fe	Interface to suit hadrons on Lanczos	2018-02-13 02:08:49 +00:00
paboyle	98af36217a	Zero changes. (I mean literally)	2018-01-27 23:46:02 +00:00
paboyle	c4f82e072b	_grid becomes private ; use Grid()	2018-01-27 00:04:12 +00:00
paboyle	3f9654e397	Hiding internals	2018-01-26 23:09:03 +00:00
paboyle	d74c21a386	Global edit for QCD namespace removal & NAMESPACE macros	2018-01-15 09:37:58 +00:00
paboyle	cb9ff20249	Approx tests and lanczos improvement	2017-10-13 11:30:50 +01:00
paboyle	9fe6ac71ea	Starting reorg of Blocked lanczos	2017-10-11 10:12:07 +01:00
David Murphy	459f70e8d4	Check-in of working Mobius EOFA class and tests	2017-08-22 22:38:30 -04:00
David Murphy	ec1e2f7a40	Add (mostly implemented) ExactOneFlavourRatio pseudofermion class and tests of Shamir heatbath and action	2017-08-16 12:38:59 -04:00
David Murphy	6d0786ff9d	Typo fixes and check-in of G-parity action test for DWF	2017-08-15 22:47:00 -04:00
David Murphy	202a7fe900	Re-import DWF and abstract base EOFA fermion classes and tests	2017-08-15 13:36:08 -04:00
paboyle	e8b95bd35b	Clean up finished. Could shrink Lanczos to around 400 lines at a push	2017-06-21 02:50:09 +01:00
Azusa Yamaguchi	0a8faac271	Fix make tests compile	2017-06-19 22:54:18 +01:00
paboyle	a8db024c92	Cleaning up the dense matrix and lanczos sector	2017-04-15 08:54:11 +01:00
