Christopher Kelly 
							
						 
					 
					
						
						
							
						
						1b84f59273 
					 
					
						
						
							
							Added a mixed precision multishift algorithm for which the matrix multiplies are performed in single precision but the search directions are accumulated in double precision.  
						
						
						
						
						A reliable update step is performed at a tunable frequency to correct the residual. A final mixed-precision single-shift solve is run on each pole for cleanup if necessary.
A test is provided to demonstrate the algorithm. 
						
						
							
						
					 
					
						2021-01-06 12:24:44 -05:00 
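The idea in the 1b84f59273 message can be illustrated with a minimal sketch. It is not Grid's implementation and makes several simplifying assumptions: it handles a single shift rather than a multishift solve, it emulates single precision by rounding through `struct`, and it restarts the search direction at every reliable update (a simplification chosen here for robustness). All names (`cg_mixed_precision`, `update_freq`, and so on) are hypothetical.

```python
import struct

def to_f32(v):
    """Round every entry to the nearest IEEE-754 single-precision value."""
    return [struct.unpack("f", struct.pack("f", x))[0] for x in v]

def matvec(A, v):
    """Dense matrix-vector product in double precision."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def matvec_single(A32, v):
    """Emulated single-precision product: operand and result are rounded to float32."""
    return to_f32(matvec(A32, to_f32(v)))

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cg_mixed_precision(A, b, tol=1e-10, max_iter=200, update_freq=10):
    """Solve A x = b for symmetric positive-definite A; returns (x, iterations)."""
    A32 = [to_f32(row) for row in A]   # single-precision copy of the operator
    x = [0.0] * len(b)                 # solution, accumulated in double
    r = list(b)                        # residual for the initial guess x = 0
    p = list(r)                        # search direction, kept in double
    rr = dot(r, r)
    target = tol * tol * dot(b, b)
    if rr <= target:
        return x, 0
    for k in range(1, max_iter + 1):
        Ap = matvec_single(A32, p)     # the only single-precision step
        alpha = rr / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * ai for ri, ai in zip(r, Ap)]
        reliable = (k % update_freq == 0) or dot(r, r) <= target
        if reliable:
            # Reliable update: replace the drifting residual by b - A x in double.
            r = [bi - ai for bi, ai in zip(b, matvec(A, x))]
        rr_new = dot(r, r)
        if reliable and rr_new <= target:
            return x, k                # converged on the true residual
        # Assumption of this sketch: restart the direction after a reliable update.
        beta = 0.0 if reliable else rr_new / rr
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        rr = rr_new
    return x, max_iter

# Small symmetric positive-definite test problem (hypothetical example data).
n = 8
A = [[4.1 if i == j else (-1.0 if abs(i - j) == 1 else 0.0) for j in range(n)]
     for i in range(n)]
x_true = [float(i + 1) for i in range(n)]
b = matvec(A, x_true)
x, iters = cg_mixed_precision(A, b)
print(iters, max(abs(xi - ti) for xi, ti in zip(x, x_true)))
```

Here `update_freq` plays the role of the tunable reliable-update frequency: smaller values recompute the double-precision residual more often, at the cost of extra double-precision operator applications.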
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						1fb41a4300 
					 
					
						
						
							
							Added copyLane function to Tensor_extract_merge.h which copies one lane of data from an input tensor object to a different lane of an output tensor object of potentially different precision  
						
						
						
						
						precisionChange lattice function now uses copyLane to remove need for temporary scalar objects, reducing register footprint and significantly improving performance 
						
						
							
						
					 
					
						2021-01-06 11:50:56 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						287bac946f 
					 
					
						
						
							
							ConjugateGradientMixedPrec now stores final true residual and uses the precisionChange workspaces for improved efficiency  
						
						
						
						
							
						
					 
					
						2021-01-06 09:50:41 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						80c14be65e 
					 
					
						
						
							
							Added core test to check precision change  
						
						
						
						
							
						
					 
					
						2021-01-06 09:34:44 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						d7a2a4852d 
					 
					
						
						
							
							Reimplemented precisionChange to run on GPUs. A workspace containing the mapping table can be optionally precomputed and reused for improved performance.  
						
						
						
						
							
						
					 
					
						2021-01-06 09:30:49 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						d185f2eaa7 
					 
					
						
						
							
							OneFlavourEvenOddRatioRationalPseudoFermionAction now derives from GeneralEvenOddRatioRationalPseudoFermionAction and simply transcribes the parameters
						
						
						
						
							
						
					 
					
						2020-12-23 16:26:10 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						813d4cd900 
					 
					
						
						
							
							Added test program that ensures the generic checkerboarded RHMC (with parameters set appropriately) gives the same answer as the existing 1f code  
						
						
						
						
							
						
					 
					
						2020-12-23 16:01:42 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						75c6c6b173 
					 
					
						
						
							
							General RHMC pseudofermion action now allows for different rational approximations to be used in the MD and action evaluation  
						
						
						
						
							
						
					 
					
						2020-12-23 11:19:26 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						220ad5e3ee 
					 
					
						
						
							
							Added more verbose log output to GeneralEvenOddRatioRationalPseudoFermionAction  
						
						
						
						
						In GeneralEvenOddRatioRationalPseudoFermionAction, setting the bounds check frequency to 0 now disables the check 
						
						
							
						
					 
					
						2020-12-22 11:08:22 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						ba5dc670a5 
					 
					
						
						
							
							Reimplemented GparityWilsonImpl::InsertForce5D to run efficiently on GPUs  
						
						
						
						
						Swapped order of templated tensor code and c-number specializations in Tensor_outer.h to fix compile issue with type deduction on Summit 
						
						
							
						
					 
					
						2020-12-22 10:10:07 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						a0ca362690 
					 
					
						
						
							
							Added an RHMC pseudofermion action, GeneralEvenOddRatioRationalPseudoFermionAction, that works for an arbitrary fractional power, not just a square root  
						
						
						
						
						Added a test evolution for the above, Test_rhmc_EOWilsonRatioPowQuarter, demonstrating conservation of Hamiltonian
Fixed HMC ignoring the MetropolisTest parameter of HMCparameters 
						
						
							
						
					 
					
						2020-12-17 16:21:58 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						249b6e61ec 
					 
					
						
						
							
							For G-parity BCs the Nd-1 direction is now assumed to be the time direction and setting a twist in this direction will apply antiperiodic BCs  
						
						
						
						
						Added option to run Test_gparity with antiperiodic time BCs 
						
						
							
						
					 
					
						2020-12-17 14:09:00 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						9e7bacb5a4 
					 
					
						
						
							
							Merge branch 'develop' into feature/gparity_twist_GPU  
						
						
						
						
							
						
					 
					
						2020-12-07 11:55:39 -05:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						2ef1fa66a8 
					 
					
						
						
							
							Improved performance of G-parity kernel for GPUs by simplifying multLink implementation  
						
						
						
						
							
						
					 
					
						2020-12-07 11:53:35 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						cf76741ec6 
					 
					
						
						
							
							Intel DPCPP Gold happy now (compiles all, runs Benchmark_dwf_fp32 )  
						
						
						
						
							
						
					 
					
						2020-12-03 03:47:11 -08:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						497e7c1c40 
					 
					
						
						
							
							Duplicate code  
						
						
						
						
							
						
					 
					
						2020-12-02 17:55:30 -08:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						888eacd3b8 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2020-11-24 21:46:33 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						321f0f51b5 
					 
					
						
						
							
							Project to SU(N)  
						
						
						
						
							
						
					 
					
						2020-11-24 21:46:10 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						30ad9578a2 
					 
					
						
						
							
							Merge branch 'lehner-feature/gpt' into develop  
						
						
						
						
							
						
					 
					
						2020-11-24 06:10:24 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9dce101586 
					 
					
						
						
							
							Merge branch 'feature/gpt' of  https://github.com/lehner/Grid  into lehner-feature/gpt  
						
						
						
						
							
						
					 
					
						2020-11-24 06:10:16 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						97e264d0ff 
					 
					
						
						
							
							Christoph's changes  
						
						
						
						
							
						
					 
					
						2020-11-23 15:46:11 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						683a5e5bf5 
					 
					
						
						
							
							Stencil use host vector for integer table on enable-shared=no and mirror it on device
						
						
						
						
							
						
					 
					
						2020-11-23 15:39:51 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d4861a362c 
					 
					
						
						
							
							Stencil use non-UVM memory for look up table on enable-shared=no  
						
						
						
						
							
						
					 
					
						2020-11-23 15:38:49 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5ff3eae027 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2020-11-20 13:14:44 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						147dc15d26 
					 
					
						
						
							
							Update  
						
						
						
						
							
						
					 
					
						2020-11-20 13:13:59 -05:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						c61ea72949 
					 
					
						
						
							
							Merge pull request  #19  from paboyle/develop  
						
						
						
						
						Sync 
						
						
							
						
					 
					
						2020-11-20 17:31:13 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						86e8b9fe38 
					 
					
						
						
							
							ALLOC_ALIGN removed  
						
						
						
						
							
						
					 
					
						2020-11-20 17:07:16 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						612e468889 
					 
					
						
						
							
							Configurable ALLOC_ALIGN and ALLOC_CACHE  
						
						
						
						
							
						
					 
					
						2020-11-20 16:48:28 +01:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						4ea8d128c2 
					 
					
						
						
							
							Merge pull request  #18  from paboyle/develop  
						
						
						
						
						Sync 
						
						
							
						
					 
					
						2020-11-20 15:36:50 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e49b7f2f88 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2020-11-19 19:24:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						aace3d47b9 
					 
					
						
						
							
							partial work in progress  
						
						
						
						
							
						
					 
					
						2020-11-19 19:24:14 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d5049949a4 
					 
					
						
						
							
							Starting to fix reunitarise  
						
						
						
						
							
						
					 
					
						2020-11-19 19:23:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f1c7480e3c 
					 
					
						
						
							
							Warning remove  
						
						
						
						
							
						
					 
					
						2020-11-19 19:23:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5adae5d6ff 
					 
					
						
						
							
							Unused variable remove  
						
						
						
						
							
						
					 
					
						2020-11-19 19:22:12 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a8412ace05 
					 
					
						
						
							
							Merge pull request  #317  from i-kanamori/develop  
						
						
						
						
						adding an error check for input: Parameters.StartingType 
						
						
							
						
					 
					
						2020-11-18 23:09:40 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9fd1c2ad4b 
					 
					
						
						
							
							Merge pull request  #325  from DanielRichtmann/feature/threaded-clover-inversion  
						
						
						
						
						Threaded clover term inversion 
						
						
							
						
					 
					
						2020-11-18 23:08:37 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						4cf3575353 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2020-11-18 03:07:36 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						804a810d68 
					 
					
						
						
							
							Wildcard mismatch  
						
						
						
						
							
						
					 
					
						2020-11-18 03:06:53 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8fcb392e24 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2020-11-17 04:51:31 -08:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						dd8d70eeff 
					 
					
						
						
							
							Build without LIME  
						
						
						
						
							
						
					 
					
						2020-11-17 04:41:15 -08:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						aa8aba6543 
					 
					
						
						
							
							--shm-force-mpi  
						
						
						
						
							
						
					 
					
						2020-11-16 20:15:50 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						13df14f96e 
					 
					
						
						
							
							Switch off SHM paths with --disable-shm  
						
						
						
						
							
						
					 
					
						2020-11-16 18:07:15 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3aab983760 
					 
					
						
						
							
							Flop count set as in DiRAC-ITT-2020 (mistakenly 20% low, but must maintain consistency)
						
						
						
						
							
 
						
					 
					
						2020-11-16 17:13:58 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9c4dcc5ea3 
					 
					
						
						
							
							Merge branch 'master' into develop  
						
						
						
						
							
						
					 
					
						2020-11-16 16:34:57 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a1063ddbb9 
					 
					
						
						
							
							Update options and simplify  
						
						
						
						
							
						
					 
					
						2020-11-13 04:11:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						18ef8056ec 
					 
					
						
						
							
							Hide Shared Memory  
						
						
						
						
							
						
					 
					
						2020-11-13 04:10:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1c673977fa 
					 
					
						
						
							
							Must ask for COMMMS_THREADS  
						
						
						
						
							
						
					 
					
						2020-11-13 03:59:36 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e9bc748828 
					 
					
						
						
							
							Useful GPU machine benchmark for GDR, used to shake out Booster at Juelich - see Slack earlyaccess channel
						
						
						
						
							
						
					 
					
						2020-11-13 03:58:34 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f48156529b 
					 
					
						
						
							
							Work on 2,2,2,8 ranks  
						
						
						
						
							
						
					 
					
						2020-11-13 03:57:58 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d05ce01809 
					 
					
						
						
							
							TOFU behaviour now optional THREAD_MULTIPLE or THREAD_SERIALIZED  
						
						
						
						
							
						
					 
					
						2020-11-13 03:52:19 +01:00