Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						164d3691db 
					 
					
						
						
							
							Staggered  
						
						
						
						
					 
					
						2016-11-01 14:24:22 +00:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						e8c3174ae2 
					 
					
						
						
							
							Small change in the defines  
						
						
						
						
					 
					
						2016-10-30 12:23:11 +00:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						9b066e94d0 
					 
					
						
						
							
							Compilation with both single and double precision  
						
						
						
						
					 
					
						2016-10-30 12:04:06 +00:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						e1042aef77 
					 
					
						
						
							
							First version of the doube prec for testing purposes  
						
						... 
						
						
						
						It does not compile single and double version at the same time 
						
						
					 
					
						2016-10-28 17:20:04 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						ca21003f01 
					 
					
						
						
							
							Merge branch 'feature/fft-opt' into feature/feynman-rules  
						
						... 
						
						
						
						# Conflicts:
#	lib/FFT.h
#	lib/qcd/action/fermion/WilsonFermion5D.h
#	tests/core/Test_fft.cc 
						
						
					 
					
						2016-10-26 18:44:47 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						c190221fd3 
					 
					
						
						
							
							Internal SHM comms in non-simd directions working  
						
						... 
						
						
						
						Need to fix simd directions 
						
						
					 
					
						2016-10-22 18:14:27 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						6a9eae6b6b 
					 
					
						
						
							
							Reporting improvements  
						
						
						
						
					 
					
						2016-10-21 13:36:18 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						bd6a228af6 
					 
					
						
						
							
							Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules  
						
						
						
						
					 
					
						2016-10-21 13:10:30 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						b58adc6a4b 
					 
					
						
						
							
							commVector  
						
						
						
						
					 
					
						2016-10-20 17:00:15 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						997fd882ff 
					 
					
						
						
							
							Merge branch 'develop' into feature/feynman-rules  
						
						... 
						
						
						
						# Conflicts:
#	lib/Threads.h
#	lib/qcd/action/fermion/WilsonFermion.cc
#	lib/qcd/action/fermion/WilsonFermion.h
#	lib/qcd/utils/SUn.h
#	lib/simd/Grid_avx.h
#	lib/simd/Intel512common.h 
						
						
					 
					
						2016-10-19 18:35:18 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						81f2aeaece 
					 
					
						
						
							
							KNL streaming stores, and KNL performance coutners  
						
						
						
						
					 
					
						2016-10-12 11:45:22 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						3619167d62 
					 
					
						
						
							
							Mass parameter  
						
						
						
						
					 
					
						2016-10-10 23:47:33 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						96f1d1b828 
					 
					
						
						
							
							Debugged Domain wall and Overlap feynman rules (infinite Ls, finite mass).  
						
						
						
						
					 
					
						2016-10-10 23:46:45 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						657e0a8f4d 
					 
					
						
						
							
							Mass parameter  
						
						
						
						
					 
					
						2016-10-10 23:46:10 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						616e7cd83e 
					 
					
						
						
							
							Mass parameter  
						
						
						
						
					 
					
						2016-10-10 23:45:48 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						6f26d2e8d4 
					 
					
						
						
							
							Overlap tree level feynman rule  
						
						
						
						
					 
					
						2016-10-10 23:45:18 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						c014574504 
					 
					
						
						
							
							A "please implement me" feynman rule. If this were abstract virtual it would  
						
						... 
						
						
						
						require/force implementation 
						
						
					 
					
						2016-10-10 23:44:00 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						d7ce164e6e 
					 
					
						
						
							
							Feynman rule for DWF  
						
						
						
						
					 
					
						2016-10-10 23:43:36 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						c0d5b99016 
					 
					
						
						
							
							Dminus  
						
						
						
						
					 
					
						2016-10-10 23:43:19 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						09ca32d678 
					 
					
						
						
							
							Dminus added for Cayley  
						
						
						
						
					 
					
						2016-10-10 23:42:55 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						b56c9ffa52 
					 
					
						
						
							
							Fix for AVXFMA  
						
						
						
						
					 
					
						2016-10-10 14:43:37 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						2e453dfbf5 
					 
					
						
						
							
							Added some instrumentation to benchmark the force computation  
						
						
						
						
					 
					
						2016-10-06 17:52:45 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						4089984431 
					 
					
						
						
							
							Timing hooks  
						
						
						
						
					 
					
						2016-10-06 09:25:12 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						c78bbd0f8c 
					 
					
						
						
							
							Fix ASM compilation  
						
						
						
						
					 
					
						2016-10-04 15:37:32 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						b6713ecb60 
					 
					
						
						
							
							Momentum space rules for Overlap, DWF untested to date  
						
						
						
						
					 
					
						2016-09-26 09:39:09 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						b6597b74e7 
					 
					
						
						
							
							Added support for the Two index Symmetric and Antisymmetric representations  
						
						... 
						
						
						
						Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations 
						
						
					 
					
						2016-09-22 14:17:37 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						b9c80318a2 
					 
					
						
						
							
							Merge branch 'develop' into feature/hirep  
						
						
						
						
					 
					
						2016-09-13 10:01:51 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						f76f281e58 
					 
					
						
						
							
							Cleaning files after fix  
						
						
						
						
					 
					
						2016-09-09 11:34:25 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						aa20cc8b52 
					 
					
						
						
							
							Fixing compilation error with AVX512 flag  
						
						
						
						
					 
					
						2016-09-09 02:58:52 -07:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						0fd179fb33 
					 
					
						
						
							
							Merge branch 'develop' into feature/hirep  
						
						
						
						
					 
					
						2016-09-01 12:59:53 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						b573d1f35a 
					 
					
						
						
							
							Wilson tree level added  
						
						
						
						
					 
					
						2016-08-31 00:27:04 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						0c1d7e4daf 
					 
					
						
						
							
							Mom space prop for Wilson action  
						
						
						
						
					 
					
						2016-08-31 00:26:36 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						02e983a0cd 
					 
					
						
						
							
							Momentum space prop and free prop convolution  
						
						
						
						
					 
					
						2016-08-31 00:26:02 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						fd5614738d 
					 
					
						
						
							
							Merge branch 'develop' into feature/hirep  
						
						
						
						
					 
					
						2016-08-30 18:21:36 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						4ab7dbfd57 
					 
					
						
						
							
							Instantiate  
						
						
						
						
					 
					
						2016-08-15 23:00:40 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						90e70790f3 
					 
					
						
						
							
							Feature for z-Mobius prep  
						
						
						
						
					 
					
						2016-08-15 22:31:29 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						089f0ab582 
					 
					
						
						
							
							Debugged HMC for Creutz relation  
						
						
						
						
					 
					
						2016-07-28 16:44:41 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						b93e18ed50 
					 
					
						
						
							
							Modified the Dirac Kernel class to compile with different number of colours  
						
						... 
						
						
						
						Added the general push_back functionality to accomodate for all defined representations
Compiles, not tested 
						
						
					 
					
						2016-07-18 16:36:28 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						9c77bb69a5 
					 
					
						
						
							
							Added all elements for Hirep HMC  
						
						... 
						
						
						
						TODO: Test and debug 
						
						
					 
					
						2016-07-18 12:05:23 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						fad5c675eb 
					 
					
						
						
							
							sign error on the 4d gparity force  
						
						
						
						
					 
					
						2016-07-16 01:51:56 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						f4dd5062d7 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2016-07-15 19:26:06 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						980ff18956 
					 
					
						
						
							
							Solving the instantiation no compile issue  
						
						
						
						
					 
					
						2016-07-15 17:19:44 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1a6c7204ac 
					 
					
						
						
							
							Disable instantiation; Use cache version instead  
						
						
						
						
					 
					
						2016-07-15 00:34:39 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						dfd714e1ef 
					 
					
						
						
							
							Multiple implementations for the 5d hopping terms, depending on cache friendly  
						
						... 
						
						
						
						ops and/or the 5th direction being vectorised
All use 4d redblack. 
						
						
					 
					
						2016-07-15 00:00:09 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						79a8ca1a62 
					 
					
						
						
							
							Rewrite for performance. Impl dependent instantiations give  
						
						... 
						
						
						
						4d linalg impls of the 5d hopping terms (and inverse)
Cache friendly loop orderings of the above
Dense matrix stored and apply to the above
-- Switch to Ls vectorised, and use dense matrix approach for the MooeeInv
   and rotate/shift of the Mooee M5D routines. 
						
						
					 
					
						2016-07-14 23:58:15 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						a307274c96 
					 
					
						
						
							
							Fermion impl rename for ls vectorised 5d approaches  
						
						
						
						
					 
					
						2016-07-14 23:56:13 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						3f2c44a5fe 
					 
					
						
						
							
							Updating the class to 5d selection based on impl type  
						
						
						
						
					 
					
						2016-07-14 23:55:26 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						48fb1cdc11 
					 
					
						
						
							
							Update domain 5d vectorised impl type, move the type over to 4d redblack with  
						
						... 
						
						
						
						the dense OO inverse 
						
						
					 
					
						2016-07-14 23:54:35 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						8a79e93cc2 
					 
					
						
						
							
							Rename the 5d domain wall fermion vectorised Ls impl class  
						
						
						
						
					 
					
						2016-07-14 23:53:00 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						adbc7c1188 
					 
					
						
						
							
							Adding files for multiple implementations (cache opt) and Ls vectorisation  
						
						... 
						
						
						
						of the 5D cayley form chiral fermions for the 5d matrix. With Ls entirely
in the vector direction, s-hopping terms involve rotations.
The serial dependence of the LDU inversion for Mobius and 4d even odd
checkerboarding is removed by simply applying Ls^2 operations (vectorised
many ways) as a dense matrix operation.
This should give similar throughput but high flops (non-compulsory flops)
but enable use of the KNL cache friendly kernels throughout the code.
Ls is still constrained to be a multiple of Nsimd, which is as much as 8 for AVX512
with single precision. 
						
						
					 
					
						2016-07-14 22:59:21 +01:00