Guido Cossu 
							
						 
					 
					
						
						
							
						
						9cb90f714e 
					 
					
						
						
							
							Merge remote-tracking branch 'origin/develop' into temporary-smearing  
						
						
						
						
					 
					
						2016-07-04 17:28:40 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						2daffdf95d 
					 
					
						
						
							
							Tested smeared WilsonRatio action, accepts  
						
						
						
						
					 
					
						2016-07-04 16:17:28 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						149f826601 
					 
					
						
						
							
							Tested smearing for Nf2 WilsonFermionAction, non EO: accepts  
						
						
						
						
					 
					
						2016-07-04 16:09:19 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						cd8ee27080 
					 
					
						
						
							
							Simple change in iGamma for smearing  
						
						
						
						
					 
					
						2016-07-04 16:02:57 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						0fa66e8f3c 
					 
					
						
						
							
							Debugged smearing for EOWilson, accepts  
						
						
						
						
					 
					
						2016-07-04 15:35:37 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						8dd099267d 
					 
					
						
						
							
							Corrected a bug in the Expression Templates (acso and asin were wrong)  
						
						
						
						
					 
					
						2016-07-03 12:28:25 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						1a6d65c6a4 
					 
					
						
						
							
							Converted set_uw and set_fj to all complex functions  
						
						
						
						
					 
					
						2016-07-03 10:27:43 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						092fa0d8da 
					 
					
						
						
							
							Debugged set_fj,  
						
						... 
						
						
						
						to be fixed: BUG in imag() 
						
						
					 
					
						2016-07-01 16:06:20 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						680645f849 
					 
					
						
						
							
							Merge branch 'release/v0.5.0'  
						
						
						
						
					 
					
						2016-06-30 15:15:03 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						712b9a3489 
					 
					
						
						
							
							Asm only for avx512  
						
						
						
						
					 
					
						2016-06-30 14:35:02 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						bdaa5b1767 
					 
					
						
						
							
							Updated to have perfect prefetching for the s-vectorised kernel with any cache blocking.  
						
						
						
						
					 
					
						2016-06-30 14:35:02 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						8fcefc021a 
					 
					
						
						
							
							Improved the prefetching when using cache blocking codes  
						
						
						
						
					 
					
						2016-06-30 14:35:02 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						05c884a62a 
					 
					
						
						
							
							Prefetch change  
						
						
						
						
					 
					
						2016-06-30 14:35:01 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						2d8bb4c594 
					 
					
						
						
							
							Tweaks  
						
						
						
						
					 
					
						2016-06-30 14:35:01 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						6d58cb2a68 
					 
					
						
						
							
							Enable reordering of the loops in the assembler for cache friendly.  
						
						... 
						
						
						
						This gets in the way of L2 prefetching however. Do next next link in stencil
prefetching. 
						
						
					 
					
						2016-06-30 14:35:01 -07:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						565e9329ba 
					 
					
						
						
							
							Changed the colouring classes  
						
						
						
						
					 
					
						2016-06-30 16:51:03 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						5e02392f9c 
					 
					
						
						
							
							Fixed compilation error for benchmark_dwf  
						
						... 
						
						
						
						Some parts were assuming floating point precision 
						
						
					 
					
						2016-06-20 12:30:51 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						87418e7df1 
					 
					
						
						
							
							Slightly faster prefetching perf.  
						
						
						
						
					 
					
						2016-06-13 02:32:52 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						55f65b81b5 
					 
					
						
						
							
							Improvements to the assembler interface that let us move chunks of the  
						
						... 
						
						
						
						site and s loop into the kernels. This will save on function call overhead and
guarantee L2 prefetching strategy is right since OMP can't distribute the
sub-chunks of work. 
						
						
					 
					
						2016-06-09 01:12:36 -07:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						d9408893b3 
					 
					
						
						
							
							Prefetching in the normal kernel implementation.  
						
						
						
						
					 
					
						2016-06-08 05:43:48 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						8ac021de73 
					 
					
						
						
							
							Added a test an fixed it for red black precon Ls innermost vectorised DWF  
						
						
						
						
					 
					
						2016-06-07 13:16:56 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						e503ef5590 
					 
					
						
						
							
							Cleaned up  
						
						
						
						
					 
					
						2016-06-07 00:11:36 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						a7682b0060 
					 
					
						
						
							
							Only instantiate the one routine to avoid duplicate symbol under g++5/MacOS  
						
						
						
						
					 
					
						2016-06-06 23:48:21 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						53d06046b0 
					 
					
						
						
							
							Compiling updates for KNL  
						
						
						
						
					 
					
						2016-06-03 03:47:54 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						139cc5f1ae 
					 
					
						
						
							
							Large change with KNL preparation  
						
						
						
						
					 
					
						2016-06-03 03:24:26 -07:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						c698b16d75 
					 
					
						
						
							
							function to generate Chroma-style gamma matrix products  
						
						
						
						
					 
					
						2016-05-01 18:30:35 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						5341977948 
					 
					
						
						
							
							IMCI fixes. Thought I had committed these. The "real" disambiguation  
						
						... 
						
						
						
						between std::real and Grid::real shouldn't have been necessary and I don't
know why only the icpc v16.0 on babbage hits it.
May need a longer term rename of Grid::real or some careful EnableIf work. 
						
						
					 
					
						2016-04-30 03:34:16 -07:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						f6c53e5039 
					 
					
						
						
							
							Merge commit '1e554350acae0e67fa7177ed0db9d4f684a54af2'  
						
						
						
						
					 
					
						2016-04-30 00:17:52 -07:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						6aa000176f 
					 
					
						
						
							
							Fermion <-> Propagator functions  
						
						
						
						
					 
					
						2016-04-30 00:14:33 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1e554350ac 
					 
					
						
						
							
							The threaded coms didn't agree with GCC. Suprised, and looks like GCC bug.  
						
						
						
						
					 
					
						2016-04-29 16:49:18 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						c79ea0dcef 
					 
					
						
						
							
							Fixingn IMCI  
						
						
						
						
					 
					
						2016-04-22 21:52:54 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						8fd8bc25e9 
					 
					
						
						
							
							simd 5th dim with rotation  
						
						
						
						
					 
					
						2016-04-19 15:39:00 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						ba427abde9 
					 
					
						
						
							
							simd 5d  
						
						
						
						
					 
					
						2016-04-19 15:38:39 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						9b6ab6db16 
					 
					
						
						
							
							simd in 5th dimension support  
						
						
						
						
					 
					
						2016-04-19 15:38:01 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						806a83d38b 
					 
					
						
						
							
							simd in fifth dim support for dwf  
						
						
						
						
					 
					
						2016-04-19 15:36:19 -07:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						339be37dba 
					 
					
						
						
							
							Debugging smeared HMC  
						
						
						
						
					 
					
						2016-04-13 17:00:14 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						a87b744621 
					 
					
						
						
							
							HMC runs but does not accept with smearing on  
						
						
						
						
					 
					
						2016-04-07 16:45:11 +09:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						b1192a8908 
					 
					
						
						
							
							Benchmark_zmm added  
						
						
						
						
					 
					
						2016-04-06 03:00:07 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						e8dddb1596 
					 
					
						
						
							
							Adding extra benchmark  
						
						
						
						
					 
					
						2016-04-06 10:32:54 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						97d0d56bcb 
					 
					
						
						
							
							Debugging Smearing routines (set_fj)  
						
						
						
						
					 
					
						2016-04-06 17:58:43 +09:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						7c7ea35ffb 
					 
					
						
						
							
							Putting the Traceless Antihermitian part outside the deriv in pseudofermion actions  
						
						
						
						
					 
					
						2016-04-05 16:28:09 +09:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						4b1cf580e0 
					 
					
						
						
							
							Debugging the Smearing routines  
						
						
						
						
					 
					
						2016-04-05 16:19:30 +09:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						e67fc2be18 
					 
					
						
						
							
							Adding a trial for openmp overhead minimisation  
						
						
						
						
					 
					
						2016-03-31 16:00:37 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						8052556275 
					 
					
						
						
							
							Cleaning up the single/double kernel implementation switch  
						
						
						
						
					 
					
						2016-03-31 14:51:32 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						60d965f79e 
					 
					
						
						
							
							AVX512 improvements; sigfpe trapping too  
						
						
						
						
					 
					
						2016-03-30 08:42:34 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1ecbf9794d 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2016-03-30 08:37:55 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						c77b7ee897 
					 
					
						
						
							
							AddSub based alternate SU3 routine  
						
						
						
						
					 
					
						2016-03-28 17:55:22 -06:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1e355a51e1 
					 
					
						
						
							
							Interface change  
						
						
						
						
					 
					
						2016-03-27 23:46:55 -07:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						21abaf7e91 
					 
					
						
						
							
							Gamma sign change  
						
						
						
						
					 
					
						2016-03-28 00:35:45 -06:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						165bffc2e7 
					 
					
						
						
							
							Avx512 changes for assembler kernels  
						
						
						
						
					 
					
						2016-03-26 22:25:45 -06:00