Peter Boyle 
							
						 
					 
					
						
						
							
						
						bfb1cd36e2 
					 
					
						
						
							
							Strip out the dslash kernel implementation  
						
						
						
						
					 
					
						2015-05-26 19:55:18 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						840754dd42 
					 
					
						
						
							
							Hand unrolled version of dslash in a separate class.  
						
						
						
						
						Useful for comparison; raises the Intel compiler from 9 GFlop/s to 17.5 GFlop/s
						on an Ivy Bridge core. Raises Clang from 14.5 to 17.5 GFlop/s.
						
						
					 
					
						2015-05-26 19:54:03 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						500f6ed0c5 
					 
					
						
						
							
							More cleanup of Grid_simd.h  
						
						
						
						
					 
					
						2015-05-26 13:54:34 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						4dbaa389c8 
					 
					
						
						
							
							Cleaning up simd files  
						
						
						
						
					 
					
						2015-05-26 13:31:10 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						48cc816136 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master'  
						
						
						
						
						Conflicts:
	lib/math/Grid_math_tensors.h
	lib/simd/Grid_vector_types.h 
						
						
					 
					
						2015-05-26 13:14:06 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						1a24801246 
					 
					
						
						
							
							Checked performance of new vector libraries.
						
						
						
						
						Added check for C++11 support in configure.ac
						
						
					 
					
						2015-05-26 12:02:54 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						37721572e7 
					 
					
						
						
							
							Makefile update  
						
						
						
						
					 
					
						2015-05-25 14:43:08 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						489b1b9633 
					 
					
						
						
							
							Schur complement based red-black inversion working  
						
						
						
						
					 
					
						2015-05-25 13:47:12 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						ea3240ad55 
					 
					
						
						
							
							Better EO support letting Schur solver work  
						
						
						
						
					 
					
						2015-05-25 13:46:28 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						956e728b40 
					 
					
						
						
							
							Mostly cosmetic
						
						
						
						
					 
					
						2015-05-25 13:45:32 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						94d679c4e6 
					 
					
						
						
							
							Better checkerboard tracking.  
						
						
						
						
					 
					
						2015-05-25 13:45:08 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						616f871735 
					 
					
						
						
							
							move constants into red black  
						
						
						
						
					 
					
						2015-05-25 13:44:35 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						624c0ac3ef 
					 
					
						
						
							
							Updates now that the Schur red-black solver is working
						
						
						
						
					 
					
						2015-05-25 13:43:58 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						ac99832d21 
					 
					
						
						
							
							Herm op  
						
						
						
						
					 
					
						2015-05-25 13:42:36 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d30c013721 
					 
					
						
						
							
							red black fix  
						
						
						
						
					 
					
						2015-05-25 13:42:12 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5cf285bce9 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2015-05-23 09:36:08 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						613a73b1b6 
					 
					
						
						
							
							Added  
						
						
						
						
					 
					
						2015-05-23 09:36:01 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f681baa9cd 
					 
					
						
						
							
							Extra targets  
						
						
						
						
					 
					
						2015-05-23 09:35:37 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d21411ead9 
					 
					
						
						
							
							More targets  
						
						
						
						
					 
					
						2015-05-23 09:34:50 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						64fcbd0387 
					 
					
						
						
							
							Improving even-odd sector; a lot of work, and a thorough cleanup was required
						
						
						
						
					 
					
						2015-05-23 09:34:16 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						bef9bf0d38 
					 
					
						
						
							
							Rely on default constructors  
						
						
						
						
					 
					
						2015-05-23 09:33:42 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						eadfb5be67 
					 
					
						
						
							
							Better pragma use  
						
						
						
						
					 
					
						2015-05-23 09:32:37 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						33737ef57a 
					 
					
						
						
							
							Cosmetic  
						
						
						
						
					 
					
						2015-05-23 09:31:15 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						32c3f16f95 
					 
					
						
						
							
							Iterator required  
						
						
						
						
					 
					
						2015-05-23 09:30:28 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						9e29ac6549 
					 
					
						
						
							
							Completed implementation of new Grid_simd classes  
						
						
						
						
						Tested performance for SSE4: OK.
						AVX1/2 and AVX512 as yet untested.
						
						
					 
					
						2015-05-22 17:33:15 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						24c68a697b 
					 
					
						
						
							
							Merge pull request  #7  from coppolachan/master  
						
						
						
						
						Added full support for SSE4 
						
						
					 
					
						2015-05-22 05:58:59 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9601890549 
					 
					
						
						
							
							Streaming store option ifdef  
						
						
						
						
					 
					
						2015-05-21 06:47:05 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1559dd4adc 
					 
					
						
						
							
							Compile-time selection of whether we do the streaming store copy. Relies on Clang++ eliminating object copies,
						and other compilers do not necessarily cope.
						
						
					 
					
						2015-05-21 06:39:00 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						22bfbd0f8d 
					 
					
						
						
							
							adding two routines containing only a single operation so I can easily see the assembly dump  
						
						
						
						
					 
					
						2015-05-21 06:37:46 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3a441c3e94 
					 
					
						
						
							
							Minor change  
						
						
						
						
					 
					
						2015-05-21 06:37:20 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d4ca8647dc 
					 
					
						
						
							
							useful to dump assembler  
						
						
						
						
					 
					
						2015-05-21 06:36:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d0d41b8bce 
					 
					
						
						
							
							Didn't like a print statement  
						
						
						
						
					 
					
						2015-05-21 06:36:15 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						341096dce8 
					 
					
						
						
							
							better comms benchmarking  
						
						
						
						
					 
					
						2015-05-21 06:35:46 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						34960ca50c 
					 
					
						
						
							
							Unroll pragma abstraction  
						
						
						
						
					 
					
						2015-05-21 06:34:33 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						d03c4e5901 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master'  
						
						
						
						
						Conflicts:
	lib/simd/Grid_vector_types.h
	tests/Makefile.am 
						
						
					 
					
						2015-05-20 17:32:46 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						cf7be0e461 
					 
					
						
						
							
							Implemented all SSE4 functions.  
						
						
						
						
						A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK. 
						
						
					 
					
						2015-05-20 17:22:40 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						221902a882 
					 
					
						
						
							
							Merging in  
						
						
						
						
						Merge branch 'master' of https://github.com/paboyle/Grid  
						
						
					 
					
						2015-05-19 21:30:13 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d3931111fb 
					 
					
						
						
							
							Build a simple kernel to compare intel compiler and clang in simple environment  
						
						
						
						
					 
					
						2015-05-19 21:29:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a21036e69a 
					 
					
						
						
							
							Reworking to keep intel compiler happy  
						
						
						
						
					 
					
						2015-05-19 21:29:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8220794c44 
					 
					
						
						
							
							Optimisation...  
						
						
						
						
					 
					
						2015-05-19 15:50:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						7571a6b021 
					 
					
						
						
							
							Merge branch 'coppolachan-master'  
						
						
						
						
					 
					
						2015-05-19 15:05:32 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						fde7f8d6b9 
					 
					
						
						
							
							Merged  
						
						
						
						
						Merge branch 'master' of https://github.com/coppolachan/Grid  into coppolachan-master
Conflicts:
	lib/simd/Grid_vector_types.h 
						
						
					 
					
						2015-05-19 15:05:07 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						2d2da8364f 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2015-05-19 14:55:26 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						91f29d4a68 
					 
					
						
						
							
							Add messages to get the number of threads for openmp  
						
						
						
						
					 
					
						2015-05-19 14:54:42 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						4dba8522a1 
					 
					
						
						
							
							Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,  
						
						
						
						
						not even SU(3) for now) gauge field. Convergence history is correctly independent of decomposition
on 1, 2, 4, 8, 16 MPI tasks.
Found a couple of SIMD bugs which required fixing, and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe, Mooee Schur-type stuff in the Wilson dop.
						
						
					 
					
						2015-05-19 13:57:35 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						74e91cd925 
					 
					
						
						
							
							Partial implementation of the vector types SIMD  
						
						
						
						
						Implementing SSE4 now
A systematic series of tests must be written. 
						
						
					 
					
						2015-05-19 17:21:17 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						baa382f055 
					 
					
						
						
							
							Added check of mpfr and gmp at configure time  
						
						
						
						
						It automatically generates the linker flags, or complains if the libraries are not found.
						
						
					 
					
						2015-05-19 13:54:55 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						7ad705066d 
					 
					
						
						
							
							Merging with upstream  
						
						
						
						
					 
					
						2015-05-19 13:36:03 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						05d862782f 
					 
					
						
						
							
							Merge branch 'coppolachan-master'  
						
						
						
						
					 
					
						2015-05-18 16:36:58 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3f17423d36 
					 
					
						
						
							
							lib/algorithms/approx/bigfloat.h  
						
						
						
						
					 
					
						2015-05-18 16:35:48 +01:00