neo 
							
						 
					 
					
						
						
							
						
						57feda4328 
					 
					
						
						
							
							Completed implementation of new Grid_simd classes  
						
						... 
						
						
						
						Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested 
						
						
					 
					
						2015-05-22 17:33:15 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						f8d8958884 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master'  
						
						... 
						
						
						
						Conflicts:
	lib/simd/Grid_vector_types.h
	tests/Makefile.am 
						
						
					 
					
						2015-05-20 17:32:46 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						e529210f43 
					 
					
						
						
							
							Implemented all SSE4 functions.  
						
						... 
						
						
						
						A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK. 
						
						
					 
					
						2015-05-20 17:22:40 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2d8b5a8191 
					 
					
						
						
							
							Optimisation...  
						
						
						
						
					 
					
						2015-05-19 15:50:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3fe7275332 
					 
					
						
						
							
							Merged  
						
						... 
						
						
						
						Merge branch 'master' of https://github.com/coppolachan/Grid  into coppolachan-master
Conflicts:
	lib/simd/Grid_vector_types.h 
						
						
					 
					
						2015-05-19 15:05:07 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						ee8cf77071 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2015-05-19 14:55:26 +01:00 
						 
				 
			
				
					
						
							
							
								azusayamaguchi 
							
						 
					 
					
						
						
							
						
						c8c74e591f 
					 
					
						
						
							
							Add messages to get the number of threads for openmp  
						
						
						
						
					 
					
						2015-05-19 14:54:42 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a6e1ea216d 
					 
					
						
						
							
							Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,  
						
						... 
						
						
						
						not even SU(3) for now) gauge field. Convergence history is correctly indepdendent of decomposition
on 1,2,4,8,16 mpi tasks.
Found a couple of simd bugs which required fixed and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe Mooee schur type stuff in the wilson dop. 
						
						
					 
					
						2015-05-19 13:57:35 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						7fb3221d47 
					 
					
						
						
							
							Partial implementation of the vector types SIMD  
						
						... 
						
						
						
						Implementing SSE4 now
A systematic series of tests must be written. 
						
						
					 
					
						2015-05-19 17:21:17 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						639fd05239 
					 
					
						
						
							
							Added check of mpfr and gmp at configure time  
						
						... 
						
						
						
						It generates automatically the linker flags or complains if not found. 
						
						
					 
					
						2015-05-19 13:54:55 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						d6887beead 
					 
					
						
						
							
							Merging with upstream  
						
						
						
						
					 
					
						2015-05-19 13:36:03 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						30494bd96d 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/coppolachan/Grid  into coppolachan-master  
						
						... 
						
						
						
						Conflicts:
	lib/algorithms/approx/bigfloat.h 
						
						
					 
					
						2015-05-18 16:34:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						193fd5532f 
					 
					
						
						
							
							Remez tested  
						
						
						
						
					 
					
						2015-05-18 12:09:25 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						fa1dc5e448 
					 
					
						
						
							
							Minor modification to the configure.ac  
						
						... 
						
						
						
						Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed 
						
						
					 
					
						2015-05-18 17:15:14 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						6d2accba7b 
					 
					
						
						
							
							Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.  
						
						
						
						
					 
					
						2015-05-18 16:48:14 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1887c77498 
					 
					
						
						
							
							Getting closer to having a wilson solver... introducing a first and untested  
						
						... 
						
						
						
						cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape. 
						
						
					 
					
						2015-05-18 07:47:05 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6eb8dfd902 
					 
					
						
						
							
							Working towards solvers  
						
						
						
						
					 
					
						2015-05-17 00:19:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e841395dfd 
					 
					
						
						
							
							Updating preparing for solvers etc..  
						
						
						
						
					 
					
						2015-05-16 23:35:08 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						53260e7a39 
					 
					
						
						
							
							Typoo xifed  
						
						
						
						
					 
					
						2015-05-16 05:49:32 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a900790b44 
					 
					
						
						
							
							Update Grid_lattice_trace.h  
						
						
						
						
					 
					
						2015-05-16 04:40:28 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b731bf6976 
					 
					
						
						
							
							Pretty syntax  
						
						
						
						
					 
					
						2015-05-16 04:37:26 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5f8b82b90c 
					 
					
						
						
							
							Optimisation and syntax pretty  
						
						
						
						
					 
					
						2015-05-16 04:36:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						afda459886 
					 
					
						
						
							
							strong inline  
						
						
						
						
					 
					
						2015-05-16 04:33:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f43589369a 
					 
					
						
						
							
							Compile options tweak  
						
						
						
						
					 
					
						2015-05-15 12:33:18 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c99922b591 
					 
					
						
						
							
							Out of source compile now working  
						
						
						
						
					 
					
						2015-05-15 12:21:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3ed30169ea 
					 
					
						
						
							
							clang++ 3.4/5/7 compile happy for AVX and SSE  
						
						... 
						
						
						
						icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153  
						
						
					 
					
						2015-05-15 11:52:11 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						180b06d7e3 
					 
					
						
						
							
							GCC and ICPC complained on more careful typeing  
						
						
						
						
					 
					
						2015-05-15 11:50:44 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3bd376853c 
					 
					
						
						
							
							Move platform dependent out to Grid_simd.h  
						
						
						
						
					 
					
						2015-05-15 11:50:00 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6bba16ccf7 
					 
					
						
						
							
							ngo store  
						
						
						
						
					 
					
						2015-05-15 11:49:39 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e8efa6320e 
					 
					
						
						
							
							Parallel for replace  
						
						
						
						
					 
					
						2015-05-15 11:48:04 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e3b61bdfce 
					 
					
						
						
							
							Forces inlining upon icpc  
						
						
						
						
					 
					
						2015-05-15 11:43:49 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						86b9d24b62 
					 
					
						
						
							
							Force inlining upon icpc  
						
						
						
						
					 
					
						2015-05-15 11:43:20 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3e3a8dc0c0 
					 
					
						
						
							
							More elegant enable_if  
						
						
						
						
					 
					
						2015-05-15 11:42:51 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						4350c1e0f7 
					 
					
						
						
							
							More elegant to do boolean logic inside the enable_if construct  
						
						... 
						
						
						
						Should have done that from the beginning and should move this into
a global edit 
						
						
					 
					
						2015-05-15 11:42:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8c59605e05 
					 
					
						
						
							
							Force inlining on ICPC because inline apparently is not enoguh  
						
						
						
						
					 
					
						2015-05-15 11:41:31 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e59b6a805c 
					 
					
						
						
							
							strong_inline forces ICPC to do it.  
						
						
						
						
					 
					
						2015-05-15 11:40:59 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5d8303e94d 
					 
					
						
						
							
							Force strong_inline to force ipcc's hand  
						
						
						
						
					 
					
						2015-05-15 11:40:31 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1339a7f8b0 
					 
					
						
						
							
							Switch to strong_inline macro to force icpc's hand  
						
						
						
						
					 
					
						2015-05-15 11:40:00 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e58cc72fe5 
					 
					
						
						
							
							Promote to strong inline to force ICPC's hand. Annoying.  
						
						
						
						
					 
					
						2015-05-15 11:39:25 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						074430af0d 
					 
					
						
						
							
							Formatting change  
						
						
						
						
					 
					
						2015-05-15 11:38:54 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						873110d482 
					 
					
						
						
							
							Filed bug report Bug 66153 on GCC-5.  
						
						... 
						
						
						
						https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153  
					
						2015-05-15 11:38:04 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						4cee0e8653 
					 
					
						
						
							
							Silly formatting change  
						
						
						
						
					 
					
						2015-05-15 11:37:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6b2a786779 
					 
					
						
						
							
							gcc doesn't like collapse(2) for some reason I can't figure  
						
						
						
						
					 
					
						2015-05-15 11:36:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						cf27f22dc0 
					 
					
						
						
							
							ICPC and GCC5 fixes  
						
						
						
						
					 
					
						2015-05-15 11:35:02 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						40192841a4 
					 
					
						
						
							
							Using boolean logic inside enable_if is more elegant  
						
						
						
						
					 
					
						2015-05-15 11:32:45 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1771f97551 
					 
					
						
						
							
							Key of mm_malloc.h  
						
						
						
						
					 
					
						2015-05-15 11:32:11 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2eaf73e8b3 
					 
					
						
						
							
							strong inline required to force icpc  
						
						
						
						
					 
					
						2015-05-15 11:31:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						43bdbb5080 
					 
					
						
						
							
							Linear op added  
						
						
						
						
					 
					
						2015-05-13 11:25:34 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						7f3ae64a31 
					 
					
						
						
							
							OMP dslash working  
						
						
						
						
					 
					
						2015-05-13 10:59:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d388b831b4 
					 
					
						
						
							
							cout IO for all types  
						
						
						
						
					 
					
						2015-05-13 09:24:10 +01:00