Peter Boyle 
							
						 
					 
					
						
						
							
						
						f9b070d64d 
					 
					
						
						
							
							Reorganise of file naming  
						
						
						
						
					 
					
						2015-06-03 12:47:05 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6cb38dc5dc 
					 
					
						
						
							
							Overlap Wilson Cayley tanh & zolo  
						
						
						
						
					 
					
						2015-06-03 11:26:54 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2b083ca987 
					 
					
						
						
							
							CG test written and passes i.e. converges with small true residual  
						
						... 
						
						
						
						in RedBlack MpcDagMpc, Unprec MdagM and Schur red black solver for
each of.
DomainWallFermion
MobiusFermion
MobiusZolotarevFermion
ScaledShamirFermion
ScaledShamirZolotarevFermion 
						
						
					 
					
						2015-06-03 10:54:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c659c76053 
					 
					
						
						
							
							Scaled Shamir and Scaled Shamir Zolotarev aliases for special cases of Mobius.  
						
						
						
						
					 
					
						2015-06-03 09:51:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						68e26140ee 
					 
					
						
						
							
							Mobius Caley form, Mobius Zolotarev operators. Pass Even Odd vs unprec test and hermiticity checks  
						
						... 
						
						
						
						in tests/Grid_any_evenodd.cc; will work on inversion tests shortly. 
						
						
					 
					
						2015-06-03 09:36:26 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						494d2b8b61 
					 
					
						
						
							
							Reorg; moving prec/unprec/schur CG for Wilson and DWF into tests as these are really tests and not benchmarks  
						
						... 
						
						
						
						(no performance reports, only convergence test). 
						
						
					 
					
						2015-06-02 17:25:26 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						66d997e031 
					 
					
						
						
							
							Large scale change to support 5d fermion formulations.  
						
						... 
						
						
						
						Have 5d replicated wilson with 4d gauge working and matrix regressing
to Ls copies of wilson. 
						
						
					 
					
						2015-05-31 15:09:02 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						9ad6d0c65f 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master'  
						
						... 
						
						
						
						Conflicts:
	lib/math/Grid_math_tensors.h
	lib/simd/Grid_vector_types.h 
						
						
					 
					
						2015-05-26 13:14:06 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						377083e6ae 
					 
					
						
						
							
							checked performance of new vector libaries.  
						
						... 
						
						
						
						Added check for c++11 support on the configure.ac 
						
						
					 
					
						2015-05-26 12:02:54 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d7f5172860 
					 
					
						
						
							
							Schur complement based red-black inversion working  
						
						
						
						
					 
					
						2015-05-25 13:47:12 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						41ba13f951 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2015-05-23 09:36:08 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						57feda4328 
					 
					
						
						
							
							Completed implementation of new Grid_simd classes  
						
						... 
						
						
						
						Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested 
						
						
					 
					
						2015-05-22 17:33:15 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						f8d8958884 
					 
					
						
						
							
							Merge remote-tracking branch 'upstream/master'  
						
						... 
						
						
						
						Conflicts:
	lib/simd/Grid_vector_types.h
	tests/Makefile.am 
						
						
					 
					
						2015-05-20 17:32:46 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						e529210f43 
					 
					
						
						
							
							Implemented all SSE4 functions.  
						
						... 
						
						
						
						A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK. 
						
						
					 
					
						2015-05-20 17:22:40 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8fdff33b3a 
					 
					
						
						
							
							Merging in  
						
						... 
						
						
						
						Merge branch 'master' of https://github.com/paboyle/Grid  
						
						
					 
					
						2015-05-19 21:30:13 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						efc0d1e0b9 
					 
					
						
						
							
							Reworking to keep intel compiler happy  
						
						
						
						
					 
					
						2015-05-19 21:29:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3fe7275332 
					 
					
						
						
							
							Merged  
						
						... 
						
						
						
						Merge branch 'master' of https://github.com/coppolachan/Grid  into coppolachan-master
Conflicts:
	lib/simd/Grid_vector_types.h 
						
						
					 
					
						2015-05-19 15:05:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a6e1ea216d 
					 
					
						
						
							
							Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,  
						
						... 
						
						
						
						not even SU(3) for now) gauge field. Convergence history is correctly indepdendent of decomposition
on 1,2,4,8,16 mpi tasks.
Found a couple of simd bugs which required fixed and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe Mooee schur type stuff in the wilson dop. 
						
						
					 
					
						2015-05-19 13:57:35 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						7fb3221d47 
					 
					
						
						
							
							Partial implementation of the vector types SIMD  
						
						... 
						
						
						
						Implementing SSE4 now
A systematic series of tests must be written. 
						
						
					 
					
						2015-05-19 17:21:17 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						30494bd96d 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/coppolachan/Grid  into coppolachan-master  
						
						... 
						
						
						
						Conflicts:
	lib/algorithms/approx/bigfloat.h 
						
						
					 
					
						2015-05-18 16:34:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						193fd5532f 
					 
					
						
						
							
							Remez tested  
						
						
						
						
					 
					
						2015-05-18 12:09:25 +01:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						fa1dc5e448 
					 
					
						
						
							
							Minor modification to the configure.ac  
						
						... 
						
						
						
						Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed 
						
						
					 
					
						2015-05-18 17:15:14 +09:00 
						 
				 
			
				
					
						
							
							
								neo 
							
						 
					 
					
						
						
							
						
						6d2accba7b 
					 
					
						
						
							
							Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.  
						
						
						
						
					 
					
						2015-05-18 16:48:14 +09:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1887c77498 
					 
					
						
						
							
							Getting closer to having a wilson solver... introducing a first and untested  
						
						... 
						
						
						
						cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape. 
						
						
					 
					
						2015-05-18 07:47:05 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c99922b591 
					 
					
						
						
							
							Out of source compile now working  
						
						
						
						
					 
					
						2015-05-15 12:21:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						457cc0d5a3 
					 
					
						
						
							
							RNG test  
						
						
						
						
					 
					
						2015-05-13 09:24:30 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d388b831b4 
					 
					
						
						
							
							cout IO for all types  
						
						
						
						
					 
					
						2015-05-13 09:24:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						52174da232 
					 
					
						
						
							
							Enhanced SIMD interfacing  
						
						
						
						
					 
					
						2015-05-12 20:41:44 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						65c91eae64 
					 
					
						
						
							
							Threading support rework.  
						
						... 
						
						
						
						Placed parallel pragmas as macros; implemented deterministic thread reduction in style of
BFM. 
						
						
					 
					
						2015-05-12 07:51:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8b765be2b1 
					 
					
						
						
							
							Moving some things around for pretty  
						
						
						
						
					 
					
						2015-05-11 19:09:49 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a411b48a91 
					 
					
						
						
							
							Adding a better controlled threading class, preparing to  
						
						... 
						
						
						
						force in deterministic reduction. 
						
						
					 
					
						2015-05-11 18:59:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						ebcb87abe1 
					 
					
						
						
							
							Got command line args working  
						
						
						
						
					 
					
						2015-05-11 14:36:48 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1576b7837a 
					 
					
						
						
							
							CML parse  
						
						
						
						
					 
					
						2015-05-11 12:56:27 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						fa5779537c 
					 
					
						
						
							
							Command line args and a general clean up  
						
						
						
						
					 
					
						2015-05-11 12:43:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						242e447bc5 
					 
					
						
						
							
							Lots of changes required to compile for MIC under ICPC  
						
						
						
						
					 
					
						2015-05-10 23:29:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						352bccf6ca 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						... 
						
						
						
						Conflicts:
	lib/qcd/Grid_qcd_wilson_dop.cc 
						
						
					 
					
						2015-05-10 15:37:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5fcf42cb30 
					 
					
						
						
							
							Hack; must bring norm2 into the unary operator list.  
						
						... 
						
						
						
						ET's are still incomplete. 
						
						
					 
					
						2015-05-10 15:30:29 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e647cf0459 
					 
					
						
						
							
							Default to single node. Move to command line args.  
						
						
						
						
					 
					
						2015-05-10 15:27:38 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8919bf9e0a 
					 
					
						
						
							
							Single node default. Should expose this as command line args, but haven't sorted out  
						
						... 
						
						
						
						Grid_initialize to handle this. Should put this on the TODO list. 
						
						
					 
					
						2015-05-10 15:26:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5415180676 
					 
					
						
						
							
							Wilson perf improvements with Gauge prefetching  
						
						
						
						
					 
					
						2015-05-06 06:37:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						7b0dd6c5d6 
					 
					
						
						
							
							Cleaned up for Linux  
						
						
						
						
					 
					
						2015-05-05 22:09:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9d93d1e6d4 
					 
					
						
						
							
							Comms and memory benchmarks added  
						
						
						
						
					 
					
						2015-05-03 09:44:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6a39089a43 
					 
					
						
						
							
							Starting a benchmarking sub dir  
						
						
						
						
					 
					
						2015-05-02 17:52:36 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c0ead94791 
					 
					
						
						
							
							Integrated Lebesgue code and been playing with alternate implementations of the wilson dop without  
						
						... 
						
						
						
						any particular success in increasing the performance. 
						
						
					 
					
						2015-04-30 16:39:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d8ffa09e3b 
					 
					
						
						
							
							Benchmark wilson dhop now; 14.6GF on one core, not as fast as SU(3)xSU(3) [23GF] but still not too shabby.  
						
						... 
						
						
						
						Disassembling output shows ugly sequences in the permute sector. Could comparatively benchmark with and without
the if-else structure to see how much I'm losing.
Drops to 9GF as it falls out of cache. Moving to Lebesgue ordering should help there. Substantive progress. 
						
						
					 
					
						2015-04-29 06:50:18 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						dcc23faa4a 
					 
					
						
						
							
							Fixed the stencil sector and Wilson now agrees between stencil based implementation  
						
						... 
						
						
						
						and the cshift based implementation. Managed to reduce the volume of code in this
sector a little, but consolidation would be good, perhaps taking common
logic out into simple helper functions 
						
						
					 
					
						2015-04-29 06:23:56 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b0485894b3 
					 
					
						
						
							
							Shaken out stencil to the point where I think wilson dslash is correct.  
						
						... 
						
						
						
						Need to audit code carefully, consolidate between stencil and cshift,
and then benchmark and optimise. 
						
						
					 
					
						2015-04-28 08:11:59 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						0b7d389258 
					 
					
						
						
							
							Reworking CSHIFT and Stencil. Implementing Wilson and discovered rework is required  
						
						
						
						
					 
					
						2015-04-27 13:45:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						35cfef2129 
					 
					
						
						
							
							Big updates with progress towards wilson matrix  
						
						
						
						
					 
					
						2015-04-26 15:51:09 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2d8cf9e456 
					 
					
						
						
							
							Added two spinor functionality required to support the Wilson hopping term.  
						
						
						
						
					 
					
						2015-04-25 12:54:06 +01:00