Peter Boyle 
							
						 
					 
					
						
						
							
						
						f761ab0f50 
					 
					
						
						
							
							Filed bug report Bug 66153 on GCC-5.  
						
						... 
						
						
						
						https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153  
					
						2015-05-15 11:38:04 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2a28cfb3a3 
					 
					
						
						
							
							Silly formatting change  
						
						
						
						
					 
					
						2015-05-15 11:37:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b00622302b 
					 
					
						
						
							
							gcc doesn't like collapse(2) for some reason I can't figure  
						
						
						
						
					 
					
						2015-05-15 11:36:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3057b2762a 
					 
					
						
						
							
							ICPC and GCC5 fixes  
						
						
						
						
					 
					
						2015-05-15 11:35:02 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						151a6f4e14 
					 
					
						
						
							
							Using boolean logic inside enable_if is more elegant  
						
						
						
						
					 
					
						2015-05-15 11:32:45 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a36c974f26 
					 
					
						
						
							
							Key of mm_malloc.h  
						
						
						
						
					 
					
						2015-05-15 11:32:11 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c0977dcfaa 
					 
					
						
						
							
							strong inline required to force icpc  
						
						
						
						
					 
					
						2015-05-15 11:31:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f1255197c2 
					 
					
						
						
							
							Linear op added  
						
						
						
						
					 
					
						2015-05-13 11:25:34 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e179828662 
					 
					
						
						
							
							OMP dslash working  
						
						
						
						
					 
					
						2015-05-13 10:59:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a108d5d3b0 
					 
					
						
						
							
							cout IO for all types  
						
						
						
						
					 
					
						2015-05-13 09:24:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						48f425d31c 
					 
					
						
						
							
							I have made the Cshift work successfully with open mp threading in  
						
						... 
						
						
						
						every routine. Collapse(2) is now working under clang-omp++. 
						
						
					 
					
						2015-05-13 00:31:00 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6cec662ac5 
					 
					
						
						
							
							Enhanced SIMD interfacing  
						
						
						
						
					 
					
						2015-05-12 20:41:44 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6103c29ee3 
					 
					
						
						
							
							Threading support rework.  
						
						... 
						
						
						
						Placed parallel pragmas as macros; implemented deterministic thread reduction in style of
BFM. 
						
						
					 
					
						2015-05-12 07:51:41 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b1d2c60d07 
					 
					
						
						
							
							Moving some things around for pretty  
						
						
						
						
					 
					
						2015-05-11 19:09:49 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						22d384b07d 
					 
					
						
						
							
							Adding a better controlled threading class, preparing to  
						
						... 
						
						
						
						force in deterministic reduction. 
						
						
					 
					
						2015-05-11 18:59:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f5dcca7b1b 
					 
					
						
						
							
							Got command line args working  
						
						
						
						
					 
					
						2015-05-11 14:36:48 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						379943abf5 
					 
					
						
						
							
							Command line args and a general clean up  
						
						
						
						
					 
					
						2015-05-11 12:43:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5555a852be 
					 
					
						
						
							
							Lots of changes required to compile for MIC under ICPC  
						
						
						
						
					 
					
						2015-05-10 23:29:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						48b9692845 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						... 
						
						
						
						Conflicts:
	lib/qcd/Grid_qcd_wilson_dop.cc 
						
						
					 
					
						2015-05-10 15:37:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b802abc83f 
					 
					
						
						
							
							Expression template hack  
						
						
						
						
					 
					
						2015-05-10 15:35:30 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						14591c72d6 
					 
					
						
						
							
							Expression template engin  
						
						
						
						
					 
					
						2015-05-10 15:34:20 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						02ae26d091 
					 
					
						
						
							
							Small tweak to enable benchmarking to suppress gauge field bandwidth as a test.  
						
						... 
						
						
						
						This is a short term hack while I benchmark. 
						
						
					 
					
						2015-05-10 15:25:23 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2ffd941d67 
					 
					
						
						
							
							Assertion should never hit, but did due to a bug  
						
						
						
						
					 
					
						2015-05-10 15:24:37 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						ca554f661b 
					 
					
						
						
							
							Moving operator stuff into separate file so that we can switch on/off replacement with  
						
						... 
						
						
						
						expression templates 
						
						
					 
					
						2015-05-10 15:23:49 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						29be76f958 
					 
					
						
						
							
							Fixing breakage in the Comms non compile  
						
						
						
						
					 
					
						2015-05-10 15:23:09 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e3acb36de6 
					 
					
						
						
							
							Bringing expression templates for faster vector loops  
						
						
						
						
					 
					
						2015-05-10 15:22:31 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						55ccb8ccf4 
					 
					
						
						
							
							Wilson perf improvements with Gauge prefetching  
						
						
						
						
					 
					
						2015-05-06 06:37:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						35d949cc17 
					 
					
						
						
							
							Cleaned up for Linux  
						
						
						
						
					 
					
						2015-05-05 22:09:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b9d16a7191 
					 
					
						
						
							
							streaming store cases  
						
						
						
						
					 
					
						2015-05-05 18:14:09 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						07d57b6d55 
					 
					
						
						
							
							Streaming store option  
						
						
						
						
					 
					
						2015-05-05 18:13:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5ebc7a1756 
					 
					
						
						
							
							Added streaming stores  
						
						
						
						
					 
					
						2015-05-05 18:09:28 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						aeda7b923d 
					 
					
						
						
							
							Back to vector for now; cost of init loop is clear in the a*x + y  
						
						... 
						
						
						
						loop in memory benchmark and must move to better container class. 
						
						
					 
					
						2015-05-03 09:48:13 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						193860dbc8 
					 
					
						
						
							
							Comms and memory benchmarks added  
						
						
						
						
					 
					
						2015-05-03 09:44:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f663be2a6c 
					 
					
						
						
							
							Added a comms benchmark  
						
						
						
						
					 
					
						2015-05-02 23:42:30 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						31fd146cc0 
					 
					
						
						
							
							Improving the byte swap support for portability  
						
						
						
						
					 
					
						2015-05-01 10:57:33 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c770f96be7 
					 
					
						
						
							
							Merge branch 'master' of  https://github.com/paboyle/Grid  
						
						
						
						
					 
					
						2015-04-30 16:40:13 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a98c01c86a 
					 
					
						
						
							
							Integrated Lebesgue code and been playing with alternate implementations of the wilson dop without  
						
						... 
						
						
						
						any particular success in increasing the performance. 
						
						
					 
					
						2015-04-30 16:39:06 +01:00 
						 
				 
			
				
					
						
							
							
								mspraggs 
							
						 
					 
					
						
						
							
						
						6f05404cb8 
					 
					
						
						
							
							Added <map> include to GridNerscIO.h  
						
						... 
						
						
						
						Adding this allows clang to compile Grid to completion. 
						
						
					 
					
						2015-04-29 23:44:03 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c72db6c6f6 
					 
					
						
						
							
							Fixed the stencil sector and Wilson now agrees between stencil based implementation  
						
						... 
						
						
						
						and the cshift based implementation. Managed to reduce the volume of code in this
sector a little, but consolidation would be good, perhaps taking common
logic out into simple helper functions 
						
						
					 
					
						2015-04-29 06:23:56 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						25d523c0f4 
					 
					
						
						
							
							Shaken out stencil to the point where I think wilson dslash is correct.  
						
						... 
						
						
						
						Need to audit code carefully, consolidate between stencil and cshift,
and then benchmark and optimise. 
						
						
					 
					
						2015-04-28 08:11:59 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f159495a9d 
					 
					
						
						
							
							Reworking CSHIFT and Stencil. Implementing Wilson and discovered rework is required  
						
						
						
						
					 
					
						2015-04-27 13:45:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						94f728bee4 
					 
					
						
						
							
							Big updates with progress towards wilson matrix  
						
						
						
						
					 
					
						2015-04-26 15:51:09 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						51f0da7b93 
					 
					
						
						
							
							Starting the implementation of wilson; incomplete and committing non-functional code which  
						
						... 
						
						
						
						is not yet included from elsewhere or linked to the build system. 
						
						
					 
					
						2015-04-25 14:33:02 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c5fa18eb20 
					 
					
						
						
							
							Added two spinor functionality required to support the Wilson hopping term.  
						
						
						
						
					 
					
						2015-04-25 12:54:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9ec3529864 
					 
					
						
						
							
							Improved the gamma quite a bit.  
						
						... 
						
						
						
						Serial rng's which are set on node zero and broadcaste 
						
						
					 
					
						2015-04-24 20:21:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						42eac283e2 
					 
					
						
						
							
							static names and enum list  
						
						
						
						
					 
					
						2015-04-24 19:12:14 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						38598190c3 
					 
					
						
						
							
							Vectors now too and right multiple of matrix with gamma  
						
						
						
						
					 
					
						2015-04-24 19:08:29 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2e275e1e65 
					 
					
						
						
							
							Removed summation  
						
						
						
						
					 
					
						2015-04-24 18:42:44 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						80463ecaea 
					 
					
						
						
							
							Cleared the code out from Grid_summation to lattice/Grid_lattice_transfer.h  
						
						
						
						
					 
					
						2015-04-24 18:41:34 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						128ad0999f 
					 
					
						
						
							
							Moved code from summation into transfer and reduction  
						
						
						
						
					 
					
						2015-04-24 18:40:44 +01:00