Peter Boyle
							
						 
					 | 
					
						
						
							
						
						e58cc72fe5
					 | 
					
						
						
							
							Promote to strong inline to force ICPC's hand. Annoying.
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:39:25 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						074430af0d
					 | 
					
						
						
							
							Formatting change
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:38:54 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						873110d482
					 | 
					
						
						
							
							Filed bug report 66153 against GCC 5.
						
						
						
						
						
						
						
						https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153 
						
						
					 | 
					
						2015-05-15 11:38:04 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						4cee0e8653
					 | 
					
						
						
							
							Silly formatting change
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:37:07 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						6b2a786779
					 | 
					
						
						
							
							GCC doesn't like collapse(2), for some reason I can't figure out
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:36:22 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						cf27f22dc0
					 | 
					
						
						
							
							ICPC and GCC5 fixes
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:35:02 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						40192841a4
					 | 
					
						
						
							
							Using boolean logic inside enable_if is more elegant
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:32:45 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						1771f97551
					 | 
					
						
						
							
							Key off mm_malloc.h
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:32:11 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						2eaf73e8b3
					 | 
					
						
						
							
							strong inline required to force icpc
						
						
						
						
						
						
					 | 
					
						2015-05-15 11:31:41 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						43bdbb5080
					 | 
					
						
						
							
							Linear op added
						
						
						
						
						
						
					 | 
					
						2015-05-13 11:25:34 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						7f3ae64a31
					 | 
					
						
						
							
							OMP dslash working
						
						
						
						
						
						
					 | 
					
						2015-05-13 10:59:22 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						457cc0d5a3
					 | 
					
						
						
							
							RNG test
						
						
						
						
						
						
					 | 
					
						2015-05-13 09:24:30 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						d388b831b4
					 | 
					
						
						
							
							cout IO for all types
						
						
						
						
						
						
					 | 
					
						2015-05-13 09:24:10 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						b4a570477c
					 | 
					
						
						
							
							I have made Cshift work successfully with OpenMP threading in every routine. collapse(2) is now working under clang-omp++.
						
						
					 | 
					
						2015-05-13 00:31:00 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						52174da232
					 | 
					
						
						
							
							Enhanced SIMD interfacing
						
						
						
						
						
						
					 | 
					
						2015-05-12 20:41:44 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						65c91eae64
					 | 
					
						
						
							
							Threading support rework.
						
						
						
						
						
						
						
						Placed parallel pragmas in macros; implemented deterministic thread reduction in the style of BFM.
						
						
					 | 
					
						2015-05-12 07:51:41 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						8b765be2b1
					 | 
					
						
						
							
							Moving some things around for pretty
						
						
						
						
						
						
					 | 
					
						2015-05-11 19:09:49 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						a411b48a91
					 | 
					
						
						
							
							Adding a better-controlled threading class, preparing to force in deterministic reduction.
						
						
					 | 
					
						2015-05-11 18:59:03 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						ebcb87abe1
					 | 
					
						
						
							
							Got command line args working
						
						
						
						
						
						
					 | 
					
						2015-05-11 14:36:48 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								paboyle
							
						 
					 | 
					
						
						
							
						
						1576b7837a
					 | 
					
						
						
							
							CML parse
						
						
						
						
						
						
					 | 
					
						2015-05-11 12:56:27 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								paboyle
							
						 
					 | 
					
						
						
							
						
						fa5779537c
					 | 
					
						
						
							
							Command line args and a general clean up
						
						
						
						
						
						
					 | 
					
						2015-05-11 12:43:10 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								paboyle
							
						 
					 | 
					
						
						
							
						
						5548fd6928
					 | 
					
						
						
							
							Updated to do list
						
						
						
						
						
						
					 | 
					
						2015-05-11 09:44:50 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						242e447bc5
					 | 
					
						
						
							
							Lots of changes required to compile for MIC under ICPC
						
						
						
						
						
						
					 | 
					
						2015-05-10 23:29:21 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						352bccf6ca
					 | 
					
						
						
							
							Merge branch 'master' of https://github.com/paboyle/Grid
						
						
						
						
						
						
						
						Conflicts:
	lib/qcd/Grid_qcd_wilson_dop.cc 
						
						
					 | 
					
						2015-05-10 15:37:47 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						c946e77143
					 | 
					
						
						
							
							Expression template hack
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:35:30 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						015fbee772
					 | 
					
						
						
							
							Expression template engine
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:34:20 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						8215893152
					 | 
					
						
						
							
							Updated TODO list
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:32:56 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						5fcf42cb30
					 | 
					
						
						
							
							Hack; must bring norm2 into the unary operator list.
						
						
						
						
						
						
						
						ETs are still incomplete.
						
						
					 | 
					
						2015-05-10 15:30:29 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						e647cf0459
					 | 
					
						
						
							
							Default to single node. Move to command line args.
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:27:38 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						8919bf9e0a
					 | 
					
						
						
							
							Single node default. Should expose this as command line args, but haven't sorted out Grid_initialize to handle this. Should put this on the TODO list.
						
						
					 | 
					
						2015-05-10 15:26:06 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						133493dc79
					 | 
					
						
						
							
							Small tweak to enable benchmarking to suppress gauge field bandwidth as a test.
						
						
						
						
						
						
						
						This is a short term hack while I benchmark. 
						
						
					 | 
					
						2015-05-10 15:25:23 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						58d32a4d0e
					 | 
					
						
						
							
							Assertion should never hit, but did due to a bug
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:24:37 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						6bb17502f9
					 | 
					
						
						
							
							Moving operator stuff into a separate file so that we can switch on/off replacement with expression templates
						
						
					 | 
					
						2015-05-10 15:23:49 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						8299bc39ea
					 | 
					
						
						
							
							Fixing breakage in the Comms non compile
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:23:09 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						7f04b85368
					 | 
					
						
						
							
							Bringing in expression templates for faster vector loops
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:22:31 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						a115f3b086
					 | 
					
						
						
							
							ET-ready benchmark with bytes counted assuming loop interchange
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:18:04 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						27c2d13968
					 | 
					
						
						
							
							Updated todo list
						
						
						
						
						
						
					 | 
					
						2015-05-10 15:13:50 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						5415180676
					 | 
					
						
						
							
							Wilson perf improvements with Gauge prefetching
						
						
						
						
						
						
					 | 
					
						2015-05-06 06:37:21 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						7b0dd6c5d6
					 | 
					
						
						
							
							Cleaned up for Linux
						
						
						
						
						
						
					 | 
					
						2015-05-05 22:09:22 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						cb4b82b09f
					 | 
					
						
						
							
							streaming store cases
						
						
						
						
						
						
					 | 
					
						2015-05-05 18:14:09 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						cd990ba13d
					 | 
					
						
						
							
							Streaming store option
						
						
						
						
						
						
					 | 
					
						2015-05-05 18:13:06 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						249165d1b2
					 | 
					
						
						
							
							Added streaming stores
						
						
						
						
						
						
					 | 
					
						2015-05-05 18:09:28 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						b720222d98
					 | 
					
						
						
							
							Updated bandwidth test
						
						
						
						
						
						
					 | 
					
						2015-05-05 18:08:53 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						0e8415de1b
					 | 
					
						
						
							
							Added a makefile
						
						
						
						
						
						
					 | 
					
						2015-05-05 17:56:42 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						2b46ad38e2
					 | 
					
						
						
							
							Back to vector for now; cost of init loop is clear in the a*x + y loop in memory benchmark and must move to better container class.
						
						
					 | 
					
						2015-05-03 09:48:13 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						9d93d1e6d4
					 | 
					
						
						
							
							Comms and memory benchmarks added
						
						
						
						
						
						
					 | 
					
						2015-05-03 09:44:47 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						253362f978
					 | 
					
						
						
							
							Added a comms benchmark
						
						
						
						
						
						
					 | 
					
						2015-05-02 23:51:43 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						ea52562527
					 | 
					
						
						
							
							Added a comms benchmark
						
						
						
						
						
						
					 | 
					
						2015-05-02 23:42:30 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						6a39089a43
					 | 
					
						
						
							
							Starting a benchmarking sub dir
						
						
						
						
						
						
					 | 
					
						2015-05-02 17:52:36 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 | 
				
			
				
					
						
							
							
								 
								Peter Boyle
							
						 
					 | 
					
						
						
							
						
						bdf18941a2
					 | 
					
						
						
							
							Improving the byte swap support for portability
						
						
						
						
						
						
					 | 
					
						2015-05-01 10:57:33 +01:00 | 
					
					
						
						
						
							
							
							
							
							
							
						
					 |