Guido Cossu 
							
						 
					 
					
						
						
							
						
						1184ed29ae 
					 
					
						
						
							
							Merge pull request  #124  from nmeyer-ur/feature/arm-neon  
						
						... 
						
						
						
						Added integer reduce functionality 
						
						
							
						
					 
					
						2017-09-08 10:54:35 +02:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						203c7bf6fa 
					 
					
						
						
							
							Merge branch 'hotfix/dirac-ITT-fix' into develop  
						
						
						
						
							
						
					 
					
						2017-09-05 15:08:51 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						c709883f3f 
					 
					
						
						
							
							Merge branch 'hotfix/dirac-ITT-fix'  
						
						
						
						
							
 
						
					 
					
						2017-09-05 15:08:16 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						aed5de4d50 
					 
					
						
						
							
							Patching macos compile  
						
						
						
						
							
						
					 
					
						2017-09-05 15:07:07 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						ba27cc6571 
					 
					
						
						
							
							Mac os happiness  
						
						
						
						
							
						
					 
					
						2017-09-05 15:00:16 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						d856327250 
					 
					
						
						
							
							Merge branch 'release/dirac-ITT' into develop  
						
						
						
						
							
						
					 
					
						2017-09-05 14:56:12 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						d75369cb56 
					 
					
						
						
							
							Merge branch 'release/dirac-ITT'  
						
						
						
						
							
 
						
					 
					
						2017-09-05 14:55:54 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						bf973d0d56 
					 
					
						
						
							
							SHM complete  
						
						
						
						
							
						
					 
					
						2017-09-05 14:30:29 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						837bf8a5be 
					 
					
						
						
							
							Updating to control the SHM allocation scheme under configure time options  
						
						
						
						
							
						
					 
					
						2017-09-05 12:51:02 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c05b2199f6 
					 
					
						
						
							
							Improvements to huge memory  
						
						
						
						
							
						
					 
					
						2017-09-04 10:41:21 -04:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						a5fe07c077 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
							
						
					 
					
						2017-09-04 14:10:15 +01:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						b83b2b1415 
					 
					
						
						
							
							Stability improvement to BCG. Force m_rr hermitian beyond rounding.  
						
						
						
						
							
						
					 
					
						2017-09-04 14:09:47 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						b331be9101 
					 
					
						
						
							
							Better reporting  
						
						
						
						
							
						
					 
					
						2017-08-31 11:32:57 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						49c20a9fa8 
					 
					
						
						
							
							Patch to reporting  
						
						
						
						
							
						
					 
					
						2017-08-31 11:32:21 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						7359df3501 
					 
					
						
						
							
							Full reporting for benchmark; save robustness factor  
						
						
						
						
							
						
					 
					
						2017-08-31 10:42:35 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						59bd1fe21b 
					 
					
						
						
							
							Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel  
						
						
						
						
							
						
					 
					
						2017-08-29 13:07:37 -07:00 
						 
				 
			
				
					
						
							
							
								Nils Meyer 
							
						 
					 
					
						
						
							
						
						4e907fef2c 
					 
					
						
						
							
							Merge remote-tracking branch 'grid/develop' into feature/arm-neon  
						
						
						
						
							
						
					 
					
						2017-08-29 17:47:36 +02:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						67888b657f 
					 
					
						
						
							
							Merge branch 'gparity-handunroll' of  https://github.com/giltirn/Grid  into gparity-handunroll  
						
						
						
						
							
						
					 
					
						2017-08-29 09:52:05 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						74af885d4e 
					 
					
						
						
							
							Removed some no-longer-needed associated with G-parity hand unrolled kernel  
						
						
						
						
							
						
					 
					
						2017-08-29 09:50:37 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						d36d2fb40d 
					 
					
						
						
							
							Added ability to override default Ls in Benchmark_dwf  
						
						
						
						
							
						
					 
					
						2017-08-28 06:53:56 -07:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5b9267e88d 
					 
					
						
						
							
							Cleaner comms benchmark treatment for one node runs  
						
						
						
						
							
						
					 
					
						2017-08-27 18:24:48 -04:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						15fd4003ef 
					 
					
						
						
							
							Improving presentation of results  
						
						
						
						
							
						
					 
					
						2017-08-27 13:46:02 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						4b4c2a715b 
					 
					
						
						
							
							fcntl.h needed  
						
						
						
						
							
						
					 
					
						2017-08-26 11:38:04 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						54a5e6c1d0 
					 
					
						
						
							
							Check if we get huge pages on linux. Larry Meadows piece of magic.  
						
						
						
						
							
						
					 
					
						2017-08-25 22:36:08 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						73aeca7dea 
					 
					
						
						
							
							Merge branch 'feature/multi-communicator' into develop  
						
						
						
						
							
						
					 
					
						2017-08-25 21:55:09 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						ad89abb018 
					 
					
						
						
							
							Fix  
						
						
						
						
							
						
					 
					
						2017-08-25 20:43:37 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						80c5bce5bb 
					 
					
						
						
							
							Merge branch 'develop' into feature/multi-communicator  
						
						
						
						
							
						
					 
					
						2017-08-25 20:21:26 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						f68b5de9c8 
					 
					
						
						
							
							No compile fix on Clang  
						
						
						
						
							
						
					 
					
						2017-08-25 19:35:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d0f3d525d5 
					 
					
						
						
							
							Optimal block size for KNL  
						
						
						
						
							
						
					 
					
						2017-08-25 19:33:54 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						f365a83fae 
					 
					
						
						
							
							In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions  
						
						
						
						
							
						
					 
					
						2017-08-25 14:24:11 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3a58217405 
					 
					
						
						
							
							Updated  
						
						
						
						
							
						
					 
					
						2017-08-25 14:29:53 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c289699d9a 
					 
					
						
						
							
							updated from cambridge mpi3 shakeout  
						
						
						
						
							
						
					 
					
						2017-08-25 11:41:01 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c3b1263e75 
					 
					
						
						
							
							Benchmark prep  
						
						
						
						
							
						
					 
					
						2017-08-25 09:25:54 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						34a9aeb331 
					 
					
						
						
							
							Reduced number of if-statement evaluations in G-parity unrolled kernel  
						
						
						
						
							
						
					 
					
						2017-08-24 13:53:50 -07:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						102ea9ae66 
					 
					
						
						
							
							CI update  
						
						
						
						
							
						
					 
					
						2017-08-24 18:17:09 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						5fa386ddc9 
					 
					
						
						
							
							FFT test compile fixed  
						
						
						
						
							
						
					 
					
						2017-08-24 10:17:52 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						edabb3577f 
					 
					
						
						
							
							Imported Benchmark_gparity  
						
						
						
						
							
						
					 
					
						2017-08-23 16:54:06 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						ce5df177ee 
					 
					
						
						
							
							Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl  
						
						
						
						
							
						
					 
					
						2017-08-23 15:05:22 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						a0bb8e5b46 
					 
					
						
						
							
							Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity  
						
						
						
						
							
						
					 
					
						2017-08-23 14:44:40 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						46f88e6d72 
					 
					
						
						
							
							G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary  
						
						
						
						
							
						
					 
					
						2017-08-23 13:21:10 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						dd8f1ea189 
					 
					
						
						
							
							Vectorized Mobius EOFA Dperp + shift operation  
						
						
						
						
							
						
					 
					
						2017-08-23 13:17:26 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						b61835c1a5 
					 
					
						
						
							
							Added inplace version of intrinsic G-parity twist to hand-unrolled kernel  
						
						
						
						
							
						
					 
					
						2017-08-23 12:33:48 -04:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						d9cd4f0273 
					 
					
						
						
							
							Staggered multinode block cg debugged. Missing global sum.  
						
						... 
						
						
						
						Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk? 
						
						
							
						
					 
					
						2017-08-23 15:07:18 +01:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						459f70e8d4 
					 
					
						
						
							
							Check-in of working Mobius EOFA class and tests  
						
						
						
						
							
						
					 
					
						2017-08-22 22:38:30 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						061e48fd73 
					 
					
						
						
							
							Replaced slow unpack-repack in G-parity BC twist with intrinsics version  
						
						
						
						
							
						
					 
					
						2017-08-22 18:12:12 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						ab50145001 
					 
					
						
						
							
							Implemented first, unoptimized version of hand-unrolled G-parity kernels  
						
						... 
						
						
						
						Improved Test_gparity 
						
						
							
						
					 
					
						2017-08-22 17:12:25 -04:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						b49bec0cec 
					 
					
						
						
							
							MAP_HUGETLB portability fix  
						
						
						
						
							
						
					 
					
						2017-08-20 03:08:54 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						ae56e556c6 
					 
					
						
						
							
							finalise issue on new OPA revert  
						
						
						
						
							
						
					 
					
						2017-08-20 02:53:12 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1cdf999668 
					 
					
						
						
							
							Moving multicommunicator into mpi3 also for threading  
						
						
						
						
							
						
					 
					
						2017-08-20 02:39:10 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						11062fb686 
					 
					
						
						
							
							Comms none fail fix  
						
						
						
						
							
						
					 
					
						2017-08-20 01:37:07 +01:00