Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						5c936d88a0 
					 
					
						
						
							
							Merge branch 'feature/staggered-comms-compute' of https://github.com/paboyle/Grid into feature/staggered-comms-compute
						
						
						
						
					 
					
						2018-04-26 10:18:37 +01:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						2cbb72a81c 
					 
					
						
						
							
							Provide info if EE term is trivial (m^2 factor)  
						
						
						
						
						Better timing in staggered 4d case 
						
						
					 
					
						2018-04-26 10:10:07 +01:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						31d83ee046 
					 
					
						
						
							
							Enable special treatment of constEE cases  
						
						
						
						
					 
					
						2018-04-26 10:08:46 +01:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						80302e95a8 
					 
					
						
						
							
							MILC Interface  
						
						
						
						
					 
					
						2018-03-08 15:34:03 +00:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						b938202081 
					 
					
						
						
							
							Overlapped Comm for Wilson DhopInternal  
						
						
						
						
					 
					
						2018-03-07 14:08:43 +00:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						0f468e2179 
					 
					
						
						
							
							OverlappedComm for Staggered 5D and 4D.  
						
						
						
						
					 
					
						2018-02-22 12:50:09 +00:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						97b9c6f03d 
					 
					
						
						
							
							No option for interior/exterior split of asm kernels since different directions get interleaved  
						
						
						
						
					 
					
						2018-01-22 11:04:19 +00:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						63982819c6 
					 
					
						
						
							
							No option to overlap comms and compute for asm implementation since different directions are interleaved  
						
						
						
						
						in the kernels; introducing an if/else structure would be too painful
						
						
					 
					
						2018-01-22 11:03:39 +00:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						24162c9ead 
					 
					
						
						
							
							Staggered overlap comms compute
						
						
						
						
					 
					
						2018-01-09 13:02:52 +00:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						bf58557fb1 
					 
					
						
						
							
							Block compressed Lanczos  
						
						
						
						
					 
					
						2017-10-10 14:15:11 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						d54807b8c0 
					 
					
						
						
							
							MPIT works with split grid now  
						
						
						
						
					 
					
						2017-10-02 23:14:56 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						946a8671b9 
					 
					
						
						
							
							Merge pull request #129 from djm2131/feature/eofa
						
						
						
						
						Add support for DWF with the exact one flavor algorithm 
						
						
					 
					
						2017-09-21 10:15:21 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						bfb68e6f02 
					 
					
						
						
							
							Merge pull request #130 from giltirn/gparity-handunroll
						
						
						
						
						Gparity handunroll 
						
						
					 
					
						2017-09-21 10:11:00 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						59bd1fe21b 
					 
					
						
						
							
							Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel  
						
						
						
						
					 
					
						2017-08-29 13:07:37 -07:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						74af885d4e 
					 
					
						
						
							
							Removed some no-longer-needed associated with G-parity hand unrolled kernel  
						
						
						
						
					 
					
						2017-08-29 09:50:37 -04:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						80c5bce5bb 
					 
					
						
						
							
							Merge branch 'develop' into feature/multi-communicator  
						
						
						
						
					 
					
						2017-08-25 20:21:26 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						f68b5de9c8 
					 
					
						
						
							
							No compile fix on Clang  
						
						
						
						
					 
					
						2017-08-25 19:35:21 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						f365a83fae 
					 
					
						
						
							
							In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions  
						
						
						
						
					 
					
						2017-08-25 14:24:11 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c289699d9a 
					 
					
						
						
							
							updated from cambridge mpi3 shakeout  
						
						
						
						
					 
					
						2017-08-25 11:41:01 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c3b1263e75 
					 
					
						
						
							
							Benchmark prep  
						
						
						
						
					 
					
						2017-08-25 09:25:54 +01:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						34a9aeb331 
					 
					
						
						
							
							Reduced number of if-statement evaluations in G-parity unrolled kernel  
						
						
						
						
					 
					
						2017-08-24 13:53:50 -07:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						ce5df177ee 
					 
					
						
						
							
							Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl  
						
						
						
						
					 
					
						2017-08-23 15:05:22 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						a0bb8e5b46 
					 
					
						
						
							
							Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity  
						
						
						
						
					 
					
						2017-08-23 14:44:40 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						46f88e6d72 
					 
					
						
						
							
							G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary  
						
						
						
						
					 
					
						2017-08-23 13:21:10 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						dd8f1ea189 
					 
					
						
						
							
							Vectorized Mobius EOFA Dperp + shift operation  
						
						
						
						
					 
					
						2017-08-23 13:17:26 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						b61835c1a5 
					 
					
						
						
							
							Added inplace version of intrinsic G-parity twist to hand-unrolled kernel  
						
						
						
						
					 
					
						2017-08-23 12:33:48 -04:00 
						 
				 
			
				
					
						
							
							
								Azusa Yamaguchi 
							
						 
					 
					
						
						
							
						
						d9cd4f0273 
					 
					
						
						
							
							Staggered multinode block cg debugged. Missing global sum.  
						
						
						
						
						Code stalls and resumes on KNL at Cambridge. Curious. CG iterations take 23 ms each, followed by 3200 ms pauses. Mean bandwidth is reported as 200 MB/s, and comms are dominant in the report. However, the timing behaviour suggests it is *bursty*. Could it be swapping to disk?
						
						
					 
					
						2017-08-23 15:07:18 +01:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						459f70e8d4 
					 
					
						
						
							
							Check-in of working Mobius EOFA class and tests  
						
						
						
						
					 
					
						2017-08-22 22:38:30 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						061e48fd73 
					 
					
						
						
							
							Replaced slow unpack-repack in G-parity BC twist with intrinsics version  
						
						
						
						
					 
					
						2017-08-22 18:12:12 -04:00 
						 
				 
			
				
					
						
							
							
								Christopher Kelly 
							
						 
					 
					
						
						
							
						
						ab50145001 
					 
					
						
						
							
							Implemented first, unoptimized version of hand-unrolled G-parity kernels  
						
						
						
						
						Improved Test_gparity 
						
						
					 
					
						2017-08-22 17:12:25 -04:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						a446d95c33 
					 
					
						
						
							
							Trying to pass TeamCity and Travis  
						
						
						
						
					 
					
						2017-08-20 01:10:50 +01:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						9d45fca8bc 
					 
					
						
						
							
							Implement MobiusEOFAFermioncache.cc  
						
						
						
						
					 
					
						2017-08-17 23:45:36 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						ac9e6b63c0 
					 
					
						
						
							
							More re-import of Mobius EOFA  
						
						
						
						
					 
					
						2017-08-17 19:28:53 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						e140b3f802 
					 
					
						
						
							
							Beginning to re-import Mobius EOFA  
						
						
						
						
					 
					
						2017-08-16 23:36:23 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						d9d3d30cc7 
					 
					
						
						
							
							Minor clean-up  
						
						
						
						
					 
					
						2017-08-16 20:57:51 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						6d0786ff9d 
					 
					
						
						
							
							Typo fixes and check-in of G-parity action test for DWF  
						
						
						
						
					 
					
						2017-08-15 22:47:00 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						b7f93aeb4d 
					 
					
						
						
							
							Change CayleyFermion5D::SetCoefficientsInternal to virtual to allow overriding in derived EOFA classes  
						
						
						
						
					 
					
						2017-08-15 14:18:51 -04:00 
						 
				 
			
				
					
						
							
							
								David Murphy 
							
						 
					 
					
						
						
							
						
						202a7fe900 
					 
					
						
						
							
							Re-import DWF and abstract base EOFA fermion classes and tests  
						
						
						
						
					 
					
						2017-08-15 13:36:08 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						14d53e1c9e 
					 
					
						
						
							
							Threaded MPI calls patches  
						
						
						
						
					 
					
						2017-07-29 13:08:10 -04:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						54e94360ad 
					 
					
						
						
							
							Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit  
						
						
						
						
					 
					
						2017-06-24 23:10:24 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						7587df831a 
					 
					
						
						
							
							Merge branch 'develop' into feature/hadrons  
						
						
						
						
						# Conflicts:
#	lib/qcd/action/scalar/ScalarImpl.h 
						
						
					 
					
						2017-06-20 15:50:39 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						46879e1658 
					 
					
						
						
							
							Complex defined in Impl even for gauge.  
						
						
						
						
					 
					
						2017-06-18 00:11:45 +01:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						0503c028be 
					 
					
						
						
							
							Merge branch 'feature/qed-fvol' into feature/hadrons (non-trivial conflicts on scalar Impl)  
						
						
						
						
						# Conflicts:
#	configure.ac
#	lib/qcd/action/scalar/Scalar.h 
						
						
					 
					
						2017-06-05 16:37:47 -05:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						9c12c37aaf 
					 
					
						
						
							
							Confirming the fix on the complex boundary conditions  
						
						
						
						
					 
					
						2017-05-09 08:41:29 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						529e78d43f 
					 
					
						
						
							
							Restart the v0.7.0 release  
						
						
						
						
					 
					
						2017-05-08 18:20:04 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						2439999ec8 
					 
					
						
						
							
							Warning elimination; drop to -O2 on bad G++ versions
						
						
						
						
					 
					
						2017-05-06 14:44:49 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						1d96f662e3 
					 
					
						
						
							
							Fixed 4d fermion gparity force. Put strong tests on make check force tests  
						
						
						
						
					 
					
						2017-05-06 00:46:31 +01:00 
						 
				 
			
				
					
						
							
							
								Guido Cossu 
							
						 
					 
					
						
						
							
						
						20999c1370 
					 
					
						
						
							
							Merge branch 'develop' into feature/hmc_generalise  
						
						
						
						
					 
					
						2017-05-05 12:47:17 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						78ef10e60f 
					 
					
						
						
							
							Mobius force improvement  
						
						
						
						
					 
					
						2017-05-04 19:53:21 +01:00 
						 
				 
			
				
					
						
							
							
								paboyle 
							
						 
					 
					
						
						
							
						
						90f6bc16bb 
					 
					
						
						
							
							No compile clang fix  
						
						
						
						
					 
					
						2017-05-04 12:15:06 +01:00