Peter Boyle 
							
						 
					 
					
						
						
							
						
						bb89a82a07 
					 
					
						
						
							
							Staggered coalesced read  
						
						
						
						
					 
					
						2021-03-29 20:01:15 +02:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						2bb374daea 
					 
					
						
						
							
							hip-friendly  
						
						
						
						
					 
					
						2021-03-19 11:33:23 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						8bdadbadac 
					 
					
						
						
							
							Cold start  
						
						
						
						
					 
					
						2021-03-18 15:41:14 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						15c50a7442 
					 
					
						
						
							
							Explicitly instantiate the template function  
						
						
						
						
					 
					
						2021-03-18 15:40:42 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						49b0af2c95 
					 
					
						
						
							
							Update of tests to compile with the sRNG addition.  
						
						
						
						
						Audited the code conventions (again) with the CPS momentum denominator,
added antiperiodic in time to Test_mobius_force.cc, and
tested Test_dwf_gpforce.
Promoted these to test the full HMC Hamiltonian, tr P^2/2 + phidag MdagM phi,
with the same pdot and Udot as audited in Integrator.h etc.
With full comments and sources for factors. 
						
						
					 
					
						2021-03-18 09:10:02 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						9c2b37218a 
					 
					
						
						
							
							sRNG parameter added  
						
						
						
						
					 
					
						2021-03-18 06:24:11 -04:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						3c67d626ba 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-12 15:36:55 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						51f506553c 
					 
					
						
						
							
							Read out the local ID once, and store  
						
						
						
						
					 
					
						2021-03-12 15:33:04 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						226be84937 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-12 09:31:50 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						001814b442 
					 
					
						
						
							
							Updated to-do list. Start adding DDHMC work items  
						
						
						
						
					 
					
						2021-03-12 09:31:17 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						db3ac67506 
					 
					
						
						
							
							Update thread issue  
						
						
						
						
					 
					
						2021-03-12 14:55:07 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						da91a884ef 
					 
					
						
						
							
							Added guard for NVCC versions found to be buggy  
						
						
						
						
					 
					
						2021-03-11 23:54:53 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a71e6755e3 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-11 22:43:06 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						cd5891eecd 
					 
					
						
						
							
							Test that fails on CUDA 11.0  
						
						
						
						
					 
					
						2021-03-11 22:34:28 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						5bb7336f27 
					 
					
						
						
							
							Merge pull request  #347  from pjgeorg/fix-autotools-avx512  
						
						
						
						
						Fix inconsistent SIMD option AVX512
Thanks 
						
						
					 
					
						2021-03-11 16:29:07 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						ce1fc1f48a 
					 
					
						
						
							
							Possible fallback plan for Fionn's compiler bug in nvcc  
						
						
						
						
					 
					
						2021-03-11 22:20:53 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Georg 
							
						 
					 
					
						
						
							
						
						82402c6a7c 
					 
					
						
						
							
							Add simd option SKL for ICC  
						
						
						
						
					 
					
						2021-03-11 13:08:40 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Georg 
							
						 
					 
					
						
						
							
						
						d9c4afe5b7 
					 
					
						
						
							
							Fix inconsistent configure option AVX512  
						
						
						
						
						Before this change AVX512 enabled different instruction sets depending
on the compiler:
For Intel C++ Compiler Classic (ICC):
    AVX512F, AVX512CD, AVX512DQ, AVX512BW, AVX512VL
    i.e. Intel Xeon Skylake and newer
For Intel ICX, gcc, clang:
    AVX512F, AVX512CD, AVX512ER, AVX512PF
    i.e. Intel Xeon Phi x200/x205 (KNL/KNM)
With this commit AVX512 now only enables the common instruction sets
supported by all CPUs supporting any AVX-512 instruction set:
AVX512F and AVX512CD (called COMMON-AVX512 by icc) 
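
As a rough illustration of the distinction drawn in this commit message, the sketch below reports which AVX-512 subsets a translation unit was compiled for. It relies only on the predefined feature macros (`__AVX512F__`, `__AVX512CD__`, and so on) that GCC, Clang and the Intel compilers define when the matching instruction set is enabled. The helper names (`enabledAvx512Subsets`, `hasCommonAvx512`, `hasSkylakeAvx512`, `hasKnlAvx512`, `checkAvx512Consistency`) are hypothetical and are not part of Grid or its configure script.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper: list the AVX-512 subsets enabled at compile time,
// based on the compiler's predefined feature macros.
inline std::vector<std::string> enabledAvx512Subsets() {
  std::vector<std::string> subsets;
#ifdef __AVX512F__
  subsets.push_back("AVX512F");
#endif
#ifdef __AVX512CD__
  subsets.push_back("AVX512CD");
#endif
#ifdef __AVX512DQ__
  subsets.push_back("AVX512DQ");
#endif
#ifdef __AVX512BW__
  subsets.push_back("AVX512BW");
#endif
#ifdef __AVX512VL__
  subsets.push_back("AVX512VL");
#endif
#ifdef __AVX512ER__
  subsets.push_back("AVX512ER");
#endif
#ifdef __AVX512PF__
  subsets.push_back("AVX512PF");
#endif
  return subsets;
}

inline bool hasSubset(const std::vector<std::string>& subsets,
                      const std::string& name) {
  return std::find(subsets.begin(), subsets.end(), name) != subsets.end();
}

// The common set named in the commit message: AVX512F and AVX512CD.
inline bool hasCommonAvx512(const std::vector<std::string>& s) {
  return hasSubset(s, "AVX512F") && hasSubset(s, "AVX512CD");
}

// Skylake-style set: common plus DQ, BW and VL.
inline bool hasSkylakeAvx512(const std::vector<std::string>& s) {
  return hasCommonAvx512(s) && hasSubset(s, "AVX512DQ") &&
         hasSubset(s, "AVX512BW") && hasSubset(s, "AVX512VL");
}

// KNL/KNM-style set: common plus ER and PF.
inline bool hasKnlAvx512(const std::vector<std::string>& s) {
  return hasCommonAvx512(s) && hasSubset(s, "AVX512ER") &&
         hasSubset(s, "AVX512PF");
}

// Sanity checks: either extended set implies the common set.
inline void checkAvx512Consistency() {
  const std::vector<std::string> s = enabledAvx512Subsets();
  assert(s.size() <= 7);
  assert(!hasSkylakeAvx512(s) || hasCommonAvx512(s));
  assert(!hasKnlAvx512(s) || hasCommonAvx512(s));
}
```

Compiled without any AVX-512 flags, the list is empty and all three predicates return false. Which flags a given configure option actually passes is not shown here and would need to be checked against the build system itself.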
						
						
					 
					
						2021-03-11 12:58:49 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f786ff8d69 
					 
					
						
						
							
							Extend test from Fionn, fails on A100 apparently  
						
						
						
						
					 
					
						2021-03-10 14:32:06 -05:00 
						 
				 
			
				
					
						
							
							
								u61464 
							
						 
					 
					
						
						
							
						
						a651caed5f 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-10 06:23:51 -08:00 
						 
				 
			
				
					
						
							
							
								u61464 
							
						 
					 
					
						
						
							
						
						0e21adb3f6 
					 
					
						
						
							
							Gives 200GF/s on SyCL/DG1 8^4, doesn't uglify develop for other platforms too badly.  
						
						
						
						
						Easy to revert to cleaner, more idiomatic C++ code. There's a SYCL_HACK macro I will clean up later once dpcpp
evolves a central nervous system. 
						
						
					 
					
						2021-03-10 05:40:51 -08:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						58bf9b9e6d 
					 
					
						
						
							
							Clean up test  
						
						
						
						
					 
					
						2021-03-10 02:45:22 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2146eebb65 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-09 04:31:46 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						6a429ee6d3 
					 
					
						
						
							
							2D loop hits Nvidia 16-bit limit on large local volumes  
						
						
						
						
					 
					
						2021-03-09 04:31:10 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						4d1ea15c79 
					 
					
						
						
							
							More verbosity. The 16-bit limit on Grid.y, Grid.z is annoying  
						
						
						
						
					 
					
						2021-03-09 04:29:37 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a76cb005e0 
					 
					
						
						
							
							Update Tensor_exp.h  
						
						
						
						
					 
					
						2021-03-08 13:37:57 -05:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						49ecbc81d4 
					 
					
						
						
							
							Merge pull request  #24  from ThomasWurm/feature/gpt  
						
						
						
						
						Put GlobalSum outside the slice loop in sliceSum 
						
						
					 
					
						2021-03-08 16:01:47 +01:00 
						 
				 
			
				
					
						
							
							
								Thomas Wurm 
							
						 
					 
					
						
						
							
						
						9e5fb52eb9 
					 
					
						
						
							
							Put GlobalSum outside the slice loop  
						
						
						
						
					 
					
						2021-03-08 13:53:34 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						a9604367c1 
					 
					
						
						
							
							Merge pull request  #336  from lehner/feature/gpt  
						
						
						
						
						Make ShmDims configurable; adjust GRID_MAX_SIMD to allow for 128 byte width on GPUs 
						
						
					 
					
						2021-03-05 13:17:19 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						d7065023cc 
					 
					
						
						
							
							Merge pull request  #332  from mmphys/feature/mres_schur  
						
						
						
						
						Optional changes to Test_cayley_mres e.g. Schur solver 
						
						
					 
					
						2021-03-05 12:47:07 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						89d299ceec 
					 
					
						
						
							
							Merge pull request  #333  from mmphys/bugfix/LatTransfer  
						
						
						
						
						Fix convertType for GPU in Lattice_transfer.h 
						
						
					 
					
						2021-03-05 12:46:33 -05:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						e34eda66df 
					 
					
						
						
							
							Merge pull request  #344  from felixerben/feature/XiToSigma  
						
						
						
						
						Feature/xi to sigma 
						
						
					 
					
						2021-03-05 12:45:44 -05:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						b24181aa4f 
					 
					
						
						
							
							Update Coordinate.h  
						
						
						
						
						Revert GRID_MAX_SIMD change 
						
						
					 
					
						2021-03-05 16:56:58 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						aa173e2998 
					 
					
						
						
							
							Update README.md  
						
						
						
						
					 
					
						2021-03-05 10:25:33 -05:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						7a19432e0b 
					 
					
						
						
							
							whitespace  
						
						
						
						
					 
					
						2021-03-05 10:57:09 +00:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						9b15704290 
					 
					
						
						
							
							tested and consistent  
						
						
						
						
					 
					
						2021-03-05 10:42:32 +00:00 
						 
				 
			
				
					
						
							
							
								Michael Marshall 
							
						 
					 
					
						
						
							
						
						017f955b2d 
					 
					
						
						
							
							Merge branch 'develop' into feature/mres_schur  
						
						
						
						
						* develop:
  Pass serial RNG around
  Sycl happier 
						
						
					 
					
						2021-03-04 20:42:02 +00:00 
						 
				 
			
				
					
						
							
							
								Michael Marshall 
							
						 
					 
					
						
						
							
						
						f252d69eef 
					 
					
						
						
							
							Merge branch 'develop' into bugfix/LatTransfer  
						
						
						
						
						* develop:
  Pass serial RNG around
  Sycl happier 
						
						
					 
					
						2021-03-04 20:41:30 +00:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						3b06e4655e 
					 
					
						
						
							
							Merge branch 'develop' into feature/XiToSigma  
						
						
						
						
					 
					
						2021-03-04 20:06:16 +00:00 
						 
				 
			
				
					
						
					 
					
						
						
							
						
						d4b4de8f42 
					 
					
						
						
							
							changes  
						
						
						
						
					 
					
						2021-03-04 20:01:24 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						c90beee774 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-03-03 23:50:29 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						1eea9d73b9 
					 
					
						
						
							
							Pass serial RNG around  
						
						
						
						
					 
					
						2021-03-03 23:50:01 +01:00 
						 
				 
			
				
					
						
							
							
								u61464 
							
						 
					 
					
						
						
							
						
						679d1d22f7 
					 
					
						
						
							
							Sycl happier  
						
						
						
						
					 
					
						2021-03-03 11:21:43 -08:00 
						 
				 
			
				
					
						
							
							
								Michael Marshall 
							
						 
					 
					
						
						
							
						
						b2b5e0b98c 
					 
					
						
						
							
							Merge branch 'develop' into feature/mres_schur  
						
						
						
						
						* develop:
  Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case. Other cases to do. This now includes comms code path.
  Better SIMD usage/coalescence 
						
						
					 
					
						2021-03-03 16:15:12 +00:00 
						 
				 
			
				
					
						
							
							
								Michael Marshall 
							
						 
					 
					
						
						
							
						
						03e54722c1 
					 
					
						
						
							
							Merge branch 'develop' into bugfix/LatTransfer  
						
						
						
						
						* develop:
  Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case. Other cases to do. This now includes comms code path. 
						
						
					 
					
						2021-03-03 16:13:23 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						442336bd96 
					 
					
						
						
							
							Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case.  
						
						
						
						
						Other cases to do. This now includes comms code path. 
						
						
					 
					
						2021-03-02 14:50:51 +01:00 
						 
				 
			
				
					
						
							
							
								Christoph Lehner 
							
						 
					 
					
						
						
							
						
						9c9566b9c9 
					 
					
						
						
							
							Merge pull request  #23  from paboyle/develop  
						
						
						
						
						Sync 
						
						
					 
					
						2021-03-01 12:33:51 +01:00 
						 
				 
			
				
					
						
							
							
								Michael Marshall 
							
						 
					 
					
						
						
							
						
						1059a81a3c 
					 
					
						
						
							
							Merge branch 'develop' into bugfix/LatTransfer  
						
						
						
						
						* develop:
  Better SIMD usage/coalescence 
						
						
					 
					
						2021-02-27 00:21:36 +00:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						2e61556389 
					 
					
						
						
							
							Merge branch 'develop' of  https://github.com/paboyle/Grid  into develop  
						
						
						
						
					 
					
						2021-02-26 17:52:20 +01:00 
						 
				 
			
				
					
						
							
							
								Peter Boyle 
							
						 
					 
					
						
						
							
						
						f9b1f240f6 
					 
					
						
						
							
							Better SIMD usage/coalescence  
						
						
						
						
					 
					
						2021-02-26 17:51:41 +01:00