| 
							
							
								 Peter Boyle | 936c5ecf69 | Reduction GPU no compile fix | 2020-06-24 17:28:31 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 22cfbdbbb3 | Boost precision in inner products in single | 2020-06-24 12:52:31 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 093d1ee21b | Force initial values | 2020-06-24 08:54:49 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | d6ba2581ce | Merge branch 'develop' of https://github.com/paboyle/Grid into develop | 2020-06-24 08:25:08 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 577c064184 | Memory manager initialise earlier | 2020-06-24 08:24:38 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 2ff1fa6fad | UVM used shared for CPU alloccations andd ddont migrate | 2020-06-23 22:14:56 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 70be1bd8be | Adding code under development | 2020-06-23 10:24:21 -04:00 |  | 
			
				
					|  | 4ef50ba31f | Baryon speedup | 2020-06-23 11:44:20 +01:00 |  | 
			
				
					|  | 3e97a26f90 | BaryonGamm3pt threads -> accelerator | 2020-06-23 11:35:32 +01:00 |  | 
			
				
					|  | 599f28f6ef | Baryon bug fixes | 2020-06-23 11:10:26 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | c48da35921 | Memory Vector UVM and Lattice alignedAllocator separate | 2020-06-22 20:21:53 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 6c5fa8dcd8 | Aligned allocate on CPU put through this interface | 2020-06-20 14:34:29 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 0d2f913a1a | String.h for linux | 2020-06-20 09:37:31 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 5b117865b2 | Merge pull request #6 from paboyle/sycl Sycl | 2020-06-20 09:44:44 +02:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 1a74816c25 | Hopeefully fixed | 2020-06-19 17:50:52 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 73de335256 | Merge branch 'develop' into sycl | 2020-06-19 17:44:16 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 228fd450ce | Typo fix (excusee - my keyboard is starting to break) | 2020-06-19 17:36:05 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | b949cf6b12 | PeekLocal needs a view to keep thread safe. ALLOCATION_CACHEE reenable | 2020-06-19 17:13:27 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 11bc1aeadc | TThread count defaultt to fastest | 2020-06-19 14:30:35 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 66005929af | Set up the cache size on all ranks | 2020-06-19 12:50:54 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 05bbc49a99 | Edge case in GetShmDim check | 2020-06-19 12:01:23 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | ff7c847735 | Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl | 2020-06-19 01:22:16 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 1aa988b2af | Comms overlap fix UVM case | 2020-06-19 01:21:14 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | edf17708a8 | Range improvement | 2020-06-18 22:41:06 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 81a8209749 | ConvertType for blockInnerProduct | 2020-06-18 11:53:21 -04:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | a87e45ba25 | SVE readme update | 2020-06-18 11:23:08 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 465856331a | switch back to serialized; wrong results on single too | 2020-06-15 15:39:39 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | cc958aa9ed | switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap | 2020-06-15 14:21:38 +02:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | f46f029dbb | Merge pull request #292 from lehner/feature/gpt-sycl Catch edge case in SharedMemoryMPI::GetShmDims; Change default units … | 2020-06-14 13:43:27 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 3dccd7aa2c | Catch edge case in SharedMemoryMPI::GetShmDims; Change default units to consistent MB in init args; Want last element not past last element in MemoryManagerCache.cc | 2020-06-14 13:26:01 -04:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | a25e4b3d0c | pred 32/64 for float/double instead of 8 in VLA patch | 2020-06-13 14:44:37 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | d1210ca12a | switch to double/float instead of float64_t/float32_t in VLA patch | 2020-06-13 13:59:32 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 36ea0e222a | type traits for ComplexF/D in VLA patch; cosmetics in VLS intrinsics | 2020-06-13 13:42:35 +02:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 65e6e7da6f | Merge pull request #291 from lehner/feature/gpt-sycl Feature/gpt sycl | 2020-06-12 20:42:32 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | b5e87e8d97 | summit compile fixes | 2020-06-12 18:16:12 -04:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 5f5807d60a | cleanup | 2020-06-12 14:48:23 -04:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 92281ec22d | add 3 op Mult for VLA | 2020-06-12 18:49:05 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 87266ce099 | comment out fcmla in vector types: need also MultAddReal | 2020-06-12 18:37:19 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 2a23f133e8 | reenable fcmla for VLA | 2020-06-12 17:30:38 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 8dbf790f62 | correct tbl2 for sp | 2020-06-12 17:12:34 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 2402b4940e | vec_imm in float | 2020-06-12 15:17:38 +02:00 |  | 
			
				
					| 
							
							
								 nmeyer-ur | 2111052fbe | apply VLA patch for memcpy reduction suggested by Arm, CAS-162542-D6W7Z7 | 2020-06-12 14:49:19 +02:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 7974acff54 | merged sycl to feature-gpt | 2020-06-12 06:49:38 -04:00 |  | 
			
				
					|  | f0d17d2b49 | Added Baryon3pt code | 2020-06-12 11:35:52 +01:00 |  | 
			
				
					|  | 244c003a1b | Updated Baryon code | 2020-06-12 11:00:25 +01:00 |  | 
			
				
					|  | 0174f5f742 | look for librt when using shm=shmopen | 2020-06-11 16:50:43 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 32b2b59be4 | Offload | 2020-06-10 20:36:26 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 86bb0cc24b | Keep on GPU | 2020-06-10 20:00:00 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 84c19587e7 | Offload | 2020-06-10 19:59:31 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 237ce92540 | Offload loops | 2020-06-10 19:59:11 -04:00 |  |