|  | 7435315d50 | More blasted shell variables | 2024-03-06 00:03:59 +00:00 |  | 
			
				
					|  | 9b5f741e85 | Reproducing CG can be more useful now | 2024-03-06 00:03:16 +00:00 |  | 
			
				
					|  | 517822fdd2 | SPR HBM benchmarking right and also PVC batched GEMM | 2024-03-06 00:02:27 +00:00 |  | 
			
				
					|  | 1b93a9be88 | Print out the hostname | 2024-03-06 00:01:58 +00:00 |  | 
			
				
					|  | 783a66b348 | Deterministic reduction please | 2024-03-06 00:01:37 +00:00 |  | 
			
				
					|  | 976c3e9b59 | Hack for flight logging CG inner products. Can be made to work, but could put in some more serious infrastructure
for repro testing and blame attribution (Britney test) if necessary | 2024-03-05 23:59:57 +00:00 |  | 
			
				
					|  | f8ca971dae | Use of a bare PRECISION macro is not namespace safe and collides with SYCL | 2024-03-05 23:59:13 +00:00 |  | 
			
				
					|  | 21bc8c24df | OneMKL batched blas starting | 2024-03-05 23:58:20 +00:00 |  | 
			
				
					|  | 30228214f7 | SYCL conflict with Eigen | 2024-03-05 23:56:10 +00:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 2ae980ae43 | Update sourceme.sh | 2024-03-05 13:39:18 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 6153dec2e4 | Update setup.sh | 2024-03-05 13:38:32 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | c805f86343 | USQCD benchmark | 2024-03-01 00:05:04 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 04ca065281 | Only one rank opens | 2024-02-29 20:09:11 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 88d8fa43d7 | Benchmark development | 2024-02-29 20:01:44 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 3c49762875 | Propagate in the blas routine | 2024-02-29 15:33:06 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 436bf1d9d3 | Merge pull request #455 from clarkedavida/hisq_fat_links Hisq fat links | 2024-02-29 15:29:39 -05:00 |  | 
			
				
					| 
							
							
								 david clarke | f70df6e195 | changed NO_SHIFT and BACKWARD_CONST from define to enum | 2024-02-29 12:29:30 -07:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | fce3852dff | Merge pull request #451 from paboyle/feature/eigen-3.4.0-update updating Eigen to 3.4.0 | 2024-02-28 18:03:37 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | ee1b8bbdbd | Merge pull request #454 from edbennett/adjoint-broke fix HMC for non-fundamental representations | 2024-02-28 14:05:27 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 3f1636637d | Merge pull request #453 from dbollweg/feature/sliceSum_gpu Feature/slice sum gpu | 2024-02-28 14:04:43 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 2e570f5300 | Merge pull request #457 from lehner/feature/gpt Import GPT-related updates | 2024-02-28 13:59:04 -05:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 9f89486df5 | remove unnecessary code path | 2024-02-28 19:56:23 +01:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 22b43b86cb | Make GPT test suite work with SYCL | 2024-02-28 12:57:17 +01:00 |  | 
			
				
					| 
							
							
								 dbollweg | 3c9012676a | CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case. | 2024-02-27 12:41:45 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | ee3b3c4c56 | relocate deflation support | 2024-02-27 11:52:23 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 462d706a63 | Move to a blas directory | 2024-02-27 11:51:04 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | ee0d460c8e | Blas based block project & deflate for multiRHS | 2024-02-27 11:41:44 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | cd15abe9d1 | Mrhs prep | 2024-02-27 11:41:13 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 9f40467e24 | Warning squash | 2024-02-27 11:40:36 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | d0b6593823 | More verbose on checksum | 2024-02-27 11:40:14 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 79fc821d8d | reorg headers | 2024-02-27 11:39:37 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | d7fdb9a7e6 | Reorg headers | 2024-02-27 11:39:06 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | b74de51c18 | Reorder headers | 2024-02-27 11:38:52 -05:00 |  | 
			
				
					| 
							
							
								 Dennis Bollweg | b507fe209c | Added SpinColourMatrix case to sliceSum Test | 2024-02-27 11:28:32 -05:00 |  | 
			
				
					| 
							
							
								 Dennis Bollweg | 6cd2d8fcd5 | Replace cuda/hip memcpy with Grid functions | 2024-02-26 09:55:07 -05:00 |  | 
			
				
					| 
							
							
								 david clarke | b02d022993 | fixed race condition (thx michael) | 2024-02-23 17:14:28 -07:00 |  | 
			
				
					| 
							
							
								 david clarke | 94581e3c7a | accelerator_for is broken | 2024-02-23 15:58:33 -07:00 |  | 
			
				
					| 
							
							
								 david clarke | 88b52cc045 | Merge branch 'develop' into hisq_fat_links | 2024-02-23 14:47:15 -07:00 |  | 
			
				
					| 
							
							
								 dbollweg | 0a816b5509 | Merge branch 'feature/sliceSum_gpu' of https://github.com/dbollweg/Grid into feature/sliceSum_gpu | 2024-02-22 21:43:06 -05:00 |  | 
			
				
					| 
							
							
								 dbollweg | 1c8b807c2e | free malloc'd memory | 2024-02-22 21:42:44 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 44b466e072 | Make InsertSliceFast the default at some point in future. Should I do this now? | 2024-02-21 14:51:24 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 5e5b471bb2 | Put/Get and DEviceToDevice | 2024-02-21 14:47:06 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 9c2565f64e | Working and faster version | 2024-02-21 14:46:43 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | e1d0a7cec3 | Batched blas | 2024-02-21 14:38:20 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | b19ae8f465 | Nbasis method for convenience | 2024-02-21 14:36:19 -05:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | cdff2c8e18 | Updated mrhs adef | 2024-02-21 14:27:19 -05:00 |  | 
			
				
					| 
							
							
								 Christoph Lehner | 66391f84f2 | Merge branch 'feature/gpt' of ../Grid into develop | 2024-02-21 19:05:00 +01:00 |  | 
			
				
					|  | 97f7a9ecb3 | fix HMC for non-fundamental representations | 2024-02-21 08:27:55 +00:00 |  | 
			
				
					| 
							
							
								 Dennis Bollweg | 15878f7613 | sliceSumReduction_cub_large now also faster than CPU on Frontier | 2024-02-16 13:55:21 -05:00 |  | 
			
				
					| 
							
							
								 dbollweg | e0d5e3c6c7 | Merge branch 'paboyle:develop' into feature/sliceSum_gpu | 2024-02-16 13:16:37 -05:00 |  |