| 
							
							
								 Peter Boyle | 7bf82f5b37 | Offload the face handling to GPU | 2018-09-10 11:28:42 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 704ca162c1 | Offloadable compression | 2018-09-10 11:20:50 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | da17a015c7 | Pack the stencil smaller for 128 bit access | 2018-07-23 06:12:45 -04:00 |  | 
			
				
					| 
							
							
								 paboyle | e6be7416f4 | Use managed memory | 2018-06-13 20:14:00 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 26863b6d95 | User Managed memory | 2018-06-13 20:13:42 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | d4ce7d9905 | GPU friendly Stencil needs a view | 2018-03-19 07:11:21 -04:00 |  | 
			
				
					| 
							
							
								 paboyle | ad739f042a | Introduce views for passing lattice indexing to accelerators. | 2018-03-04 15:56:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | bf5fb89aff | Coordinate handling GPU friendly | 2018-02-24 22:21:36 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 51eb2c5dfc | Make referencign the stencil and all info required to evaluate the kernel accelerator marked up | 2018-02-02 11:37:13 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 79b50feacf | fixme updates | 2018-01-29 16:00:40 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 70e276e1ab | parallel_for elimination -> thread_loop | 2018-01-28 01:01:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c4f82e072b | _grid becomes private ; use Grid()§ | 2018-01-27 00:04:12 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 912b50f6fa | Hiding lattice internals | 2018-01-26 23:08:45 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 063603b1ea | Warning elimination | 2018-01-24 13:12:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 5e48b701ec | FOrmatting | 2018-01-13 00:11:53 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 7f6bffe5ad | NAMESPACE | 2018-01-13 00:11:30 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 0a68470f9a | Simplify comms layers | 2018-01-08 11:28:30 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 5918769f97 | Subtle Naik term bug updated in Stencil; less on logical && with a function call on right | 2017-09-16 12:51:26 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | c289699d9a | updated from cambridge mpi3 shakeout | 2017-08-25 11:41:01 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | c3b1263e75 | Benchmark prep | 2017-08-25 09:25:54 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | a446d95c33 | Trying to pass TeamCity and Travis | 2017-08-20 01:10:50 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 7d88198387 | Merge branch 'develop' into feature/multi-communicator | 2017-08-19 13:03:35 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 14d53e1c9e | Threaded MPI calls patches | 2017-07-29 13:08:10 -04:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | b5a6e4f1fd | Best option for Xeon cache blocking set | 2017-06-30 10:53:22 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 38325ebbc6 | Interleave code path; not enabled | 2017-06-30 10:23:51 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 54e94360ad | Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit | 2017-06-24 23:10:24 +01:00 |  | 
			
				
					|  | 35fa3d1dfd | Merge branch 'master' into feature/scalar_adjointFT | 2017-05-12 10:41:39 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 2439999ec8 | Warning elimination; drop to -O2 on G++ bad versions | 2017-05-06 14:44:49 +01:00 |  | 
			
				
					| 
							
							
								 Guido Cossu | 741bc836f6 | Exposing support for Ncolours and Ndimensions and JSON input file for the ScalarAction | 2017-05-05 17:36:43 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | 99220f6531 | Fixes and better timing | 2017-04-26 17:24:11 -04:00 |  | 
			
				
					| 
							
							
								 paboyle | ab66bac4e6 | Think I'm getting on top of the reduced cost exterior precomputed list of links | 2017-04-25 08:50:26 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 56277a11c8 | Build a list of whats on the surface | 2017-04-24 17:06:15 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 3703b718aa | Mark up a table if a given site only receives from itself; including MPI3 splitting info. | 2017-04-22 19:28:37 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 736bf3c866 | Major rework of stencil. Half precision and MPI3 now working. | 2017-04-22 11:33:50 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | e1a2319d01 | Simple compressor moved out of cshift into stencil | 2017-04-20 13:18:15 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | d2312e9874 | Drop compressor entirely from Cshift to only Stencil. | 2017-04-20 13:16:55 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | fc4ab9ccd5 | Working half precision comms | 2017-04-20 11:20:26 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 4a340aa5ca | Massive compressor rework to support reduced precision comms | 2017-04-20 09:28:27 +01:00 |  | 
			
				
					|  | 8ef4300412 | spurious .dirstamp files removed | 2017-04-10 17:00:22 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 8c8473998d | Average over whole cluster the comm time. | 2017-03-21 22:29:51 -04:00 |  | 
			
				
					| 
							
							
								 paboyle | 4e7ab3166f | Refactoring header layout | 2017-02-22 18:09:33 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | aca7a3ef0a | Optimisation control improvements | 2017-02-10 18:22:31 -05:00 |  | 
			
				
					|  | c56707e003 | useless debug message removed | 2016-12-07 08:59:20 +09:00 |  | 
			
				
					| 
							
							
								 paboyle | bb94ddd0eb | Tidy up of mpi3; also some cleaning of the dslash controls. | 2016-11-02 08:07:09 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 680645f849 | Merge branch 'release/v0.5.0' | 2016-06-30 15:15:03 -07:00 |  | 
			
				
					| 
							
							
								 paboyle | 6d58cb2a68 | Enable reordering of the loops in the assembler for cache friendly. This gets in the way of L2 prefetching however. Do next next link in stencil
prefetching. | 2016-06-30 14:35:01 -07:00 |  | 
			
				
					| 
							
							
								 Guido Cossu | 5e02392f9c | Fixed compilation error for benchmark_dwf Some parts were assuming floating point precision | 2016-06-20 12:30:51 +01:00 |  | 
			
				
					| 
							
							
								 Peter Boyle | f78d89bcbe | Update Lebesgue.cc kill verbose | 2016-06-03 13:33:42 +01:00 |  | 
			
				
					| 
							
							
								 paboyle | 139cc5f1ae | Large change with KNL preparation | 2016-06-03 03:24:26 -07:00 |  | 
			
				
					| 
							
							
								 paboyle | 090e7aa930 | Merge remote-tracking branch 'origin/chulwoo-dec12-2015' Merge Chulwoo's Lanczos related improvements.
Merge Nd!=4 fixes for pure gauge HMC from Evan. | 2016-03-08 09:55:14 +00:00 |  |