| 
							
							
								 paboyle | 3277bda130 | View introduction to prepare for accelerator offload. Probably same problem exists for stencil object | 2018-03-04 16:38:08 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 442b0b406c | View related changes | 2018-03-04 16:34:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 8824a54269 | View related changes | 2018-03-04 16:33:33 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c03423250f | Indexable changes | 2018-03-04 16:31:35 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 317fd0da44 | Views introduced. Need to accelerator offload these routines. | 2018-03-04 16:30:45 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 783795a44a | Views introduced | 2018-03-04 16:12:49 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 0e6197fbed | Introduce accelerator friendly expression template rewrite. Must obtain and access lattice indexing through a view object that is safe to copy construct when copying to the GPU (without copying the lattice). | 2018-03-04 16:03:19 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | dad7862f91 | Go through a view object that can be copied to GPU | 2018-03-04 16:02:02 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c89a883448 | where was deprecated and integrated to ET engine a long time ago. Remove dead old original code | 2018-03-04 15:58:02 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c204288fbc | Remove a couple of print statements | 2018-03-04 15:57:15 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | ad739f042a | Introduce views for passing lattice indexing to accelerators. | 2018-03-04 15:56:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | db988301d0 | Introduce view objects for indexing lattices. Used to pass the view to accelerators | 2018-03-04 15:55:16 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 9b1f29c4c2 | Support a view for passing to accelerator | 2018-03-04 15:54:35 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | e5ea04ee0c | Need to support precision change, and real replication in multiple simd lanes | 2018-03-04 15:53:04 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c92a3c6068 | Need to support any vector type template and run on accelerator | 2018-03-04 15:52:14 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c1fc947bb8 | Coordinate handling GPU friendly + some GPU merge/extract improvements | 2018-02-24 22:26:10 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | ff7b19a71b | Coordinate handling GPU ready avoid malloc | 2018-02-24 22:25:39 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 1c16ffa1c1 | Coordinate GPU ready. No malloc | 2018-02-24 22:25:09 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 4962f59477 | Eliminate both GPU issue and threading bottleneck by avoiding malloc in coordinate handling | 2018-02-24 22:24:37 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 34820bec27 | Coordinate handling GPU ready. No malloc | 2018-02-24 22:23:18 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | eed9aa9f0c | Extract merge gpu ready | 2018-02-24 22:23:01 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 8792ff6439 | Coordinate handling gpu ready | 2018-02-24 22:22:43 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 078901278c | Coordinate handling gpu friendly | 2018-02-24 22:22:02 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | bf5fb89aff | Coordinate handling GPU friendly | 2018-02-24 22:21:36 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 7574c18cef | Massive clean up extract merge. Simpler and GPU friendly | 2018-02-24 22:21:08 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | b9b5bdfc3a | Proper offload (accelerator access) will require a mutable copy lambda. | 2018-02-02 11:38:19 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 51eb2c5dfc | Make referencing the stencil and all info required to evaluate the kernel accelerator marked up | 2018-02-02 11:37:13 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | ede0dff794 | Mark up as an accelerator function | 2018-02-02 11:36:44 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | aa6de818e2 | Copy data needed by Kernels out of the grid object to avoid host reference | 2018-02-02 11:36:11 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | dcf6517a93 | Accelerator offload and copy Opt into the kernel for GPU  host var safety | 2018-02-02 11:35:35 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | a308dff410 | accelerator loop, copy Opt into the GPU | 2018-02-02 11:34:37 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 14ba20898a | Accelerator loop the key kernel call | 2018-02-02 11:30:07 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | a53d3ee19a | Add Opt to the lambda capture to get it into the GPU | 2018-02-02 11:28:39 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 5df435319d | Use constexpr | 2018-02-02 11:27:56 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 0da2d3e222 | accelerator offload some more stuff | 2018-02-02 11:27:35 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 9c9dfbfa78 | Force accelerator | 2018-02-02 11:25:09 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | e4df025d01 | Accelerator related | 2018-02-01 23:20:05 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | cfeda9d536 | constexpr on const ints | 2018-02-01 22:59:12 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 4450b1993a | Offload | 2018-02-01 22:45:47 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | d03ce5c2a4 | Provide a way to get around std::vector for a known type on device. Use template specialisation to access a private member in the Clang++ STL implementation | 2018-02-01 22:44:25 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 7d6522c1ef | Accelerator inline | 2018-02-01 22:43:56 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | b96832a922 | Accelerator inline | 2018-02-01 22:43:26 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 5d7af47b05 | accelerator_inline | 2018-02-01 22:42:54 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 053ef25c90 | constexpr makes GPU happy | 2018-02-01 22:42:29 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 8ae77d3706 | Small simplification of FermionOperatorImpl towards GPU but not there yet | 2018-02-01 22:41:54 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 79b50feacf | fixme updates | 2018-01-29 16:00:40 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | c67c1544cd | abs no compile on travis fix attempt | 2018-01-28 10:26:04 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | e657f9a344 | OMP collapse changes to make NVCC happy | 2018-01-28 01:21:53 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | b6ebf35af5 | Intel compiler doesn't like Nvidia error disable pragmas | 2018-01-28 01:03:10 +00:00 |  | 
			
				
					| 
							
							
								 paboyle | 70e276e1ab | parallel_for elimination -> thread_loop | 2018-01-28 01:01:14 +00:00 |  |