Peter Boyle
|
b473405652
|
Tensor ambiguous fix
|
2019-08-29 09:36:41 -05:00 |
|
Peter Boyle
|
28d6be2a4e
|
Fix GCC complaint
|
2019-08-22 18:56:37 +01:00 |
|
Peter Boyle
|
96ac56cace
|
Double precision variants for summation accuracy
|
2019-08-14 13:08:01 +01:00 |
|
Peter Boyle
|
a23dc295ac
|
Remove compiler errors and warnings
|
2019-07-18 14:47:02 +01:00 |
|
Peter Boyle
|
08904f830e
|
Merge develop
|
2019-07-16 11:59:56 +01:00 |
|
Peter Boyle
|
fa9cd50c5b
|
Merge branch 'develop' into feature/gpu-port
|
2019-07-16 11:55:17 +01:00 |
|
Peter Boyle
|
d6ffadb33b
|
Coalesced write
|
2019-07-02 17:25:13 +01:00 |
|
Peter Boyle
|
b8f7bfbb26
|
Dont stream as poor perf in some cases
|
2019-07-01 07:30:25 +01:00 |
|
Peter Boyle
|
d976e5c514
|
Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug.
|
2019-06-16 12:05:11 +01:00 |
|
Peter Boyle
|
b285138be4
|
Better checking on types
|
2019-06-15 08:27:48 +01:00 |
|
Peter Boyle
|
29a244e423
|
Test of using a lane variable instead of repeated reference to threadIdx.y
|
2019-06-08 13:46:26 +01:00 |
|
Peter Boyle
|
0ee6e77cbc
|
Compiles GPU and CPU, still gives good performance on CPU
|
2019-06-05 13:28:16 +01:00 |
|
Peter Boyle
|
8794d35c78
|
GPU
|
2019-06-04 20:52:27 +01:00 |
|
Peter Boyle
|
6e2e904a0e
|
NVCC compiles happy. Start to develop strategy for writing generic code for GPU kernels and CPU kernels.
|
2019-06-04 20:46:35 +01:00 |
|
Peter Boyle
|
ffde81f22a
|
Nsimd() and coalesced support
|
2019-05-25 12:44:07 +01:00 |
|
Peter Boyle
|
d8098f1ecd
|
coalesced support
|
2019-05-25 12:43:31 +01:00 |
|
Michael Marshall
|
12d8bf1ced
|
Eigen::Tensor serialisation. Tested on single and double precision builds
|
2019-03-20 22:27:41 +00:00 |
|
|
91cffef883
|
Updates after review with Peter.
|
2019-03-07 14:30:35 +00:00 |
|
|
b7db99967a
|
Recommendations for Traits classes
|
2019-02-28 20:06:59 +00:00 |
|
Peter Boyle
|
e73b909a48
|
Make tests running past nvcc. Different NVCC versions proving tricky to keep happy. This is 9.2
|
2019-01-02 12:05:30 +00:00 |
|
Peter Boyle
|
8c91e82ee8
|
GPU clean up, remove parallel_for. Split into accelerator_loop, thread_loop cases, and collides with parallel_for in thrust
|
2019-01-01 15:06:46 +00:00 |
|
Peter Boyle
|
422764757d
|
Updates in tests to make all of Grid compile
|
2018-12-14 16:55:54 +00:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
|
fb7d021b9d
|
Hadrons: moving Hadrons to root directory, build system improvements
|
2018-08-28 15:00:40 +01:00 |
|