Peter Boyle
|
9b7a6d197f
|
Fix for GCC preprocessor/pragma handling bug
|
2019-08-23 14:37:46 +01:00 |
|
Peter Boyle
|
275c1c920f
|
More info dump on error from CUDA
|
2019-07-26 12:18:53 +01:00 |
|
Peter Boyle
|
f15eeb0283
|
localise scope of variables declared in macro
|
2019-07-12 06:47:01 +01:00 |
|
Peter Boyle
|
7379047482
|
Threading and acceleration primitives further changes. accelerator_barrier() needed and used
|
2019-06-15 08:22:48 +01:00 |
|
Peter Boyle
|
d836ce3b78
|
Clean up of acceleration and threading primitives
|
2019-06-15 08:14:21 +01:00 |
|
Peter Boyle
|
8adc5da7dd
|
Testig out approaches to kernel writing introducing SIMT_loop temporarily
|
2019-06-08 13:47:04 +01:00 |
|
Peter Boyle
|
8113845f9c
|
coalesce loop. Need to rationalise this file
|
2019-06-04 23:49:29 +01:00 |
|
Peter Boyle
|
c2625a127e
|
Non blocking loop. Want to change the naming here.
|
2019-06-04 20:52:59 +01:00 |
|
Peter Boyle
|
a584b16c4a
|
Adding a non-blocking kernel launch
|
2019-05-18 17:39:54 +01:00 |
|
Peter Boyle
|
8c91e82ee8
|
GPU clean up, remove parallel_for. Split into accelerator_loop, thread_loop
cases, and collides with parallel_for in thrust
|
2019-01-01 15:06:46 +00:00 |
|
Peter Boyle
|
2fcedb13dd
|
Step size modification in HMC; ICC happy thread pragmas
|
2018-12-20 09:32:33 +00:00 |
|
Peter Boyle
|
afc462bd58
|
Bracketing issue in macro
|
2018-12-13 10:53:22 +00:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
|
f592ec8baa
|
Hadrons: contractor performance fix
|
2018-11-16 20:59:49 +00:00 |
|
|
8b007b5c24
|
Hadrons: remove the use of OpenMP reductions
|
2018-11-16 20:00:29 +00:00 |
|
|
88d9922e4f
|
Hadrons: fast A2A matrix contraction kernels
|
2018-11-06 19:49:09 +00:00 |
|
|
1651111d18
|
Hadrons: final, portable form of the contractor benchmark
|
2018-11-05 21:29:13 +00:00 |
|
|
fb7d021b9d
|
Hadrons: moving Hadrons to root directory, build system improvements
|
2018-08-28 15:00:40 +01:00 |
|