paboyle
|
ebd730bd54
|
Adding 2D loops
|
2018-06-13 20:13:01 +01:00 |
|
Peter Boyle
|
d3f51065c2
|
Give command line control of blocks/threads split
|
2018-05-12 17:58:56 -04:00 |
|
Peter Boyle
|
6411caad67
|
work distribution
|
2018-04-12 11:41:41 -04:00 |
|
Peter Boyle
|
5412628ea6
|
begin end lambda
|
2018-03-24 19:31:45 -04:00 |
|
Peter Boyle
|
299d119013
|
GPU work allocation improved
|
2018-03-22 18:04:24 -04:00 |
|
Peter Boyle
|
9875c446c6
|
Clean up pragmas
|
2018-03-20 07:19:17 -04:00 |
|
paboyle
|
3a3e3cac40
|
Pull the trigger on offload
|
2018-03-18 14:45:29 +00:00 |
|
paboyle
|
b9b5bdfc3a
|
Proper offload (accelerator access) will require a mutable copy lambda.
|
2018-02-02 11:38:19 +00:00 |
|
paboyle
|
d03ce5c2a4
|
Provide a way to get around std::vector for a known type on device.
Use template specialisation to access a private member in the Clang++ STL implementation
|
2018-02-01 22:44:25 +00:00 |
|
paboyle
|
e657f9a344
|
OMP collapse changes to make NVCC happy
|
2018-01-28 01:21:53 +00:00 |
|
paboyle
|
70e276e1ab
|
parallel_for elimination -> thread_loop
|
2018-01-28 01:01:14 +00:00 |
|
paboyle
|
45df59720e
|
Zero changes and VA_ARGS changes
|
2018-01-27 23:46:58 +00:00 |
|
paboyle
|
f292106db6
|
Split out pragmas from threads.h;
More work needed; rename threads directory to "parallelism" or something like that
|
2018-01-24 13:11:04 +00:00 |
|