Peter Boyle
|
462900b48d
|
Modified entire test directory to suit new GPU constructs for looping
|
2019-06-15 12:53:27 +01:00 |
|
Peter Boyle
|
0561c2edeb
|
Benchmarks modified for new GPU constructs
|
2019-06-15 12:52:56 +01:00 |
|
Peter Boyle
|
0184719216
|
Change to predicate type
|
2019-06-15 12:52:26 +01:00 |
|
Peter Boyle
|
24202dbc51
|
Thread loop construct change
|
2019-06-15 12:52:07 +01:00 |
|
Peter Boyle
|
d763c303c5
|
Clean acceleerator barrier
|
2019-06-15 12:51:45 +01:00 |
|
Peter Boyle
|
8e394d3bf9
|
New loop construct
|
2019-06-15 12:51:15 +01:00 |
|
Peter Boyle
|
b881d5489b
|
Move SchurDiagTwoKappa to Algorithms
|
2019-06-15 12:50:45 +01:00 |
|
Peter Boyle
|
82306913a8
|
Move Schur operator into correct place
|
2019-06-15 12:49:22 +01:00 |
|
Peter Boyle
|
49f90cc7eb
|
use pragma once
|
2019-06-15 12:45:22 +01:00 |
|
Peter Boyle
|
b77af0210b
|
Thread loop. Probably deprecate this impl
|
2019-06-15 12:44:56 +01:00 |
|
Peter Boyle
|
5254ede2d8
|
New loops. Revisit as accelerator loop in future audit
|
2019-06-15 12:44:29 +01:00 |
|
Peter Boyle
|
16e5d7945e
|
Hard to make 5D vec work with GPU code
|
2019-06-15 12:43:43 +01:00 |
|
Peter Boyle
|
decc99ca76
|
Accelerator version
|
2019-06-15 12:43:00 +01:00 |
|
Peter Boyle
|
464cd65931
|
Still to test this fully
|
2019-06-15 12:35:14 +01:00 |
|
Peter Boyle
|
a1ec2f4723
|
Still to test this routine fully
|
2019-06-15 12:33:55 +01:00 |
|
Peter Boyle
|
ea9662ec85
|
Thread loop changes
|
2019-06-15 09:09:57 +01:00 |
|
Peter Boyle
|
52c74f1cac
|
Thread loop changes
|
2019-06-15 09:08:16 +01:00 |
|
Peter Boyle
|
9a13d2992c
|
lean up
|
2019-06-15 09:05:16 +01:00 |
|
Peter Boyle
|
b0449ae270
|
Thread loop changes
|
2019-06-15 09:04:19 +01:00 |
|
Peter Boyle
|
1299225105
|
Accelerator loop changes
|
2019-06-15 09:03:46 +01:00 |
|
Peter Boyle
|
5925e7f405
|
Thread for changes
|
2019-06-15 09:01:30 +01:00 |
|
Peter Boyle
|
be1fd4930f
|
Template instantiation make happy changes
|
2019-06-15 08:37:34 +01:00 |
|
Peter Boyle
|
377fa5dec1
|
looping construct
|
2019-06-15 08:36:48 +01:00 |
|
Peter Boyle
|
e8b78f596e
|
Looping construct changes
|
2019-06-15 08:35:57 +01:00 |
|
Peter Boyle
|
09720c40cd
|
Coalesced loops
|
2019-06-15 08:35:26 +01:00 |
|
Peter Boyle
|
bb024dd114
|
Loop construct changed
|
2019-06-15 08:30:05 +01:00 |
|
Peter Boyle
|
52456b9ec7
|
New loop construct
|
2019-06-15 08:28:45 +01:00 |
|
Peter Boyle
|
b285138be4
|
Better checking on types
|
2019-06-15 08:27:48 +01:00 |
|
Peter Boyle
|
c7dbf4c87e
|
Scalar support for GPU threads
|
2019-06-15 08:25:43 +01:00 |
|
Peter Boyle
|
1e889c93b8
|
Insert a GPU synchronise
|
2019-06-15 08:23:26 +01:00 |
|
Peter Boyle
|
7379047482
|
Threading and acceleration primitives further changes. accelerator_barrier() needed and used
|
2019-06-15 08:22:48 +01:00 |
|
Peter Boyle
|
d836ce3b78
|
Clean up of acceleration and threading primitives
|
2019-06-15 08:14:21 +01:00 |
|
Peter Boyle
|
cefaacbc07
|
Changing accelerator loop. Still have work to do for multi-GPU code
|
2019-06-15 08:10:24 +01:00 |
|
Peter Boyle
|
0074ef7f69
|
thread loops
|
2019-06-15 08:04:29 +01:00 |
|
Peter Boyle
|
20359ca15f
|
Coalesced loops.
|
2019-06-15 08:03:57 +01:00 |
|
Peter Boyle
|
736358b0cb
|
Coalesced loops
|
2019-06-15 08:03:13 +01:00 |
|
Peter Boyle
|
6b692aa726
|
Thread loops
|
2019-06-15 08:02:26 +01:00 |
|
Peter Boyle
|
7f99e1cd3b
|
Coalesced loops
|
2019-06-15 08:01:39 +01:00 |
|
Peter Boyle
|
f3c89df948
|
Thread loop changes
|
2019-06-15 08:00:37 +01:00 |
|
Peter Boyle
|
b7e6d111d7
|
Thread loop changes. Need to offload this file
|
2019-06-15 07:59:10 +01:00 |
|
Peter Boyle
|
f39cf69c33
|
Accelerator loop change
|
2019-06-15 07:58:23 +01:00 |
|
Peter Boyle
|
8e27338df2
|
Rationalise number of loop macros
|
2019-06-15 07:57:40 +01:00 |
|
Peter Boyle
|
bcbb5e9d26
|
Remove assembly tests
|
2019-06-15 07:57:05 +01:00 |
|
Peter Boyle
|
0ea7f5279d
|
Accelerator loop changes
|
2019-06-15 07:56:14 +01:00 |
|
Peter Boyle
|
18e5de426d
|
There is a stray use of predicatedWhere introduced by Andrew Lawson in the conserve currents.
The conserved currents need rewritten using data parallel operations.
|
2019-06-15 07:53:58 +01:00 |
|
Peter Boyle
|
e896d81235
|
Accelerator loop redefine. Coalesce most accesses, but ET engine still to go clean.
|
2019-06-15 07:52:44 +01:00 |
|
Peter Boyle
|
7b8ccff4f4
|
Accelerated coalesced loops in most cases
|
2019-06-15 07:48:00 +01:00 |
|
Peter Boyle
|
68541606ab
|
Thread loop changes. Soon try these with accelerator loops and benchmark
|
2019-06-15 07:46:42 +01:00 |
|
Peter Boyle
|
339ea10cc7
|
First touch only on CPU code
|
2019-06-15 07:45:43 +01:00 |
|
Peter Boyle
|
d0d8dc8042
|
Thread loop changes
|
2019-06-15 07:45:09 +01:00 |
|