Peter Boyle
|
24202dbc51
|
Thread loop construct change
|
2019-06-15 12:52:07 +01:00 |
|
Peter Boyle
|
d763c303c5
|
Clean acceleerator barrier
|
2019-06-15 12:51:45 +01:00 |
|
Peter Boyle
|
8e394d3bf9
|
New loop construct
|
2019-06-15 12:51:15 +01:00 |
|
Peter Boyle
|
b881d5489b
|
Move SchurDiagTwoKappa to Algorithms
|
2019-06-15 12:50:45 +01:00 |
|
Peter Boyle
|
49f90cc7eb
|
use pragma once
|
2019-06-15 12:45:22 +01:00 |
|
Peter Boyle
|
b77af0210b
|
Thread loop. Probably deprecate this impl
|
2019-06-15 12:44:56 +01:00 |
|
Peter Boyle
|
5254ede2d8
|
New loops. Revisit as accelerator loop in future audit
|
2019-06-15 12:44:29 +01:00 |
|
Peter Boyle
|
16e5d7945e
|
Hard to make 5D vec work with GPU code
|
2019-06-15 12:43:43 +01:00 |
|
Peter Boyle
|
decc99ca76
|
Accelerator version
|
2019-06-15 12:43:00 +01:00 |
|
Peter Boyle
|
464cd65931
|
Still to test this fully
|
2019-06-15 12:35:14 +01:00 |
|
Peter Boyle
|
a1ec2f4723
|
Still to test this routine fully
|
2019-06-15 12:33:55 +01:00 |
|
Peter Boyle
|
ea9662ec85
|
Thread loop changes
|
2019-06-15 09:09:57 +01:00 |
|
Peter Boyle
|
52c74f1cac
|
Thread loop changes
|
2019-06-15 09:08:16 +01:00 |
|
Peter Boyle
|
9a13d2992c
|
lean up
|
2019-06-15 09:05:16 +01:00 |
|
Peter Boyle
|
b0449ae270
|
Thread loop changes
|
2019-06-15 09:04:19 +01:00 |
|
Peter Boyle
|
1299225105
|
Accelerator loop changes
|
2019-06-15 09:03:46 +01:00 |
|
Peter Boyle
|
5925e7f405
|
Thread for changes
|
2019-06-15 09:01:30 +01:00 |
|
Peter Boyle
|
be1fd4930f
|
Template instantiation make happy changes
|
2019-06-15 08:37:34 +01:00 |
|
Peter Boyle
|
36f06555a2
|
Simplify Impl
|
2019-06-09 22:26:27 +01:00 |
|
Peter Boyle
|
d6c0e0756d
|
Remove GPU version
|
2019-06-09 11:23:42 +01:00 |
|
Peter Boyle
|
3e41b1055c
|
Remove Gpu only kernels.
|
2019-06-09 11:20:01 +01:00 |
|
Peter Boyle
|
e78a5e7838
|
ASM instantiation without link errors
|
2019-06-09 01:25:21 +01:00 |
|
Peter Boyle
|
8e3a05d89b
|
Moving the instantiation into a cleaner structure
|
2019-06-08 13:48:33 +01:00 |
|
Peter Boyle
|
c933ac2248
|
Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
accelerator_loop
|
2019-06-08 13:44:27 +01:00 |
|
Peter Boyle
|
ad2c433574
|
Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but doesn't
seem to make a difference. Will revisit this and probably remove the lane parameter from the coalescedRead
|
2019-06-08 13:43:12 +01:00 |
|
Peter Boyle
|
86e7fb6e86
|
Instantiation relocation
|
2019-06-08 13:42:46 +01:00 |
|
Peter Boyle
|
fb91dda7be
|
Hand instantiation moved location
|
2019-06-08 13:42:26 +01:00 |
|
Peter Boyle
|
82cf7bc5ab
|
Move instantiation into fermion/instantiation
|
2019-06-08 13:41:46 +01:00 |
|
Peter Boyle
|
e452cc0a22
|
Move static variables into instantiation .cc file
|
2019-06-08 13:41:20 +01:00 |
|
Peter Boyle
|
4d2b938166
|
Remove explict instantiation from here
|
2019-06-08 13:41:01 +01:00 |
|
Peter Boyle
|
10d16ab76c
|
Remove explict instantiation from here
|
2019-06-08 13:40:32 +01:00 |
|
Peter Boyle
|
1f997fa484
|
Instantiate via explict .cc files for parallel make.
|
2019-06-08 13:39:51 +01:00 |
|
Peter Boyle
|
0ee6e77cbc
|
Compiles GPU and CPU, still gives good performance on CPU
|
2019-06-05 13:28:16 +01:00 |
|
Peter Boyle
|
18d3cde29a
|
Compile on GPU workd
|
2019-06-05 00:14:58 +01:00 |
|
Peter Boyle
|
7323099966
|
Instatiation fix
|
2019-06-05 00:14:38 +01:00 |
|
Peter Boyle
|
6379651cdd
|
Generic or GPU ready for benchmark test on GPU
|
2019-06-05 00:13:52 +01:00 |
|
Peter Boyle
|
ba4fd756b9
|
Fix signature, but deprecating this loops style
|
2019-06-05 00:12:36 +01:00 |
|
Peter Boyle
|
d185fc1ebf
|
clean up instantiation
|
2019-06-05 00:11:52 +01:00 |
|
Peter Boyle
|
96b36d8367
|
Instantiation clean up
|
2019-06-05 00:11:27 +01:00 |
|
Peter Boyle
|
899f8b5065
|
Instantiation clean up 5d vec removal
|
2019-06-05 00:11:05 +01:00 |
|
Peter Boyle
|
c8d0483fe9
|
Remove 5d vectorisation
|
2019-06-05 00:10:37 +01:00 |
|
Peter Boyle
|
0f214e5f76
|
Clean up instantiation
|
2019-06-05 00:10:13 +01:00 |
|
Peter Boyle
|
9636324069
|
GPU happy code
|
2019-06-05 00:08:54 +01:00 |
|
Peter Boyle
|
8a5489d9e6
|
Move the loop into a central kernel call.
|
2019-06-05 00:08:13 +01:00 |
|
Peter Boyle
|
b47f73c222
|
GPU happy
|
2019-06-04 21:30:39 +01:00 |
|
Peter Boyle
|
5720ced0fd
|
Simplifying
|
2019-06-04 21:30:08 +01:00 |
|
Peter Boyle
|
2c87b56b53
|
Making GPU happier
|
2019-06-04 21:29:44 +01:00 |
|
Peter Boyle
|
dbad48d802
|
Remove Ls vectorised DWF
|
2019-06-04 21:27:40 +01:00 |
|
Peter Boyle
|
4557a1365a
|
Remove Ls vectorised DWF
|
2019-06-04 20:59:59 +01:00 |
|
Peter Boyle
|
16e9b87d98
|
Remove Ls vectorised DWF as unused and hard to maintain
|
2019-06-04 20:59:01 +01:00 |
|