1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-04-23 20:25:56 +01:00

96 Commits

Author SHA1 Message Date
Peter Boyle
24202dbc51 Thread loop construct change 2019-06-15 12:52:07 +01:00
Peter Boyle
d763c303c5 Clean acceleerator barrier 2019-06-15 12:51:45 +01:00
Peter Boyle
8e394d3bf9 New loop construct 2019-06-15 12:51:15 +01:00
Peter Boyle
b881d5489b Move SchurDiagTwoKappa to Algorithms 2019-06-15 12:50:45 +01:00
Peter Boyle
49f90cc7eb use pragma once 2019-06-15 12:45:22 +01:00
Peter Boyle
b77af0210b Thread loop. Probably deprecate this impl 2019-06-15 12:44:56 +01:00
Peter Boyle
5254ede2d8 New loops. Revisit as accelerator loop in future audit 2019-06-15 12:44:29 +01:00
Peter Boyle
16e5d7945e Hard to make 5D vec work with GPU code 2019-06-15 12:43:43 +01:00
Peter Boyle
decc99ca76 Accelerator version 2019-06-15 12:43:00 +01:00
Peter Boyle
464cd65931 Still to test this fully 2019-06-15 12:35:14 +01:00
Peter Boyle
a1ec2f4723 Still to test this routine fully 2019-06-15 12:33:55 +01:00
Peter Boyle
ea9662ec85 Thread loop changes 2019-06-15 09:09:57 +01:00
Peter Boyle
52c74f1cac Thread loop changes 2019-06-15 09:08:16 +01:00
Peter Boyle
9a13d2992c lean up 2019-06-15 09:05:16 +01:00
Peter Boyle
b0449ae270 Thread loop changes 2019-06-15 09:04:19 +01:00
Peter Boyle
1299225105 Accelerator loop changes 2019-06-15 09:03:46 +01:00
Peter Boyle
5925e7f405 Thread for changes 2019-06-15 09:01:30 +01:00
Peter Boyle
be1fd4930f Template instantiation make happy changes 2019-06-15 08:37:34 +01:00
Peter Boyle
36f06555a2 Simplify Impl 2019-06-09 22:26:27 +01:00
Peter Boyle
d6c0e0756d Remove GPU version 2019-06-09 11:23:42 +01:00
Peter Boyle
3e41b1055c Remove Gpu only kernels. 2019-06-09 11:20:01 +01:00
Peter Boyle
e78a5e7838 ASM instantiation without link errors 2019-06-09 01:25:21 +01:00
Peter Boyle
8e3a05d89b Moving the instantiation into a cleaner structure 2019-06-08 13:48:33 +01:00
Peter Boyle
c933ac2248 Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
accelerator_loop
2019-06-08 13:44:27 +01:00
Peter Boyle
ad2c433574 Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but doesn't
seem to make a difference. Will revisit this and probably remove the lane parameter from the coalescedRead
2019-06-08 13:43:12 +01:00
Peter Boyle
86e7fb6e86 Instantiation relocation 2019-06-08 13:42:46 +01:00
Peter Boyle
fb91dda7be Hand instantiation moved location 2019-06-08 13:42:26 +01:00
Peter Boyle
82cf7bc5ab Move instantiation into fermion/instantiation 2019-06-08 13:41:46 +01:00
Peter Boyle
e452cc0a22 Move static variables into instantiation .cc file 2019-06-08 13:41:20 +01:00
Peter Boyle
4d2b938166 Remove explict instantiation from here 2019-06-08 13:41:01 +01:00
Peter Boyle
10d16ab76c Remove explict instantiation from here 2019-06-08 13:40:32 +01:00
Peter Boyle
1f997fa484 Instantiate via explict .cc files for parallel make. 2019-06-08 13:39:51 +01:00
Peter Boyle
0ee6e77cbc Compiles GPU and CPU, still gives good performance on CPU 2019-06-05 13:28:16 +01:00
Peter Boyle
18d3cde29a Compile on GPU workd 2019-06-05 00:14:58 +01:00
Peter Boyle
7323099966 Instatiation fix 2019-06-05 00:14:38 +01:00
Peter Boyle
6379651cdd Generic or GPU ready for benchmark test on GPU 2019-06-05 00:13:52 +01:00
Peter Boyle
ba4fd756b9 Fix signature, but deprecating this loops style 2019-06-05 00:12:36 +01:00
Peter Boyle
d185fc1ebf clean up instantiation 2019-06-05 00:11:52 +01:00
Peter Boyle
96b36d8367 Instantiation clean up 2019-06-05 00:11:27 +01:00
Peter Boyle
899f8b5065 Instantiation clean up 5d vec removal 2019-06-05 00:11:05 +01:00
Peter Boyle
c8d0483fe9 Remove 5d vectorisation 2019-06-05 00:10:37 +01:00
Peter Boyle
0f214e5f76 Clean up instantiation 2019-06-05 00:10:13 +01:00
Peter Boyle
9636324069 GPU happy code 2019-06-05 00:08:54 +01:00
Peter Boyle
8a5489d9e6 Move the loop into a central kernel call. 2019-06-05 00:08:13 +01:00
Peter Boyle
b47f73c222 GPU happy 2019-06-04 21:30:39 +01:00
Peter Boyle
5720ced0fd Simplifying 2019-06-04 21:30:08 +01:00
Peter Boyle
2c87b56b53 Making GPU happier 2019-06-04 21:29:44 +01:00
Peter Boyle
dbad48d802 Remove Ls vectorised DWF 2019-06-04 21:27:40 +01:00
Peter Boyle
4557a1365a Remove Ls vectorised DWF 2019-06-04 20:59:59 +01:00
Peter Boyle
16e9b87d98 Remove Ls vectorised DWF as unused and hard to maintain 2019-06-04 20:59:01 +01:00