1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-25 19:22:03 +01:00
Commit Graph

148 Commits

Author SHA1 Message Date
1282e1067f Do the force term on the accelerator too. Needed particularly because comms buffers
are device memory.
2019-07-29 22:58:35 +01:00
fe700a183a Getting HMC to run 2019-07-26 12:18:29 +01:00
fa9cd50c5b Merge branch 'develop' into feature/gpu-port 2019-07-16 11:55:17 +01:00
bd155ca5c0 Overlap comms with comput now supported 2019-07-12 09:09:40 +01:00
d7b3efe893 Compile fix 2019-06-15 17:03:15 +01:00
decc99ca76 Accelerator version 2019-06-15 12:43:00 +01:00
464cd65931 Still to test this fully 2019-06-15 12:35:14 +01:00
a1ec2f4723 Still to test this routine fully 2019-06-15 12:33:55 +01:00
ea9662ec85 Thread loop changes 2019-06-15 09:09:57 +01:00
52c74f1cac Thread loop changes 2019-06-15 09:08:16 +01:00
9a13d2992c lean up 2019-06-15 09:05:16 +01:00
b0449ae270 Thread loop changes 2019-06-15 09:04:19 +01:00
1299225105 Accelerator loop changes 2019-06-15 09:03:46 +01:00
5925e7f405 Thread for changes 2019-06-15 09:01:30 +01:00
36f06555a2 Simplify Impl 2019-06-09 22:26:27 +01:00
d6c0e0756d Remove GPU version 2019-06-09 11:23:42 +01:00
3e41b1055c Remove Gpu only kernels. 2019-06-09 11:20:01 +01:00
e78a5e7838 ASM instantiation without link errors 2019-06-09 01:25:21 +01:00
c933ac2248 Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
accelerator_loop
2019-06-08 13:44:27 +01:00
ad2c433574 Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but doesn't
seem to make a difference. Will revisit this and probably remove the lane parameter from the coalescedRead
2019-06-08 13:43:12 +01:00
86e7fb6e86 Instantiation relocation 2019-06-08 13:42:46 +01:00
fb91dda7be Hand instantiation moved location 2019-06-08 13:42:26 +01:00
82cf7bc5ab Move instantiation into fermion/instantiation 2019-06-08 13:41:46 +01:00
e452cc0a22 Move static variables into instantiation .cc file 2019-06-08 13:41:20 +01:00
4d2b938166 Remove explict instantiation from here 2019-06-08 13:41:01 +01:00
10d16ab76c Remove explict instantiation from here 2019-06-08 13:40:32 +01:00
0ee6e77cbc Compiles GPU and CPU, still gives good performance on CPU 2019-06-05 13:28:16 +01:00
7323099966 Instatiation fix 2019-06-05 00:14:38 +01:00
6379651cdd Generic or GPU ready for benchmark test on GPU 2019-06-05 00:13:52 +01:00
ba4fd756b9 Fix signature, but deprecating this loops style 2019-06-05 00:12:36 +01:00
d185fc1ebf clean up instantiation 2019-06-05 00:11:52 +01:00
96b36d8367 Instantiation clean up 2019-06-05 00:11:27 +01:00
899f8b5065 Instantiation clean up 5d vec removal 2019-06-05 00:11:05 +01:00
c8d0483fe9 Remove 5d vectorisation 2019-06-05 00:10:37 +01:00
0f214e5f76 Clean up instantiation 2019-06-05 00:10:13 +01:00
ade4a126da Getting closer on the GPU port, but will start deleting 5th dim vectorised variants
for code maintainability
2019-06-04 11:53:44 +01:00
7b59ab5bd7 Compiling after reorganisation 2019-06-03 15:46:26 +01:00
fcd8cfe257 Gparity in 2019-06-03 15:45:09 +01:00
b4b53812cb Move implementation to specific implementation headers 2019-06-03 15:43:01 +01:00
085cac583f Implementation in header 2019-06-03 15:42:36 +01:00
25e3b8640c Move to header 2019-06-03 15:42:05 +01:00
44bbec50b0 Making GPU compile happy 2019-06-03 14:57:04 +01:00
ec68b67d5d Attempt at unified GPU and CPU kernel 2019-06-03 14:55:51 +01:00
778450e0c8 Move to implementation subdir 2019-06-03 14:53:56 +01:00
567aa5f366 Move to implementation subdir 2019-06-03 14:53:33 +01:00
2ab7e2b175 Force instantiation in .cc files.
Eventually move into multiple files
2019-06-03 14:52:59 +01:00
6f61be044d Dont instantiate in header 2019-06-03 14:52:01 +01:00
269e00509e Don't instantiate in header 2019-06-03 14:51:24 +01:00