1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-14 17:55:38 +00:00
Commit Graph

3795 Commits

Author SHA1 Message Date
paboyle
26863b6d95 User Managed memory 2018-06-13 20:13:42 +01:00
paboyle
ebd730bd54 Adding 2D loops 2018-06-13 20:13:01 +01:00
paboyle
066be31a3b Optional GPU target SIMD types; work in progress and trying experiments 2018-06-13 20:07:55 +01:00
paboyle
7a4c142955 Add GPU specific simd targets 2018-06-13 19:55:30 +01:00
Peter Boyle
eb7d34a4cc GPU version 2018-05-14 19:41:47 -04:00
Peter Boyle
aab27a655a Start of GPU kernels 2018-05-14 19:41:17 -04:00
Peter Boyle
93280bae85 Gpu option 2018-05-14 19:40:58 -04:00
Peter Boyle
c5f93abcd7 GPU clean up 2018-05-14 19:40:33 -04:00
Peter Boyle
d5deef782d Useful debug comments 2018-05-14 19:39:52 -04:00
Peter Boyle
5f50473c0d Clean up 2018-05-14 19:39:11 -04:00
Peter Boyle
13f50406e3 Suppress print statement 2018-05-12 18:00:00 -04:00
Peter Boyle
09cd46d337 Lane by Lane operation 2018-05-12 17:59:35 -04:00
Peter Boyle
d3f51065c2 Give command line control of blocks/threads split 2018-05-12 17:58:56 -04:00
Peter Boyle
925ac4173d Thread count control for warp scheduler thingy doodaa thing 2018-05-12 17:58:22 -04:00
Peter Boyle
eb921041d0 Perf count control 2018-05-12 17:57:32 -04:00
Peter Boyle
87c5c0271b Ficxing eigen 2018-04-16 19:08:07 -04:00
Peter Boyle
a3f5a13591 Better Eigen handling 2018-04-16 18:02:55 -04:00
Peter Boyle
9fe28f00eb Eigen sim link off head revision 2018-04-16 17:54:46 -04:00
Peter Boyle
a8a0bb85cc Control scalar execution or vector under generic. Disable Eigen vectorisation on powerpc / SUmmit 2018-04-12 12:32:57 -04:00
Peter Boyle
6411caad67 work distribution 2018-04-12 11:41:41 -04:00
Peter Boyle
7533035a99 Control Eigen vectorisatoin 2018-04-12 11:40:56 -04:00
Peter Boyle
b15db11c60 Kernels -> pure static object to enable device execution 2018-03-24 19:35:20 -04:00
Peter Boyle
f6077f9d48 Kernels -> not instantiaed otherwise object ref on GPU 2018-03-24 19:33:44 -04:00
Peter Boyle
572954ef12 Kernels not an instantiated object, just static 2018-03-24 19:33:13 -04:00
Peter Boyle
cedeaae7db Lebesge -> StencilView if necessary 2018-03-24 19:32:41 -04:00
Peter Boyle
e6cf0b1e17 View typedefs go to OperatorImpl 2018-03-24 19:32:11 -04:00
Peter Boyle
5412628ea6 begin end lamda 2018-03-24 19:31:45 -04:00
Peter Boyle
1f70cedbab Have to make all kernel called routines static since object reference will be a host pointer on GPU 2018-03-24 19:29:26 -04:00
Peter Boyle
b50f37cfb4 Remove overlap comms flag 2018-03-24 19:28:53 -04:00
Peter Boyle
cb0d2a1b03 threaded rng init; I thought this was on 2018-03-24 19:28:17 -04:00
Peter Boyle
6fe9b28a82 Cosmetic 2018-03-24 19:27:14 -04:00
Peter Boyle
b002587d7c Simplify 2018-03-24 19:26:44 -04:00
Peter Boyle
6c08385782 Simplify 2018-03-24 19:26:19 -04:00
Peter Boyle
4e1272fabf Kernels need to be static to work on GPU. No reference to host resident data 2018-03-22 18:44:53 -04:00
Peter Boyle
607dc2d3c6 Remove lebesgue order 2018-03-22 18:23:09 -04:00
Peter Boyle
23c880b009 Remove lebesgue order; stick in stencil if need 2018-03-22 18:13:41 -04:00
Peter Boyle
334bb6792f Lebesgue order removed. Stick in the stencil view 2018-03-22 18:12:12 -04:00
Peter Boyle
a3690071b4 Warm up GPu 2018-03-22 18:05:20 -04:00
Peter Boyle
299d119013 GPU work allocation improved 2018-03-22 18:04:24 -04:00
Peter Boyle
55be842d23 Dont force l1p.h so early 2018-03-22 18:01:43 -04:00
Peter Boyle
9875c446c6 Clean up pragmas 2018-03-20 07:19:17 -04:00
Peter Boyle
9c25eb35ca Eigen develop branch for now 2018-03-20 07:18:56 -04:00
Peter Boyle
5ac96dbdc6 Warm behaviour in SU3 benchmark 2018-03-20 07:18:31 -04:00
Peter Boyle
5cc9aca85d Use 64bit index for looping 2018-03-20 06:34:52 -04:00
Peter Boyle
ac29ebcb95 Clean up debug prints 2018-03-20 06:33:59 -04:00
Peter Boyle
a5cfb89304 Update eigen process direct from develop on github. Dangerous, but needed from GPU 2018-03-19 07:20:48 -04:00
Peter Boyle
f04a7251cc Gpu welcome message and device info 2018-03-19 07:12:12 -04:00
Peter Boyle
d4ce7d9905 GPU friendly Stencil needs a view 2018-03-19 07:11:21 -04:00
Peter Boyle
8a1d303ab9 GPU friendly stencil improvements 2018-03-19 07:11:03 -04:00
Peter Boyle
bf0a4de919 GPU friendly params object 2018-03-19 07:10:12 -04:00