1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-10 07:55:35 +00:00
Commit Graph

3786 Commits

Author SHA1 Message Date
Peter Boyle
5f50473c0d Clean up 2018-05-14 19:39:11 -04:00
Peter Boyle
13f50406e3 Suppress print statement 2018-05-12 18:00:00 -04:00
Peter Boyle
09cd46d337 Lane by Lane operation 2018-05-12 17:59:35 -04:00
Peter Boyle
d3f51065c2 Give command line control of blocks/threads split 2018-05-12 17:58:56 -04:00
Peter Boyle
925ac4173d Thread count control for warp scheduler thingy doodaa thing 2018-05-12 17:58:22 -04:00
Peter Boyle
eb921041d0 Perf count control 2018-05-12 17:57:32 -04:00
Peter Boyle
87c5c0271b Ficxing eigen 2018-04-16 19:08:07 -04:00
Peter Boyle
a3f5a13591 Better Eigen handling 2018-04-16 18:02:55 -04:00
Peter Boyle
9fe28f00eb Eigen sim link off head revision 2018-04-16 17:54:46 -04:00
Peter Boyle
a8a0bb85cc Control scalar execution or vector under generic. Disable Eigen vectorisation on powerpc / SUmmit 2018-04-12 12:32:57 -04:00
Peter Boyle
6411caad67 work distribution 2018-04-12 11:41:41 -04:00
Peter Boyle
7533035a99 Control Eigen vectorisatoin 2018-04-12 11:40:56 -04:00
Peter Boyle
b15db11c60 Kernels -> pure static object to enable device execution 2018-03-24 19:35:20 -04:00
Peter Boyle
f6077f9d48 Kernels -> not instantiaed otherwise object ref on GPU 2018-03-24 19:33:44 -04:00
Peter Boyle
572954ef12 Kernels not an instantiated object, just static 2018-03-24 19:33:13 -04:00
Peter Boyle
cedeaae7db Lebesge -> StencilView if necessary 2018-03-24 19:32:41 -04:00
Peter Boyle
e6cf0b1e17 View typedefs go to OperatorImpl 2018-03-24 19:32:11 -04:00
Peter Boyle
5412628ea6 begin end lamda 2018-03-24 19:31:45 -04:00
Peter Boyle
1f70cedbab Have to make all kernel called routines static since object reference will be a host pointer on GPU 2018-03-24 19:29:26 -04:00
Peter Boyle
b50f37cfb4 Remove overlap comms flag 2018-03-24 19:28:53 -04:00
Peter Boyle
cb0d2a1b03 threaded rng init; I thought this was on 2018-03-24 19:28:17 -04:00
Peter Boyle
6fe9b28a82 Cosmetic 2018-03-24 19:27:14 -04:00
Peter Boyle
b002587d7c Simplify 2018-03-24 19:26:44 -04:00
Peter Boyle
6c08385782 Simplify 2018-03-24 19:26:19 -04:00
Peter Boyle
4e1272fabf Kernels need to be static to work on GPU. No reference to host resident data 2018-03-22 18:44:53 -04:00
Peter Boyle
607dc2d3c6 Remove lebesgue order 2018-03-22 18:23:09 -04:00
Peter Boyle
23c880b009 Remove lebesgue order; stick in stencil if need 2018-03-22 18:13:41 -04:00
Peter Boyle
334bb6792f Lebesgue order removed. Stick in the stencil view 2018-03-22 18:12:12 -04:00
Peter Boyle
a3690071b4 Warm up GPu 2018-03-22 18:05:20 -04:00
Peter Boyle
299d119013 GPU work allocation improved 2018-03-22 18:04:24 -04:00
Peter Boyle
55be842d23 Dont force l1p.h so early 2018-03-22 18:01:43 -04:00
Peter Boyle
9875c446c6 Clean up pragmas 2018-03-20 07:19:17 -04:00
Peter Boyle
9c25eb35ca Eigen develop branch for now 2018-03-20 07:18:56 -04:00
Peter Boyle
5ac96dbdc6 Warm behaviour in SU3 benchmark 2018-03-20 07:18:31 -04:00
Peter Boyle
5cc9aca85d Use 64bit index for looping 2018-03-20 06:34:52 -04:00
Peter Boyle
ac29ebcb95 Clean up debug prints 2018-03-20 06:33:59 -04:00
Peter Boyle
a5cfb89304 Update eigen process direct from develop on github. Dangerous, but needed from GPU 2018-03-19 07:20:48 -04:00
Peter Boyle
f04a7251cc Gpu welcome message and device info 2018-03-19 07:12:12 -04:00
Peter Boyle
d4ce7d9905 GPU friendly Stencil needs a view 2018-03-19 07:11:21 -04:00
Peter Boyle
8a1d303ab9 GPU friendly stencil improvements 2018-03-19 07:11:03 -04:00
Peter Boyle
bf0a4de919 GPU friendly params object 2018-03-19 07:10:12 -04:00
Peter Boyle
6fe5885fe4 Warning suppress 2018-03-19 07:09:49 -04:00
Peter Boyle
17ac309e84 Fix the compile 2018-03-19 07:08:59 -04:00
Peter Boyle
7467a1c027 Latest eigen needed for GPU 2018-03-19 07:08:10 -04:00
Peter Boyle
fdfb8a26a8 Disable eigen vectorisation on GPU because of Summit compile issues 2018-03-19 07:07:30 -04:00
paboyle
2df4e422ad Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port 2018-03-18 14:45:41 +00:00
paboyle
3a3e3cac40 Pull the trigger on offload 2018-03-18 14:45:29 +00:00
paboyle
b1c02ec310 MallocManaged in GPU 2018-03-18 14:44:46 +00:00
paboyle
38eadee2c9 Prettier code 2018-03-18 14:44:22 +00:00
paboyle
42c70437be Views 2018-03-18 14:43:47 +00:00