Peter Boyle
|
b15db11c60
|
Kernels -> pure static object to enable device execution
|
2018-03-24 19:35:20 -04:00 |
|
Peter Boyle
|
f6077f9d48
|
Kernels -> not instantiated, otherwise object ref on GPU
|
2018-03-24 19:33:44 -04:00 |
|
Peter Boyle
|
572954ef12
|
Kernels not an instantiated object, just static
|
2018-03-24 19:33:13 -04:00 |
|
Peter Boyle
|
cedeaae7db
|
Lebesgue -> StencilView if necessary
|
2018-03-24 19:32:41 -04:00 |
|
Peter Boyle
|
e6cf0b1e17
|
View typedefs go to OperatorImpl
|
2018-03-24 19:32:11 -04:00 |
|
Peter Boyle
|
5412628ea6
|
begin end lambda
|
2018-03-24 19:31:45 -04:00 |
|
Peter Boyle
|
1f70cedbab
|
Have to make all kernel called routines static since object reference will be a host pointer on GPU
|
2018-03-24 19:29:26 -04:00 |
|
Peter Boyle
|
b50f37cfb4
|
Remove overlap comms flag
|
2018-03-24 19:28:53 -04:00 |
|
Peter Boyle
|
cb0d2a1b03
|
threaded rng init; I thought this was on
|
2018-03-24 19:28:17 -04:00 |
|
Peter Boyle
|
6fe9b28a82
|
Cosmetic
|
2018-03-24 19:27:14 -04:00 |
|
Peter Boyle
|
b002587d7c
|
Simplify
|
2018-03-24 19:26:44 -04:00 |
|
Peter Boyle
|
6c08385782
|
Simplify
|
2018-03-24 19:26:19 -04:00 |
|
Peter Boyle
|
4e1272fabf
|
Kernels need to be static to work on GPU. No reference to host resident data
|
2018-03-22 18:44:53 -04:00 |
|
Peter Boyle
|
607dc2d3c6
|
Remove lebesgue order
|
2018-03-22 18:23:09 -04:00 |
|
Peter Boyle
|
23c880b009
|
Remove lebesgue order; stick in stencil if needed
|
2018-03-22 18:13:41 -04:00 |
|
Peter Boyle
|
334bb6792f
|
Lebesgue order removed. Stick in the stencil view
|
2018-03-22 18:12:12 -04:00 |
|
Peter Boyle
|
a3690071b4
|
Warm up GPU
|
2018-03-22 18:05:20 -04:00 |
|
Peter Boyle
|
299d119013
|
GPU work allocation improved
|
2018-03-22 18:04:24 -04:00 |
|
Peter Boyle
|
55be842d23
|
Don't force l1p.h so early
|
2018-03-22 18:01:43 -04:00 |
|
Peter Boyle
|
9875c446c6
|
Clean up pragmas
|
2018-03-20 07:19:17 -04:00 |
|
Peter Boyle
|
9c25eb35ca
|
Eigen develop branch for now
|
2018-03-20 07:18:56 -04:00 |
|
Peter Boyle
|
5ac96dbdc6
|
Warm behaviour in SU3 benchmark
|
2018-03-20 07:18:31 -04:00 |
|
Peter Boyle
|
5cc9aca85d
|
Use 64bit index for looping
|
2018-03-20 06:34:52 -04:00 |
|
Peter Boyle
|
ac29ebcb95
|
Clean up debug prints
|
2018-03-20 06:33:59 -04:00 |
|
Peter Boyle
|
a5cfb89304
|
Update eigen process direct from develop on github. Dangerous, but needed for GPU
|
2018-03-19 07:20:48 -04:00 |
|
Peter Boyle
|
f04a7251cc
|
GPU welcome message and device info
|
2018-03-19 07:12:12 -04:00 |
|
Peter Boyle
|
d4ce7d9905
|
GPU friendly Stencil needs a view
|
2018-03-19 07:11:21 -04:00 |
|
Peter Boyle
|
8a1d303ab9
|
GPU friendly stencil improvements
|
2018-03-19 07:11:03 -04:00 |
|
Peter Boyle
|
bf0a4de919
|
GPU friendly params object
|
2018-03-19 07:10:12 -04:00 |
|
Peter Boyle
|
6fe5885fe4
|
Warning suppress
|
2018-03-19 07:09:49 -04:00 |
|
Peter Boyle
|
17ac309e84
|
Fix the compile
|
2018-03-19 07:08:59 -04:00 |
|
Peter Boyle
|
7467a1c027
|
Latest eigen needed for GPU
|
2018-03-19 07:08:10 -04:00 |
|
Peter Boyle
|
fdfb8a26a8
|
Disable eigen vectorisation on GPU because of Summit compile issues
|
2018-03-19 07:07:30 -04:00 |
|
paboyle
|
2df4e422ad
|
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
|
2018-03-18 14:45:41 +00:00 |
|
paboyle
|
3a3e3cac40
|
Pull the trigger on offload
|
2018-03-18 14:45:29 +00:00 |
|
paboyle
|
b1c02ec310
|
MallocManaged in GPU
|
2018-03-18 14:44:46 +00:00 |
|
paboyle
|
38eadee2c9
|
Prettier code
|
2018-03-18 14:44:22 +00:00 |
|
paboyle
|
42c70437be
|
Views
|
2018-03-18 14:43:47 +00:00 |
|
paboyle
|
65274b4d7f
|
Tidy up
|
2018-03-18 14:43:16 +00:00 |
|
Peter Boyle
|
7e8be32755
|
Typo fix
|
2018-03-13 19:22:31 -04:00 |
|
paboyle
|
ff761ea4e6
|
Bound check improvement
|
2018-03-09 20:00:46 +00:00 |
|
paboyle
|
a31d3e60d8
|
Better bounds check
|
2018-03-09 18:10:21 +00:00 |
|
paboyle
|
4d60b92b7f
|
Update oSites
|
2018-03-08 21:00:25 +00:00 |
|
paboyle
|
c159c70c84
|
View introduced
|
2018-03-08 14:58:04 +00:00 |
|
paboyle
|
28b5572755
|
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
|
2018-03-08 13:01:42 +00:00 |
|
paboyle
|
5fac7080bc
|
Adding -fno-strict-aliasing by default
|
2018-03-08 13:01:24 +00:00 |
|
Peter Boyle
|
4548523ecc
|
This modification eliminates what looks like a compiler bug on Intel 2017.
|
2018-03-08 04:41:16 -08:00 |
|
paboyle
|
4154fc6f44
|
Revert a change
|
2018-03-07 16:54:11 +00:00 |
|
paboyle
|
4e3458516a
|
Reverting after fixing issue with extract merge
|
2018-03-07 16:50:13 +00:00 |
|
Peter Boyle
|
90a2efb9b3
|
Hit an annoying strict alias optimisation in GCC 4.9 through 6.3
Chris K was correct. It appears that an additional memcpy (UGHHH) is enough
to suppress the compiler
|
2018-03-07 07:27:26 -08:00 |
|