1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-10-26 09:39:34 +00:00
Commit Graph

3807 Commits

Author SHA1 Message Date
Peter Boyle
299d119013 GPU work allocation improved 2018-03-22 18:04:24 -04:00
Peter Boyle
55be842d23 Dont force l1p.h so early 2018-03-22 18:01:43 -04:00
Peter Boyle
9875c446c6 Clean up pragmas 2018-03-20 07:19:17 -04:00
Peter Boyle
9c25eb35ca Eigen develop branch for now 2018-03-20 07:18:56 -04:00
Peter Boyle
5ac96dbdc6 Warm behaviour in SU3 benchmark 2018-03-20 07:18:31 -04:00
Peter Boyle
5cc9aca85d Use 64bit index for looping 2018-03-20 06:34:52 -04:00
Peter Boyle
ac29ebcb95 Clean up debug prints 2018-03-20 06:33:59 -04:00
Peter Boyle
a5cfb89304 Update eigen process direct from develop on github. Dangerous, but needed from GPU 2018-03-19 07:20:48 -04:00
Peter Boyle
f04a7251cc Gpu welcome message and device info 2018-03-19 07:12:12 -04:00
Peter Boyle
d4ce7d9905 GPU friendly Stencil needs a view 2018-03-19 07:11:21 -04:00
Peter Boyle
8a1d303ab9 GPU friendly stencil improvements 2018-03-19 07:11:03 -04:00
Peter Boyle
bf0a4de919 GPU friendly params object 2018-03-19 07:10:12 -04:00
Peter Boyle
6fe5885fe4 Warning suppress 2018-03-19 07:09:49 -04:00
Peter Boyle
17ac309e84 Fix the compile 2018-03-19 07:08:59 -04:00
Peter Boyle
7467a1c027 Latest eigen needed for GPU 2018-03-19 07:08:10 -04:00
Peter Boyle
fdfb8a26a8 Disable eigen vectorisation on GPU because of Summit compile issues 2018-03-19 07:07:30 -04:00
paboyle
2df4e422ad Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port 2018-03-18 14:45:41 +00:00
paboyle
3a3e3cac40 Pull the trigger on offload 2018-03-18 14:45:29 +00:00
paboyle
b1c02ec310 MallocManaged in GPU 2018-03-18 14:44:46 +00:00
paboyle
38eadee2c9 Prettier code 2018-03-18 14:44:22 +00:00
paboyle
42c70437be Views 2018-03-18 14:43:47 +00:00
paboyle
65274b4d7f Tidy up 2018-03-18 14:43:16 +00:00
Peter Boyle
7e8be32755 Typo fix 2018-03-13 19:22:31 -04:00
paboyle
ff761ea4e6 Bound check improvement 2018-03-09 20:00:46 +00:00
paboyle
a31d3e60d8 Better bounds check 2018-03-09 18:10:21 +00:00
paboyle
4d60b92b7f Update oSites 2018-03-08 21:00:25 +00:00
paboyle
c159c70c84 View introduced 2018-03-08 14:58:04 +00:00
paboyle
28b5572755 Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port 2018-03-08 13:01:42 +00:00
paboyle
5fac7080bc Adding -fno-strict-aliasing by default 2018-03-08 13:01:24 +00:00
Peter Boyle
4548523ecc This modification eliminates what looks like a compiler bug
on Intel 2017.
2018-03-08 04:41:16 -08:00
paboyle
4154fc6f44 Revert a change 2018-03-07 16:54:11 +00:00
paboyle
4e3458516a Reverting after fixing issue with extract merge 2018-03-07 16:50:13 +00:00
Peter Boyle
90a2efb9b3 Hit an annoying strict alias optimisation in GCC 4.9 through 6.3
Chris K was correct. It appears that an additional memcpy (UGHHH) is enough
to suppress the compiler
2018-03-07 07:27:26 -08:00
paboyle
40699221e2 Dont alias lhs and rhs in a where statement 2018-03-06 04:14:13 -08:00
paboyle
3cb1b545d0 Don't alias the variables with a where statement. 2018-03-06 04:13:26 -08:00
paboyle
e199ba7e88 Fix the Charge conjugate BC's 2018-03-05 13:59:02 +00:00
paboyle
4d53703c67 Scalar type differeing allowed, eg. precisoin change 2018-03-05 11:39:52 +00:00
paboyle
d506c59efa Warnings disabled 2018-03-05 11:39:20 +00:00
paboyle
44188a5c6f AVX512 fix 2018-03-05 00:32:24 +00:00
paboyle
2018077770 Make NVCC happy with the compile. This is warning free on 9.1 on my laptop (both make and make tests). 2018-03-05 00:28:24 +00:00
paboyle
984e06e2b5 Introduce view objects that can safely be copied to GPU for access 2018-03-04 16:40:11 +00:00
paboyle
aead94e9a7 View introduced 2018-03-04 16:39:29 +00:00
paboyle
3277bda130 View introduction to prepare for accelerator offload.
Probably same problem exists for stencil object
2018-03-04 16:38:08 +00:00
paboyle
442b0b406c View related changes 2018-03-04 16:34:14 +00:00
paboyle
8824a54269 View related changes 2018-03-04 16:33:33 +00:00
paboyle
c03423250f Indexable changes 2018-03-04 16:31:35 +00:00
paboyle
317fd0da44 Views introduced. Need to accelerator offload these routines. 2018-03-04 16:30:45 +00:00
paboyle
783795a44a Views introduced 2018-03-04 16:12:49 +00:00
paboyle
0e6197fbed Introduce accelerator friendly expression template rewrite.
Must obtain and access lattice indexing through a view object that is safe
to copy construct in copy to GPU (without copying the lattice).
2018-03-04 16:03:19 +00:00
paboyle
dad7862f91 Go through a view object that can be copied to GPU 2018-03-04 16:02:02 +00:00