Peter Boyle
|
7e8be32755
|
Typo fix
|
2018-03-13 19:22:31 -04:00 |
|
paboyle
|
ff761ea4e6
|
Bound check improvement
|
2018-03-09 20:00:46 +00:00 |
|
paboyle
|
a31d3e60d8
|
Better bounds check
|
2018-03-09 18:10:21 +00:00 |
|
paboyle
|
4d60b92b7f
|
Update oSites
|
2018-03-08 21:00:25 +00:00 |
|
paboyle
|
c159c70c84
|
View introduced
|
2018-03-08 14:58:04 +00:00 |
|
paboyle
|
28b5572755
|
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
|
2018-03-08 13:01:42 +00:00 |
|
paboyle
|
5fac7080bc
|
Adding -fno-strict-aliasing by default
|
2018-03-08 13:01:24 +00:00 |
|
Peter Boyle
|
4548523ecc
|
This modification eliminates what looks like a compiler bug
on Intel 2017.
|
2018-03-08 04:41:16 -08:00 |
|
paboyle
|
4154fc6f44
|
Revert a change
|
2018-03-07 16:54:11 +00:00 |
|
paboyle
|
4e3458516a
|
Reverting after fixing issue with extract merge
|
2018-03-07 16:50:13 +00:00 |
|
Peter Boyle
|
90a2efb9b3
|
Hit an annoying strict alias optimisation in GCC 4.9 through 6.3
Chris K was correct. It appears that an additional memcpy (UGHHH) is enough
to suppress the compiler
|
2018-03-07 07:27:26 -08:00 |
|
paboyle
|
40699221e2
|
Don't alias lhs and rhs in a where statement
|
2018-03-06 04:14:13 -08:00 |
|
paboyle
|
3cb1b545d0
|
Don't alias the variables with a where statement.
|
2018-03-06 04:13:26 -08:00 |
|
paboyle
|
e199ba7e88
|
Fix the Charge conjugate BC's
|
2018-03-05 13:59:02 +00:00 |
|
paboyle
|
4d53703c67
|
Scalar type differing allowed, e.g. precision change
|
2018-03-05 11:39:52 +00:00 |
|
paboyle
|
d506c59efa
|
Warnings disabled
|
2018-03-05 11:39:20 +00:00 |
|
paboyle
|
44188a5c6f
|
AVX512 fix
|
2018-03-05 00:32:24 +00:00 |
|
paboyle
|
2018077770
|
Make NVCC happy with the compile. This is warning free on 9.1 on my laptop (both make and make tests).
|
2018-03-05 00:28:24 +00:00 |
|
paboyle
|
984e06e2b5
|
Introduce view objects that can safely be copied to GPU for access
|
2018-03-04 16:40:11 +00:00 |
|
paboyle
|
aead94e9a7
|
View introduced
|
2018-03-04 16:39:29 +00:00 |
|
paboyle
|
3277bda130
|
View introduction to prepare for accelerator offload.
Probably same problem exists for stencil object
|
2018-03-04 16:38:08 +00:00 |
|
paboyle
|
442b0b406c
|
View related changes
|
2018-03-04 16:34:14 +00:00 |
|
paboyle
|
8824a54269
|
View related changes
|
2018-03-04 16:33:33 +00:00 |
|
paboyle
|
c03423250f
|
Indexable changes
|
2018-03-04 16:31:35 +00:00 |
|
paboyle
|
317fd0da44
|
Views introduced. Need to accelerator offload these routines.
|
2018-03-04 16:30:45 +00:00 |
|
paboyle
|
783795a44a
|
Views introduced
|
2018-03-04 16:12:49 +00:00 |
|
paboyle
|
0e6197fbed
|
Introduce accelerator friendly expression template rewrite.
Must obtain and access lattice indexing through a view object that is safe
to copy construct in copy to GPU (without copying the lattice).
|
2018-03-04 16:03:19 +00:00 |
|
paboyle
|
dad7862f91
|
Go through a view object that can be copied to GPU
|
2018-03-04 16:02:02 +00:00 |
|
paboyle
|
c89a883448
|
where was deprecated and integrated to ET engine a long time ago. Remove dead old original code
|
2018-03-04 15:58:02 +00:00 |
|
paboyle
|
c204288fbc
|
Remove a couple of print statements
|
2018-03-04 15:57:15 +00:00 |
|
paboyle
|
ad739f042a
|
Introduce views for passing lattice indexing to accelerators.
|
2018-03-04 15:56:14 +00:00 |
|
paboyle
|
db988301d0
|
Introduce view objects for indexing lattices. Used to pass the view to accelerators
|
2018-03-04 15:55:16 +00:00 |
|
paboyle
|
9b1f29c4c2
|
Support a view for passing to accelerator
|
2018-03-04 15:54:35 +00:00 |
|
paboyle
|
e5ea04ee0c
|
Need to support precision change, and real replication in multiple simd lanes
|
2018-03-04 15:53:04 +00:00 |
|
paboyle
|
c92a3c6068
|
Need to support any vector type template and run on accelerator
|
2018-03-04 15:52:14 +00:00 |
|
paboyle
|
03f8da8fbc
|
enable-debug option for debug flags in compile
|
2018-03-04 15:51:47 +00:00 |
|
paboyle
|
78a9e31ff0
|
options more obvious
|
2018-02-24 22:26:32 +00:00 |
|
paboyle
|
c1fc947bb8
|
Coordinate handling GPU friendly + some GPU merge/extract improvements
|
2018-02-24 22:26:10 +00:00 |
|
paboyle
|
ff7b19a71b
|
Coordinate handling GPU ready avoid malloc
|
2018-02-24 22:25:39 +00:00 |
|
paboyle
|
1c16ffa1c1
|
Coordinate GPU ready. No malloc
|
2018-02-24 22:25:09 +00:00 |
|
paboyle
|
4962f59477
|
Eliminate both GPU issue and threading bottleneck by avoiding malloc in coordinate handling
|
2018-02-24 22:24:37 +00:00 |
|
paboyle
|
e158b60bce
|
GPU friendly coords
|
2018-02-24 22:23:47 +00:00 |
|
paboyle
|
34820bec27
|
Coordinate handling GPU ready. No malloc
|
2018-02-24 22:23:18 +00:00 |
|
paboyle
|
eed9aa9f0c
|
Extract merge gpu ready
|
2018-02-24 22:23:01 +00:00 |
|
paboyle
|
8792ff6439
|
Coordinate handling gpu ready
|
2018-02-24 22:22:43 +00:00 |
|
paboyle
|
078901278c
|
Coordinate handling gpu friendly
|
2018-02-24 22:22:02 +00:00 |
|
paboyle
|
bf5fb89aff
|
Coordinate handling GPU friendly
|
2018-02-24 22:21:36 +00:00 |
|
paboyle
|
7574c18cef
|
Massive clean up extract merge.
Simpler and GPU friendly
|
2018-02-24 22:21:08 +00:00 |
|
paboyle
|
36ea5f6b77
|
gpu friendly coordinates ; no std::vector on GPU
|
2018-02-24 22:20:14 +00:00 |
|
paboyle
|
285deab432
|
Coordinate handling GPU friendly. Avoid std::vector
|
2018-02-24 22:19:28 +00:00 |
|