mirror of https://github.com/paboyle/Grid.git synced 2025-06-13 04:37:05 +01:00

Compiles for both GPU and CPU; still gives good performance on CPU

Peter Boyle
2019-06-05 13:28:16 +01:00
parent 18d3cde29a
commit 0ee6e77cbc
71 changed files with 1512 additions and 33769 deletions

TODO

@@ -3,19 +3,23 @@
GPU branch code item work list
-----------------------------
1) Common source GPU and CPU generic kernels???
- coalescedRead, coalescedWrite in expressions.
- Uniform coding between GPU kernels and CPU kernels attempt
- Clean up PRAGMAS
-- Figure what to do about "multLinkGpu" etc.. in FermionOperatorImpl.
-- Gparity is the awkward one
-- Solve non-Gparity first.
-- Simplify the operator IMPL support
2) - SIMD dirs in stencil
3) Merge develop and test HMC
4) GPU accelerate EOFA
5) Accelerate the cshift
@@ -43,7 +47,6 @@ Single GPU simd target (VGPU)
15) Staggered kernels inline for GPU
-----
Gianluca's changes
- Performance impact of construct in aligned allocator???