mirror of
https://github.com/paboyle/Grid.git
synced 2025-06-13 04:37:05 +01:00
Compiles GPU and CPU, still gives good performance on CPU
This commit is contained in:
7
TODO
7
TODO
@ -3,19 +3,23 @@
|
||||
GPU branch code item work list
|
||||
-----------------------------
|
||||
|
||||
|
||||
|
||||
1) Common source GPU and CPU generic kernels???
|
||||
- coalescedRead, coalescedWrite in expressions.
|
||||
- Uniform coding between GPU kernels and CPU kernels attempt
|
||||
- Clean up PRAGMAS
|
||||
|
||||
-- Figure what to do about "multLinkGpu" etc.. in FermionOperatorImpl.
|
||||
-- Gparity is the awkward one
|
||||
-- Solve non-Gparity first.
|
||||
-- Simplify the operator IMPL support
|
||||
|
||||
2) - SIMD dirs in stencil
|
||||
|
||||
3) Merge develop and test HMC
|
||||
|
||||
4) GPU accelerate EOFA
|
||||
4) GPU accelerate EOFA
|
||||
|
||||
5) Accelerate the cshift
|
||||
|
||||
@ -43,7 +47,6 @@ Single GPU simd target (VGPU)
|
||||
|
||||
15) Staggered kernels inline for GPU
|
||||
|
||||
|
||||
-----
|
||||
Gianluca's changes
|
||||
- Performance impact of construct in aligned allocator???
|
||||
|
Reference in New Issue
Block a user