1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-04-04 11:15:55 +01:00

Plan for GPU port

This commit is contained in:
Peter Boyle 2019-05-20 09:46:19 +01:00
parent a9342c6ae5
commit a0e9f3b0a0

40
TODO
View File

@ -3,36 +3,42 @@
GPU branch code item work list GPU branch code item work list
----------------------------- -----------------------------
- Investigate why slower than september 1) Common source GPU and CPU generic kernels???
- Common source GPU and CPU generic kernels???
- coalescedRead, coalescedWrite in expressions. - coalescedRead, coalescedWrite in expressions.
- Uniform coding between GPU kernels and CPU kernels attempt - Uniform coding between GPU kernels and CPU kernels attempt
- SIMD dirs in stencil 2) - SIMD dirs in stencil
- Merge develop and test HMC 3) Merge develop and test HMC
- GPU accelerate EOFA 4) GPU accelerate EOFA
- Make GPU offload reductions optionally deterministic 5) Accelerate the cshift
- Accelerate the cshift 6) Make GPU offload reductions optionally deterministic -- Gianluca
- Gamma tables on GPU; check this. 7) Investigate why slower than september
- Mobius kernel fusion. Single GPU simd target (VGPU)
- Reread WilsonKernels and check diffs 8) Gamma tables on GPU; check this.
- thread_loop interface revisit. 9) Mobius kernel fusion. -- Gianluca?
- pragma once uniformly
- Audit changes
- Audit NAMESPACE CHANGES
- Staggered kernels inline for GPU 10) Reread WilsonKernels and check diffs
11) thread_loop interface revisit.
for_n
for
12) pragma once uniformly
13) Audit changes
14) Audit NAMESPACE CHANGES
15) Staggered kernels inline for GPU
- Single GPU simd target (VGPU)
----- -----
Gianluca's changes Gianluca's changes