mirror of https://github.com/paboyle/Grid.git
synced 2024-11-09 23:45:36 +00:00
Update todo list
parent e73b909a48
commit 535a6aaf05
36
TODO
@@ -1,10 +1,11 @@
TODO:
---------------

GPU branch code item work list
-----------------------------

- GPU offload reductions; thrust initial ; inclusive_scan vs reduce?
TODO:
---------------

- Make GPU offload reductions optionally deterministic
- Accelerate the cshift
- Accelerate non-dslash elements of Mobius; check accelerator_loop uniformly used in fermion operators
- Gamma tables on GPU
@@ -24,17 +25,12 @@ GPU branch code item work list
- AVX512 still broken, lebesgue order missing ?
- Neon ??

DONE:
-----------------------------
Physics item work list:

2)- Consistent linear solver flop count/rate -- PARTIAL, time but no flop/s yet
4)- Multigrid Wilson and DWF, compare to other Multigrid implementations
5)- HDCR resume

-----------------------------

DONE

- GPU offload reductions: using thrust::reduce?
- Deprecate JSON.
- pugixml difficult.
- Eigen problematic.
- Audit HMC timestep / traj length size
- GPU offload reductions; thrust initial ; inclusive_scan vs reduce?
- Pragmas.h - prune and remove strong_inline (?)
@@ -45,7 +41,7 @@ DONE
- - (3) omp for collapse(n)
- - (4) omp parallel for collapse(n)
- - Only (1) has a natural mirror in accelerator_loop
- - Nested loop macros get cumbersome
- - Nested loop macros get cumbersome made a generic interface for N deep
- - Don't like thread_region and thread_loop_in_region
- - Could replace with

@@ -64,6 +60,18 @@ DONE

and same "in_region".


-----------------------------
Physics item work list:
-----------------------------

2)- Consistent linear solver flop count/rate -- PARTIAL, time but no flop/s yet
4)- Multigrid Wilson and DWF, compare to other Multigrid implementations
5)- HDCR resume

-----------------------------


Nov 2018

1)- BG/Q port and check ; Andrew says ok.
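
Note on the "Make GPU offload reductions optionally deterministic" item: floating-point addition is not associative, so a reduction whose order of additions depends on thread scheduling can give slightly different results from run to run. The sketch below illustrates one common remedy, a fixed-shape pairwise tree whose addition order depends only on the input length. It is a minimal CPU-side illustration, not Grid's implementation; the name deterministic_sum is hypothetical and does not appear in the diff above.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of Grid): sums `data` with a fixed-shape
// pairwise tree. The order of additions depends only on data.size(), never
// on thread scheduling, so repeated runs give bit-identical results.
double deterministic_sum(std::vector<double> data) {
  if (data.empty()) return 0.0;
  // Each pass folds the upper part of the active range onto the lower part.
  for (std::size_t n = data.size(); n > 1;) {
    std::size_t half = (n + 1) / 2;  // size of the range that survives this pass
    for (std::size_t i = 0; i + half < n; ++i) {
      data[i] += data[i + half];     // iterations are independent of each other
    }
    n = half;
  }
  return data[0];
}
```

Because the iterations of the inner loop do not depend on one another, each pass could in principle be distributed over threads without changing which pairs are added, which is what keeps the result reproducible.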