Update todo list

2025-07-21 23:27:06 +01:00 · 2019-01-02 22:07:51 +00:00
parent e73b909a48
commit 535a6aaf05
1 changed files with 22 additions and 14 deletions
--- a/36
+++ b/36
@@ -1,10 +1,11 @@
-TODO:
---------------

 GPU branch code item work list
 -----------------------------

- GPU offload reductions; thrust initial ; inclusive_scan vs reduce?
+TODO:
+---------------
+
+- Make GPU offload reductions optionally deterministic
 - Accelerate the cshift
 - Accelerate non-dslash elements of Mobius; check accelerator_loop uniformly used in fermion operators
 - Gamma tables on GPU
@@ -24,17 +25,12 @@ GPU branch code item work list
 - AVX512 still broken, lebesgue order missing ?
 - Neon ??

+DONE:
 -----------------------------
-Physics item work list:
-
-2)- Consistent linear solver flop count/rate -- PARTIAL, time but no flop/s yet
-4)- Multigrid Wilson and DWF, compare to other Multigrid implementations
-5)- HDCR resume
-
-----------------------------
-
-DONE
-
+- GPU offload reductions: using thrust::reduce?
+- Deprecate JSON.
+- pugixml difficult.
+- Eigen problematic.
 - Audit HMC timestep / traj length size
 - GPU offload reductions; thrust initial ; inclusive_scan vs reduce?
 - Pragmas.h - prune and remove strong_inline (?)
@@ -45,7 +41,7 @@ DONE
 - -      (3) omp for collapse(n)
 - -      (4) omp parallel for collapse(n)
 - - Only (1) has a natural mirror in accelerator_loop
- - Nested loop macros get cumbersome
+- - Nested loop macros get cumbersome; made a generic interface for N deep
 - - Don't like thread_region and thread_loop_in_region
 - - Could replace with 

@@ -64,6 +60,18 @@ DONE

    and same "in_region".

+
+-----------------------------
+Physics item work list:
+-----------------------------
+
+2)- Consistent linear solver flop count/rate -- PARTIAL, time but no flop/s yet
+4)- Multigrid Wilson and DWF, compare to other Multigrid implementations
+5)- HDCR resume
+
+-----------------------------
+
+
 Nov 2018

 1)- BG/Q port and check ; Andrew says ok.