diff --git a/TODO b/TODO index 380af2b7..a6d0f2ac 100644 --- a/TODO +++ b/TODO @@ -1,3 +1,10 @@ +- - Slice sum optimisation & A2A - atomic addition +- - Also faster non-atomic reduction +- - Remaining PRs +- - DDHMC + +================= +================= Lattice_basis.h -- > HIP and SYCL GPU code @@ -8,6 +15,7 @@ DDHMC -- Multishift Mixed Precision - DONE -- Pole dependent residual - DONE + ======= -- comms threads issue?? -- Part done: Staggered kernel performance on GPU