Grid/tests/debug at 26c3c7d8f9f8c30ed6b9e97ac57a3171847f2952 - Grid - DiRAC Tursa git server

portelli/Grid

mirror of https://github.com/paboyle/Grid.git synced 2026-06-21 03:08:15 +01:00

Files

T

History

Peter Boyle 068f95ad2d Revert to hand-rolled reduction; drop Lattice_reduction_gpu_cub.h

Remove the CUB/hipCUB direction entirely. Restore Lattice_reduction_gpu.h,
Lattice_reduction_sycl.h, and Lattice_reduction.h to the state before the
CUB rewrite (commit 969b0a39), recovering the original primary function names
(sumD_gpu_small, sumD_gpu_large, sumD_gpu, sum_gpu, sum_gpu_large) and the
hand-rolled shared-memory reduction kernel.

Delete Lattice_reduction_gpu_cub.h. Update Test_reduction to remove the
old/new comparison sections that depended on sum_gpu_old.

The lesson: CUB DeviceReduce is slower than the hand-rolled kernel for small
types, and the smem sizing problem for the extraction pass has no clean
solution within the accelerator_for abstraction. The right improvement is
a higher radix (12 then 4) in sumD_gpu_large, applied directly to the
existing hand-rolled kernel.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-18 21:52:18 -04:00

..

Makefile.am

build system: local Grid link flag moved to configure.ac

2016-08-03 15:07:42 +01:00

Test_8888.cc

8^4 test for PETSc

2024-07-22 15:25:17 -04:00

Test_cayley_cg.cc

Schur additional case

2024-07-10 22:04:32 +00:00

Test_cayley_coarsen_support.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_cayley_even_odd.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_cayley_ldop_cr.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_cayley_mres.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_cheby.cc

GLobal edit for QCD namespace removal & NAMESPACE macros

2018-01-15 09:37:58 +00:00

Test_general_coarse_hdcg_phys48_blockcg.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys48_lanczos_subspace.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys48_lanczos.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys48_mixed.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys48.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys96_mixed.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg_phys.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_general_coarse_hdcg.cc

Improvement to 16^3 hdcg

2026-03-05 06:06:32 -05:00

Test_general_coarse_pvdagm_svd_cg.cc

preserving a bunch of experiments on setup and g5 subspace doubling

2026-01-06 05:57:39 -05:00

Test_general_coarse_pvdagm_svd_uv.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse_pvdagm_svd.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse_pvdagm.cc

Optimised

2026-03-05 06:06:32 -05:00

Test_general_coarse_wilson_nog5.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse_wilson_svd_no5g.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse_wilson_svd.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse_wilson.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_general_coarse.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

test_Grid_jacobi.cc

_grid becomes private ; use Grid()§

2018-01-27 00:04:12 +00:00

Test_heatbath_dwf_eofa_gparity.cc

Make all tests compile

2025-04-24 20:33:26 -04:00

Test_heatbath_dwf_eofa.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_heatbath_mobius_eofa_gparity.cc

Make all tests compile

2025-04-24 20:33:26 -04:00

Test_heatbath_mobius_eofa.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_iwasaki_action_newstaple.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_optimized_staple_gaugebc.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_padded_cell_staple.cc

Alternate multigrids

2026-02-13 17:25:45 -05:00

Test_padded_cell.cc

Assertion updates to macros (mostly) with backtrace.

2025-08-07 15:48:38 +00:00

Test_reduction.cc

Revert to hand-rolled reduction; drop Lattice_reduction_gpu_cub.h

2026-05-18 21:52:18 -04:00

Test_reweight_dwf_eofa_gparity.cc

Make all tests compile

2025-04-24 20:33:26 -04:00

Test_reweight_dwf_eofa.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_reweight_mobius_eofa_gparity.cc

Make all tests compile

2025-04-24 20:33:26 -04:00

Test_reweight_mobius_eofa.cc

Tests clean build on HIP

2022-11-16 20:15:51 -05:00

Test_split_laplacian.cc

Modified entire test directory to suit new GPU constructs for looping

2019-06-15 12:53:27 +01:00