1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-05-22 18:14:17 +01:00

Lattice_reduction_gpu_cub: define GRID_REDUCTION_TIMING in header

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
Peter Boyle
2026-05-18 14:54:08 -04:00
parent e12bc7f07c
commit fca2c5dba0
+2
View File
@@ -52,6 +52,8 @@ NAMESPACE_BEGIN(Grid);
#if defined(GRID_CUDA) || defined(GRID_HIP)
#define GRID_REDUCTION_TIMING
// Direct CUB reduction on the full scalar_objectD.
// Only safe when sizeof(sobjD)*64 <= device sharedMemPerBlock.
// Do not call directly for large composite types (e.g. LatticePropagator).