mirror of
https://github.com/paboyle/Grid.git
synced 2026-05-22 18:14:17 +01:00
Lattice_reduction_gpu_cub: define GRID_REDUCTION_TIMING in header
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -52,6 +52,8 @@ NAMESPACE_BEGIN(Grid);
|
||||
|
||||
#if defined(GRID_CUDA) || defined(GRID_HIP)
|
||||
|
||||
#define GRID_REDUCTION_TIMING
|
||||
|
||||
// Direct CUB reduction on the full scalar_objectD.
|
||||
// Only safe when sizeof(sobjD)*64 <= device sharedMemPerBlock.
|
||||
// Do not call directly for large composite types (e.g. LatticePropagator).
|
||||
|
||||
Reference in New Issue
Block a user