Mirror of https://github.com/paboyle/Grid.git (synced 2025-11-05 06:19:31 +00:00)
site and s loops into the kernels. This will save on function call overhead and guarantee that the L2 prefetching strategy is right, since OpenMP can't distribute the sub-chunks of work.
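As a rough illustration of the idea, the sketch below contrasts a kernel called once per (site, s) point with a kernel that owns both loops over a contiguous range of sites. It is not Grid's code: `point_work`, `kernel_per_point`, `kernel_fused`, the flat `site * Ls + s` layout, and the chunking scheme are all hypothetical names and assumptions chosen for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-point work; stands in for a real stencil kernel body.
inline double point_work(double x) { return 2.0 * x + 1.0; }

// Before: the caller owns both loops and calls the kernel once per (site, s).
void kernel_per_point(const std::vector<double>& in, std::vector<double>& out,
                      std::size_t site, std::size_t s, std::size_t Ls) {
  const std::size_t idx = site * Ls + s;  // assumed layout: s runs fastest
  out[idx] = point_work(in[idx]);
}

void apply_per_point(const std::vector<double>& in, std::vector<double>& out,
                     std::size_t nsite, std::size_t Ls) {
  assert(in.size() == nsite * Ls && out.size() == in.size());
  // An OpenMP parallel-for over this site loop would let the runtime decide
  // which iterations each thread gets, one kernel call per point.
  for (std::size_t site = 0; site < nsite; ++site)
    for (std::size_t s = 0; s < Ls; ++s)
      kernel_per_point(in, out, site, s, Ls);
}

// After: the kernel receives a contiguous site range and runs both loops
// itself, so the traversal order inside a chunk is fixed and known.
void kernel_fused(const std::vector<double>& in, std::vector<double>& out,
                  std::size_t site_begin, std::size_t site_end, std::size_t Ls) {
  for (std::size_t site = site_begin; site < site_end; ++site)
    for (std::size_t s = 0; s < Ls; ++s) {
      const std::size_t idx = site * Ls + s;
      out[idx] = point_work(in[idx]);
    }
}

void apply_fused(const std::vector<double>& in, std::vector<double>& out,
                 std::size_t nsite, std::size_t Ls, std::size_t nchunks) {
  assert(in.size() == nsite * Ls && out.size() == in.size());
  // One kernel call per chunk; in a threaded build each chunk would go to
  // one thread, and nothing below that granularity is left to the runtime.
  for (std::size_t c = 0; c < nchunks; ++c) {
    const std::size_t begin = c * nsite / nchunks;
    const std::size_t end = (c + 1) * nsite / nchunks;
    kernel_fused(in, out, begin, end, Ls);
  }
}
```

Both variants compute the same result; the fused form makes one call per chunk instead of one per point, and because the order of sites and s values within a chunk is decided inside the kernel, any prefetch of the next point can be written against a known access sequence.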