Mirror of https://github.com/paboyle/Grid.git, synced 2025-09-17 16:51:04 +01:00
site and s loop into the kernels. This saves function call overhead and guarantees that the L2 prefetching strategy is right, since OMP can't distribute the sub-chunks of work.
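A minimal sketch of the idea, not the actual Grid kernels: every name here (`site_kernel`, `block_kernel`, `apply_outer_loops`, `apply_inner_loops`, the `2.0 *` body, the flat `site * Ls + s` layout) is hypothetical and only stands in for the real stencil work. It contrasts a caller-owned loop nest, which makes one call per (site, s), with a kernel that owns both loops and is handed one contiguous range of sites per thread.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-(site, s) body; stands in for the real stencil arithmetic.
inline void site_kernel(const std::vector<double>& in, std::vector<double>& out,
                        std::size_t site, std::size_t s, std::size_t Ls) {
  const std::size_t idx = site * Ls + s;
  out[idx] = 2.0 * in[idx];
}

// Before: the caller owns the site and s loops, so there is one kernel call
// per (site, s) and the OpenMP runtime decides how sites are handed out.
void apply_outer_loops(const std::vector<double>& in, std::vector<double>& out,
                       std::size_t nsite, std::size_t Ls) {
#ifdef _OPENMP
#pragma omp parallel for
#endif
  for (std::ptrdiff_t site = 0; site < static_cast<std::ptrdiff_t>(nsite); ++site) {
    for (std::size_t s = 0; s < Ls; ++s) {
      site_kernel(in, out, static_cast<std::size_t>(site), s, Ls);
    }
  }
}

// After: the kernel owns both loops. It walks a contiguous range of sites in a
// fixed order, so the next site to touch is known inside the kernel.
void block_kernel(const std::vector<double>& in, std::vector<double>& out,
                  std::size_t site_begin, std::size_t site_end, std::size_t Ls) {
  assert(site_begin <= site_end);
  for (std::size_t site = site_begin; site < site_end; ++site) {
    for (std::size_t s = 0; s < Ls; ++s) {
      const std::size_t idx = site * Ls + s;
      out[idx] = 2.0 * in[idx];
    }
  }
}

// One kernel call per block of sites; OpenMP only distributes whole blocks.
void apply_inner_loops(const std::vector<double>& in, std::vector<double>& out,
                       std::size_t nsite, std::size_t Ls, std::size_t nblocks) {
#ifdef _OPENMP
#pragma omp parallel for
#endif
  for (std::ptrdiff_t b = 0; b < static_cast<std::ptrdiff_t>(nblocks); ++b) {
    const std::size_t ub = static_cast<std::size_t>(b);
    const std::size_t begin = ub * nsite / nblocks;
    const std::size_t end = (ub + 1) * nsite / nblocks;
    block_kernel(in, out, begin, end, Ls);
  }
}
```

Both variants compute the same result; the second simply trades many small calls for one call per block, which is the property the message above relies on.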