mirror of
https://github.com/paboyle/Grid.git
synced 2025-05-14 22:45:47 +01:00
site and s loop into the kernels. This will save on function call overhead and guarantee L2 prefetching strategy is right since OMP can't distribute the sub-chunks of work.