mirror of
https://github.com/paboyle/Grid.git
synced 2025-04-04 03:05:55 +01:00
site and s loop into the kernels. This will save on function call overhead and guarantee L2 prefetching strategy is right since OMP can't distribute the sub-chunks of work.