mirror of
https://github.com/paboyle/Grid.git
synced 2025-05-15 23:15:47 +01:00
site and s loop into the kernels. This will save on function call overhead and guarantee L2 prefetching strategy is right since OMP can't distribute the sub-chunks of work.