1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-10 07:55:35 +00:00
Go to file
2016-08-30 14:25:24 +01:00
benchmarks Added generators for the adjoint representation 2016-07-08 15:40:11 +01:00
docs Small modification to the configure files 2015-06-04 14:17:58 +09:00
gcc-bug-report Global edit adding copyright and license info to every source file. 2016-01-02 14:51:32 +00:00
lib Separating travis builds 2016-08-30 13:44:07 +01:00
m4 fix of AX_GCC_X86_AVX_XGETBV macro 2015-07-17 11:15:57 +09:00
scripts Merge branch 'temporary-smearing' into develop 2016-07-07 14:04:59 +01:00
tests Separating travis builds 2016-08-30 13:44:07 +01:00
.gitignore Fixed compilation error for benchmark_dwf 2016-06-20 12:30:51 +01:00
.travis.yml Single and double precision travis matrix 2016-08-30 14:25:24 +01:00
AUTHORS Global edit adding copyright and license info to every source file. 2016-01-02 14:51:32 +00:00
ChangeLog Updating build system 2015-03-04 04:53:40 +00:00
configure Merge remote-tracking branch 'origin/develop' into temporary-smearing 2016-07-04 17:28:40 +01:00
configure.ac timers flag 2016-06-03 03:25:38 -07:00
COPYING GPLv2 clarified, and copyright message and banner in Init function. 2016-01-02 15:22:30 +00:00
LICENSE GPLv2 clarified, and copyright message and banner in Init function. 2016-01-02 15:22:30 +00:00
Makefile.am Reorganise of file naming 2015-06-03 12:47:05 +01:00
NEWS Updating build system 2015-03-04 04:53:40 +00:00
README Added check of mpfr and gmp at configure time 2015-05-19 13:54:55 +09:00
README.md Travis status in README 2016-05-01 11:18:47 -07:00
TODO Plaq, Rectangle, Iwasaki, Symanzik and DBW2 workign and HMC regresses to http://arxiv.org/pdf/hep-lat/0610075.pdf 2015-12-28 16:38:31 +00:00
VERSION Version file 2016-06-30 14:44:09 -07:00

Grid Build Status

Data parallel C++ mathematical object library

Last update 2015/7/30

This library provides data parallel C++ container classes with internal memory layout that is transformed to map efficiently to SIMD architectures. CSHIFT facilities are provided, similar to HPF and cmfortran, and user control is given over the mapping of array indices to both MPI tasks and SIMD processing elements.

  • Identically shaped arrays then be processed with perfect data parallelisation.
  • Such identically shapped arrays are called conformable arrays.

The transformation is based on the observation that Cartesian array processing involves identical processing to be performed on different regions of the Cartesian array.

The library will both geometrically decompose into MPI tasks and across SIMD lanes. Local vector loops are parallelised with OpenMP pragmas.

Data parallel array operations can then be specified with a SINGLE data parallel paradigm, but optimally use MPI, OpenMP and SIMD parallelism under the hood. This is a significant simplification for most programmers.

The layout transformations are parametrised by the SIMD vector length. This adapts according to the architecture. Presently SSE4 (128 bit) AVX, AVX2 (256 bit) and IMCI and AVX512 (512 bit) targets are supported (ARM NEON on the way).

These are presented as

 vRealF, vRealD, vComplexF, vComplexD 

internal vector data types. These may be useful in themselves for other programmers. The corresponding scalar types are named

 RealF, RealD, ComplexF, ComplexD

MPI, OpenMP, and SIMD parallelism are present in the library.

You can give `configure' initial values for configuration parameters by setting variables in the command line or in the environment. Here are examples:

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -msse4" --enable-simd=SSE4

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx" --enable-simd=AVX

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx2" --enable-simd=AVX2

 ./configure CXX=icpc CXXFLAGS="-std=c++11 -O3 -mmic" --enable-simd=AVX512 --host=none

Note: Before running configure it could be necessary to execute the script

   script/filelist

For developers: Use reconfigure_script in the scripts/ directory to create the autotools environment