1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-10 07:55:35 +00:00
Go to file
Peter Boyle 1887c77498 Getting closer to having a wilson solver... introducing a first and untested
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of

algorithms/approx
algorithms/iterative

etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
benchmarks Updating preparing for solvers etc.. 2015-05-16 23:35:08 +01:00
gcc-bug-report Better build automation 2015-05-16 07:16:45 +01:00
lib Getting closer to having a wilson solver... introducing a first and untested 2015-05-18 07:47:05 +01:00
m4 Build progressing 2015-03-04 04:34:51 +00:00
scripts Better build automation 2015-05-16 07:16:45 +01:00
tests Getting closer to having a wilson solver... introducing a first and untested 2015-05-18 07:47:05 +01:00
.gitignore Remove stub files 2015-04-06 11:29:55 +01:00
aclocal.m4 I have made the Cshift work successfully with open mp threading in 2015-05-13 00:31:00 +01:00
AUTHORS Update AUTHORS 2015-03-07 07:00:39 +00:00
ChangeLog Updating build system 2015-03-04 04:53:40 +00:00
compile files 2015-03-04 11:57:14 +00:00
configure Getting closer to having a wilson solver... introducing a first and untested 2015-05-18 07:47:05 +01:00
configure.ac Getting closer to having a wilson solver... introducing a first and untested 2015-05-18 07:47:05 +01:00
COPYING Extra files 2015-03-04 12:03:07 +00:00
depcomp files 2015-03-04 11:57:14 +00:00
INSTALL Update INSTALL 2015-03-07 07:09:09 +00:00
install-sh file 2015-03-04 11:55:44 +00:00
LICENSE Initial commit 2015-03-04 02:30:11 +00:00
Makefile.am Starting a benchmarking sub dir 2015-05-02 17:52:36 +01:00
Makefile.in I have made the Cshift work successfully with open mp threading in 2015-05-13 00:31:00 +01:00
missing files 2015-03-04 11:57:14 +00:00
NEWS Updating build system 2015-03-04 04:53:40 +00:00
README Update README 2015-03-07 07:19:01 +00:00
README.md typo 2015-04-18 12:40:55 +01:00
TODO OMP dslash working 2015-05-13 10:59:22 +01:00

Grid

Data parallel C++ mathematical object library

This library provides data parallel C++ container classes with internal memory layout that is transformed to map efficiently to SIMD architectures. CSHIFT facilities are provided, similar to HPF and cmfortran, and user control is given over the mapping of array indices to both MPI tasks and SIMD processing elements.

  • Identically shaped arrays then be processed with perfect data parallelisation.
  • Such identically shapped arrays are called conformable arrays.

The transformation is based on the observation that Cartesian array processing involves identical processing to be performed on different regions of the Cartesian array.

The library will both geometrically decompose into MPI tasks and across SIMD lanes. Local vector loops are parallelised with OpenMP pragmas.

Data parallel array operations can then be specified with a SINGLE data parallel paradigm, but optimally use MPI, OpenMP and SIMD parallelism under the hood. This is a significant simplification for most programmers.

The layout transformations are parametrised by the SIMD vector length. This adapts according to the architecture. Presently SSE2 (128 bit) AVX, AVX2 (256 bit) and IMCI and AVX512 (512 bit) targets are supported.

These are presented as

vRealF, vRealD, vComplexF, vComplexD

internal vector data types. These may be useful in themselves for other programmers. The corresponding scalar types are named

RealF, RealD, ComplexF, ComplexD

MPI, OpenMP, and SIMD parallelism are present in the library.

You can give `configure' initial values for configuration parameters by setting variables in the command line or in the environment. Here are examples:

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx" --enable-simd=AVX1

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx2" --enable-simd=AVX2

 ./configure CXX=icpc CXXFLAGS="-std=c++11 -O3 -mmic" --enable-simd=AVX512 --host=none