mirror of https://github.com/paboyle/Grid.git synced 2026-07-17 23:53:27 +01:00

T

Peter Boyle 5644ab1e19 Large scale change to support 5d fermion formulations.

Have 5d replicated wilson with 4d gauge working and matrix regressing
to Ls copies of wilson.

2015-05-31 15:09:02 +01:00

benchmarks

Large scale change to support 5d fermion formulations.

2015-05-31 15:09:02 +01:00

docs

Adding support for doxygen generation

2015-05-27 10:34:56 +09:00

gcc-bug-report

Better build automation

2015-05-16 07:16:45 +01:00

lib

Large scale change to support 5d fermion formulations.

2015-05-31 15:09:02 +01:00

Adding support for doxygen generation

2015-05-27 10:34:56 +09:00

scripts

Large scale change to support 5d fermion formulations.

2015-05-31 15:09:02 +01:00

tests

Large scale change to support 5d fermion formulations.

2015-05-31 15:09:02 +01:00

.gitignore

Added a .gitignore file to eliminate autoconf files from commits

2015-05-27 11:10:51 +09:00

AUTHORS

Update AUTHORS

2015-03-07 07:00:39 +00:00

ChangeLog

Updating build system

2015-03-04 04:53:40 +00:00

configure

Improving the reduction to go through our on permute.

2015-05-27 16:07:17 +01:00

configure.ac

Adding support for doxygen generation

2015-05-27 10:34:56 +09:00

COPYING

Extra files

2015-03-04 12:03:07 +00:00

INSTALL

Improving the reduction to go through our on permute.

2015-05-27 16:07:17 +01:00

LICENSE

Initial commit

2015-03-04 02:30:11 +00:00

Makefile.am

Merge remote-tracking branch 'upstream/master'

2015-05-27 10:41:33 +09:00

NEWS

Updating build system

2015-03-04 04:53:40 +00:00

README

Added check of mpfr and gmp at configure time

2015-05-19 13:54:55 +09:00

README.md

Merging with upstream

2015-05-19 13:36:03 +09:00

reconfigure_script

Corrected AVX regression error. Tested.

2015-05-27 10:49:33 +09:00

TODO

Adding support for doxygen generation

2015-05-27 10:34:56 +09:00

README.md

Grid

Data parallel C++ mathematical object library

This library provides data parallel C++ container classes with internal memory layout that is transformed to map efficiently to SIMD architectures. CSHIFT facilities are provided, similar to HPF and cmfortran, and user control is given over the mapping of array indices to both MPI tasks and SIMD processing elements.

Identically shaped arrays then be processed with perfect data parallelisation.
Such identically shapped arrays are called conformable arrays.

The transformation is based on the observation that Cartesian array processing involves identical processing to be performed on different regions of the Cartesian array.

The library will both geometrically decompose into MPI tasks and across SIMD lanes. Local vector loops are parallelised with OpenMP pragmas.

Data parallel array operations can then be specified with a SINGLE data parallel paradigm, but optimally use MPI, OpenMP and SIMD parallelism under the hood. This is a significant simplification for most programmers.

The layout transformations are parametrised by the SIMD vector length. This adapts according to the architecture. Presently SSE2 (128 bit) AVX, AVX2 (256 bit) and IMCI and AVX512 (512 bit) targets are supported.

These are presented as

vRealF, vRealD, vComplexF, vComplexD

internal vector data types. These may be useful in themselves for other programmers. The corresponding scalar types are named

RealF, RealD, ComplexF, ComplexD

MPI, OpenMP, and SIMD parallelism are present in the library.

You can give `configure' initial values for configuration parameters by setting variables in the command line or in the environment. Here are examples:

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -msse4" --enable-simd=SSE4

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx" --enable-simd=AVX1

 ./configure CXX=clang++ CXXFLAGS="-std=c++11 -O3 -mavx2" --enable-simd=AVX2

 ./configure CXX=icpc CXXFLAGS="-std=c++11 -O3 -mmic" --enable-simd=AVX512 --host=none