paboyle
1ec5d32369
Chulwoo's test to zmobius helped me shake out
2017-03-30 13:45:13 +09:00
paboyle
98f9318279
Build on AVX2 and MPI passing with clang++
2017-03-28 23:16:04 +09:00
paboyle
4b17e8eba8
Merge branch 'develop' into feature/bgq-asm
...
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
2017-03-28 04:49:30 -04:00
paboyle
18bde08d1b
Merge branch 'feature/staggering' into develop
2017-03-28 15:25:55 +09:00
paboyle
e099dcdae7
Merge branch 'develop' into feature/bgq-asm
2017-02-23 00:25:29 +00:00
azusayamaguchi
1c30e9a961
Verified
2017-02-21 23:01:25 +00:00
azusayamaguchi
bf7e3f20d4
Staggaered fermion optimised version
2017-02-21 14:35:42 +00:00
paboyle
9ff97b4711
Improved stencil tests passing all on KNL multinode
2017-02-07 00:58:34 -05:00
d775fbb2f9
Gammas: code cleaning and gamma_L implementation & test
2017-02-01 15:45:05 -08:00
05cb6d318a
gammas: adjoint implemented as a symbolic operation
2017-01-24 18:07:43 -08:00
068b28af2d
Extensive gamma test program
2017-01-24 17:35:29 -08:00
a37e71f362
New automatic implementation of gamma matrices, Meson and SeqGamma are broken
2017-01-23 19:13:43 -08:00
azusayamaguchi
df9108154d
Debugged 2 versions of assembler; ls vectorised, xyzt vectorised
2016-12-17 23:47:51 +00:00
azusayamaguchi
eabc577940
Assembler possibly working
2016-12-16 16:55:36 +00:00
Azusa Yamaguchi
c097fd041a
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-29 13:44:17 +00:00
Azusa Yamaguchi
77fb25fb29
Push 5d tests
2016-11-29 13:43:56 +00:00
Azusa Yamaguchi
389e0a77bd
Staggerd Fermion 5D
2016-11-29 13:13:56 +00:00
3cdf945d84
Test_fftf fix
2016-11-24 09:10:03 +09:00
8a5e3a917c
Merge branch 'develop' into release/v0.6.0
...
# Conflicts:
# tests/core/Test_fft_gfix.cc
2016-11-08 16:53:42 +00:00
cd0be8cb24
Test_fft_gfix.c precision fix
2016-11-08 15:32:05 +00:00
azusayamaguchi
343f3e829f
Fixes prerelease to make all tests
2016-11-08 13:42:12 +00:00
azusayamaguchi
3dc2e05d6e
Time as well since MKL returns zero for Mflops
2016-11-08 11:36:18 +00:00
ec232af851
Photon.h references removed
2016-11-03 14:34:16 +00:00
2854e601e6
FFT test typo
2016-11-03 14:09:47 +00:00
ca21003f01
Merge branch 'feature/fft-opt' into feature/feynman-rules
...
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
2016-10-26 18:44:47 +01:00
14ddf2c234
more FFT optimisations
2016-10-26 17:36:26 +01:00
Azusa Yamaguchi
bca861e112
Note:FFT shoud be GridFFT (Not change yet).
...
Gauge fix with FFt is added (tests/core)
2016-10-25 14:21:48 +01:00
997fd882ff
Merge branch 'develop' into feature/feynman-rules
...
# Conflicts:
# lib/Threads.h
# lib/qcd/action/fermion/WilsonFermion.cc
# lib/qcd/action/fermion/WilsonFermion.h
# lib/qcd/utils/SUn.h
# lib/simd/Grid_avx.h
# lib/simd/Intel512common.h
2016-10-19 18:35:18 +01:00
paboyle
db749f103f
Add Wilson, DWF, Overlap feynman rule tests
2016-10-10 23:48:35 +01:00
536e2ff073
*.inc removed: please don't commit these files either!
2016-09-27 11:54:03 +01:00
paboyle
34f887ca1c
Test_fft not complete; preparing for tests of momentum space DWF and Overlap feynman rules but not there yet.
2016-09-26 09:44:36 +01:00
Guido Cossu
510e340e16
Debugged last commit for the Two index representation
2016-09-22 22:16:21 +01:00
Guido Cossu
b6597b74e7
Added support for the Two index Symmetric and Antisymmetric representations
...
Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations
2016-09-22 14:17:37 +01:00
Guido Cossu
0fd179fb33
Merge branch 'develop' into feature/hirep
2016-09-01 12:59:53 +01:00
paboyle
7422953e36
Poisson solver example
2016-08-31 00:42:47 +01:00
paboyle
62febd2823
Wilson prop test
2016-08-31 00:23:09 +01:00
Guido Cossu
fd5614738d
Merge branch 'develop' into feature/hirep
2016-08-30 18:21:36 +01:00
paboyle
ff6da364e8
FFT double and single precision gives good performance now in multithreaded code.
2016-08-24 15:05:00 +01:00
paboyle
e423a09974
FFT improved and test_FFT passing under MPI 8 processes, 8^4 for LatticeComplexD and LatticeSpinMatrixD
2016-08-18 02:23:21 +01:00
paboyle
17097a93ec
FFTW test ran over 4 mpi processes.
2016-08-17 01:33:55 +01:00
629283726b
build system: local Grid link flag moved to configure.ac
2016-08-03 15:07:42 +01:00
9e5b934d21
improved LAPACK configuration
2016-08-02 17:26:54 +01:00
e9f30cab2c
first working version for the new build system
2016-07-30 17:53:18 +01:00
paboyle
f4dd5062d7
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-07-15 19:26:06 +01:00
paboyle
c0e878705e
Updated file list
2016-07-15 00:02:39 +01:00
paboyle
de3e79d300
red black for Ls vectorised is 4d red black. Update accordingly now I've made this choice
2016-07-14 23:49:42 +01:00
paboyle
adbc7c1188
Adding files for multiple implementations (cache opt) and Ls vectorisation
...
of the 5D cayley form chiral fermions for the 5d matrix. With Ls entirely
in the vector direction, s-hopping terms involve rotations.
The serial dependence of the LDU inversion for Mobius and 4d even odd
checkerboarding is removed by simply applying Ls^2 operations (vectorised
many ways) as a dense matrix operation.
This should give similar throughput but high flops (non-compulsory flops)
but enable use of the KNL cache friendly kernels throughout the code.
Ls is still constrained to be a multiple of Nsimd, which is as much as 8 for AVX512
with single precision.
2016-07-14 22:59:21 +01:00
paboyle
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00