azusayamaguchi
|
8b0d171c9a
|
32bit issue on the KNL code variant where byte offsets were stored
|
2016-10-12 17:49:32 +01:00 |
|
azusayamaguchi
|
8bbd9ebc27
|
Reversing changes to Stencil class
|
2016-10-12 13:47:20 +01:00 |
|
azusayamaguchi
|
6472b431f0
|
__rdpmc needed for gcc, clang++
|
2016-10-12 12:29:08 +01:00 |
|
azusayamaguchi
|
bd205a3293
|
Fixing for non x86 and non KNL
|
2016-10-12 12:09:15 +01:00 |
|
azusayamaguchi
|
496beffa88
|
Fix non-KNL build
|
2016-10-12 12:06:08 +01:00 |
|
azusayamaguchi
|
9b63e97108
|
align not absolutely required and confuses clang++
|
2016-10-12 11:51:21 +01:00 |
|
azusayamaguchi
|
81f2aeaece
|
KNL streaming stores, and KNL performance coutners
|
2016-10-12 11:45:22 +01:00 |
|
paboyle
|
2d4a45c758
|
Typecast pointer
|
2016-10-12 09:14:15 +01:00 |
|
paboyle
|
7240d73184
|
Parallelise the x faces; fix the segv on KNL with comms
|
2016-10-11 22:21:07 +01:00 |
|
paboyle
|
42cd148f5e
|
Base pointer for comms buffer under AVX512 assembly
|
2016-10-11 16:06:06 +01:00 |
|
Guido Cossu
|
611b5d74ba
|
Fix for AVX+FMA3 compilation
|
2016-10-10 15:26:17 +01:00 |
|
Guido Cossu
|
b56c9ffa52
|
Fix for AVXFMA
|
2016-10-10 14:43:37 +01:00 |
|
Guido Cossu
|
2e453dfbf5
|
Added some instrumentation to benchmark the force computation
|
2016-10-06 17:52:45 +01:00 |
|
paboyle
|
4089984431
|
Timing hooks
|
2016-10-06 09:25:12 +01:00 |
|
Guido Cossu
|
c78bbd0f8c
|
Fix ASM compilation
|
2016-10-04 15:37:32 +01:00 |
|
|
536e2ff073
|
*.inc removed: please don't commit these files either!
|
2016-09-27 11:54:03 +01:00 |
|
Guido Cossu
|
5c190a1b8c
|
Merge branch 'develop' into feature/hirep
|
2016-09-23 11:06:06 +01:00 |
|
Guido Cossu
|
c4ac6e7e8f
|
Consolidating HMC interface
Uniformed interface for standard action in fundamental rep and Hirep
|
2016-09-23 10:47:42 +01:00 |
|
Guido Cossu
|
510e340e16
|
Debugged last commit for the Two index representation
|
2016-09-22 22:16:21 +01:00 |
|
Guido Cossu
|
6ffadca153
|
Restored number of colours to 3
|
2016-09-22 14:22:54 +01:00 |
|
Guido Cossu
|
b6597b74e7
|
Added support for the Two index Symmetric and Antisymmetric representations
Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations
|
2016-09-22 14:17:37 +01:00 |
|
Antonin Portelli
|
0724f7af75
|
QPX single precision implementation
|
2016-09-19 18:09:12 +01:00 |
|
|
2e74520821
|
removed libtool use (BG/Q compatibility)
|
2016-09-16 15:25:49 +01:00 |
|
Antonin Portelli
|
6dd75ad9e5
|
Merge branch 'develop' of github.com:paboyle/Grid into feature/bgq
|
2016-09-16 15:07:54 +01:00 |
|
Guido Cossu
|
fda408ee6f
|
Added first lines for supporting Two Index representations
|
2016-09-13 10:43:30 +01:00 |
|
Guido Cossu
|
b9c80318a2
|
Merge branch 'develop' into feature/hirep
|
2016-09-13 10:01:51 +01:00 |
|
Guido Cossu
|
5df5d52d41
|
Fix for the Intel compiler
|
2016-09-12 17:17:20 +01:00 |
|
Guido Cossu
|
f76f281e58
|
Cleaning files after fix
|
2016-09-09 11:34:25 +01:00 |
|
Guido Cossu
|
aa20cc8b52
|
Fixing compilation error with AVX512 flag
|
2016-09-09 02:58:52 -07:00 |
|
Guido Cossu
|
0fd179fb33
|
Merge branch 'develop' into feature/hirep
|
2016-09-01 12:59:53 +01:00 |
|
Guido Cossu
|
f45ef8d114
|
Minor modification in ActionBase.h
|
2016-09-01 11:46:46 +01:00 |
|
Guido Cossu
|
fd5614738d
|
Merge branch 'develop' into feature/hirep
|
2016-08-30 18:21:36 +01:00 |
|
Guido Cossu
|
b0d3e4bb2c
|
Separating travis builds
|
2016-08-30 13:44:07 +01:00 |
|
Guido Cossu
|
b512ccbee6
|
HMC for Adjoint fermions works
Accepts and reproduces known results
Check initial instability of inverters
when starting from hot configurations
|
2016-08-30 11:31:25 +01:00 |
|
paboyle
|
8c89391c02
|
FFTW unresolved fixed when no fftw3.h
|
2016-08-24 16:41:47 +01:00 |
|
paboyle
|
bfac5195b8
|
tidy up
|
2016-08-24 16:38:36 +01:00 |
|
paboyle
|
744691097f
|
Printing
|
2016-08-24 15:05:56 +01:00 |
|
paboyle
|
ff6da364e8
|
FFT double and single precision gives good performance now in multithreaded code.
|
2016-08-24 15:05:00 +01:00 |
|
|
4d11a6f5f2
|
first commit for QPX intrinsics
|
2016-08-23 14:41:44 +01:00 |
|
paboyle
|
88be3b39bb
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2016-08-22 18:29:36 +01:00 |
|
paboyle
|
356e7940fd
|
fftw can be switched off
|
2016-08-22 16:24:49 +01:00 |
|
paboyle
|
73ce476890
|
Include fftw headers
|
2016-08-22 16:24:21 +01:00 |
|
paboyle
|
e423a09974
|
FFT improved and test_FFT passing under MPI 8 processes, 8^4 for LatticeComplexD and LatticeSpinMatrixD
|
2016-08-18 02:23:21 +01:00 |
|
paboyle
|
17097a93ec
|
FFTW test ran over 4 mpi processes.
|
2016-08-17 01:33:55 +01:00 |
|
paboyle
|
4ab7dbfd57
|
Instantiate
|
2016-08-15 23:00:40 +01:00 |
|
paboyle
|
90e70790f3
|
Feature for z-Mobius prep
|
2016-08-15 22:31:29 +01:00 |
|
Guido Cossu
|
9c2e8d5e28
|
Nc=3 just to let all the test pass in Travis
|
2016-08-09 15:46:57 +01:00 |
|
Guido Cossu
|
147e2025b9
|
Added unit tests on the representation transformations
Status: Passing all tests
|
2016-08-08 16:54:22 +01:00 |
|
paboyle
|
32bc7a6ab8
|
MPI back out of change that hangs
AVX2 for clang, gcc needs the -mfma flag.
|
2016-08-05 10:36:00 +01:00 |
|
|
93d29bb699
|
build system improvements after discussion with Peter
|
2016-08-04 16:19:59 +01:00 |
|