Peter Boyle
4d8b01b7ed
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-12-18 00:56:57 +00:00
Peter Boyle
fa6acccf55
Zmobius asm
2016-12-18 00:56:19 +00:00
azusayamaguchi
df9108154d
Debugged 2 versions of assembler; ls vectorised, xyzt vectorised
2016-12-17 23:47:51 +00:00
azusayamaguchi
b3e7f600da
Partial implementation of 4d vectorisation assembler
2016-12-16 23:50:30 +00:00
azusayamaguchi
d4071daf2a
Template specialise
2016-12-16 22:28:29 +00:00
azusayamaguchi
a2a6329094
AVX512 only for ASM compilation
2016-12-16 22:03:29 +00:00
azusayamaguchi
eabc577940
Assembler possibly working
2016-12-16 16:55:36 +00:00
91e98b1dd5
Merge branch 'feature/hadrons' into develop
2016-12-15 18:15:56 +00:00
b791c274b0
Revert "AVX: uninitialised variable fix"
...
This reverts commit c22c3db9ad
.
2016-12-15 18:15:35 +00:00
c22c3db9ad
AVX: uninitialised variable fix
2016-12-13 19:05:58 +00:00
Azusa Yamaguchi
426197e446
Nc=3
2016-12-12 09:10:54 +00:00
Azusa Yamaguchi
99e2c1e666
Kernels options
2016-12-12 09:08:53 +00:00
Azusa Yamaguchi
1440565a10
Decrease verbosity
2016-12-12 09:08:04 +00:00
Azusa Yamaguchi
e9f0c0ea39
Staggered kernels options
2016-12-12 09:07:38 +00:00
Peter Boyle
fe187e9ed3
Compiles and passes under ZMobius with assembler
2016-12-10 00:47:48 +00:00
Peter Boyle
0091b50f49
Zmobius working -- not asm yet
2016-12-09 22:51:32 +00:00
Peter Boyle
fb8d4b2357
Lots of debug on performance Mobius
2016-12-08 17:28:28 +00:00
Peter Boyle
83fa038bdf
Streaming stores
2016-12-08 16:58:42 +00:00
Peter Boyle
7a61feb6d3
Allocator added with caching for Linux VM subsystem optimisation
2016-12-08 16:58:01 +00:00
Peter Boyle
69ae817d1c
Updates for supporting Mobius better
2016-12-08 16:43:28 +00:00
51322da6f8
Hadrons: genetic scheduler improvement
2016-12-07 09:00:45 +09:00
c56707e003
useless debug message removed
2016-12-07 08:59:20 +09:00
Peter Boyle
e27c6b217c
Updating
2016-12-01 12:42:53 +00:00
9ad3d3453e
Hadrons is now a library, the previous XML driven program is now a test
2016-12-01 21:36:29 +09:00
paboyle
6adf35da54
Faster Mobius
2016-12-01 11:39:04 +00:00
paboyle
bd0430b34f
Serialisation in malloc fixed
2016-11-29 22:27:55 +00:00
Azusa Yamaguchi
c097fd041a
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-29 13:44:17 +00:00
Azusa Yamaguchi
77fb25fb29
Push 5d tests
2016-11-29 13:43:56 +00:00
Azusa Yamaguchi
389e0a77bd
Staggerd Fermion 5D
2016-11-29 13:13:56 +00:00
paboyle
4704f2d009
Actions updated
2016-11-29 00:14:36 +00:00
Guido Cossu
ae9688e343
Reporting also the total mflops
2016-11-28 11:37:02 +00:00
43928846f2
first steps to make Hadrons a library
2016-11-28 16:02:15 +09:00
fabcd4179d
Hadrons: propagator type coming from the fermion implementation
2016-11-28 14:02:10 +09:00
a8843c9af6
Code cleaning, the fermion implementation can be sepcified using the macro FIMPL
2016-11-27 16:47:22 +09:00
7a1a7a685e
Merge branch 'feature/fft-opt' into feature/hadrons
2016-11-27 15:32:03 +09:00
Lanny91
b18950f776
Added simd real divide test with QPX divide fixes
2016-11-25 13:21:33 +00:00
Lanny91
0acbf77bc6
Add QPX Div structure
2016-11-24 13:24:12 +00:00
5833f247fa
more FFt optimisations
2016-11-24 09:09:48 +09:00
Azusa Yamaguchi
95f43d27ae
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-22 13:49:22 +00:00
Azusa Yamaguchi
668ca57702
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-22 13:49:11 +00:00
a2cffb0304
AVXFMA target fixed
2016-11-21 17:47:18 +01:00
97cddda49e
Merge branch 'feature/gen-simd' into feature/doxygen
...
# Conflicts:
# Makefile.am
# configure.ac
2016-11-19 13:11:13 +01:00
b873504b90
fully generic SIMD
2016-11-19 01:32:39 +01:00
042ae5b87c
generic 256bits SIMD
2016-11-15 12:16:15 +00:00
paboyle
604f0ea2f6
Merge branch 'develop' into release/v0.6.0
2016-11-09 04:13:01 -08:00
paboyle
33dc1f51b5
Final sign off commits from Cori-1
2016-11-09 04:11:03 -08:00
13a8997789
Merge branch 'release/v0.6.0' into feature/hadrons
...
# Conflicts:
# Makefile.am
2016-11-08 20:43:39 +00:00
9576f0903d
namespace fix
2016-11-08 19:07:47 +00:00
8a5e3a917c
Merge branch 'develop' into release/v0.6.0
...
# Conflicts:
# tests/core/Test_fft_gfix.cc
2016-11-08 16:53:42 +00:00
3d2a22a14d
include fix for MKL
2016-11-08 15:31:47 +00:00
azusayamaguchi
f85b35314d
Fix a routine for single node processor coor from rank
2016-11-08 11:49:13 +00:00
azusayamaguchi
0cff8754d1
Usecs
2016-11-08 11:35:41 +00:00
azusayamaguchi
692b44dac1
Merge branch 'develop' into release/v0.6.0
2016-11-04 22:48:11 +00:00
azusayamaguchi
96ba42a297
omm buf
2016-11-04 22:47:25 +00:00
azusayamaguchi
f7b60004f3
Merge branch 'develop' into release/v0.6.0
2016-11-04 16:08:07 +00:00
ad971ca07b
fftw3.h is now expected to be an external header
2016-11-04 13:12:35 +00:00
f2f16eb972
fftw3.h removed, please don't commit this file back
2016-11-04 13:11:05 +00:00
azusayamaguchi
b7d55f7dfb
Fix a typo in reorg of the --dslash-asm
2016-11-04 11:35:08 +00:00
azusayamaguchi
6e548a8ad5
Linux compile needed
2016-11-04 11:34:16 +00:00
Azusa Yamaguchi
ee686a7d85
Compiles now
2016-11-03 16:58:23 +00:00
Azusa Yamaguchi
1c5b7a6be5
Staggered phases first cut, c1, c2, u0
2016-11-03 16:26:56 +00:00
a5dd4a9bab
Merge branch 'feature/fft-opt' into develop
2016-11-03 14:34:46 +00:00
ec232af851
Photon.h references removed
2016-11-03 14:34:16 +00:00
17e30281e9
Merge branch 'develop' into feature/fft-opt
...
# Conflicts:
# lib/FFT.h
2016-11-03 14:14:03 +00:00
aee44dc694
Photon.h removed from develop branch
2016-11-03 13:54:15 +00:00
75bbf6a0af
Merge branch 'develop' into feature/feynman-rules
2016-11-03 13:52:11 +00:00
paboyle
111bfbc6bc
notimestamp by default
2016-11-03 11:40:26 +00:00
paboyle
f41a230b32
Decrease mpi3l verbose
2016-11-02 19:54:03 +00:00
paboyle
c067051d5f
Merge branch 'develop' into release/v0.6.0
2016-11-02 13:59:18 +00:00
paboyle
9e2ec2719b
Merge branch 'develop' into feature/mpi3-master-slave
2016-11-02 13:02:56 +00:00
paboyle
757a928f9a
Improvement to use own SHM_OPEN call to avoid openmpi bug.
2016-11-02 12:37:46 +00:00
Guido Cossu
bc248b6948
Merge branch 'release/v0.6.0' into feature/KNL_double_prec
...
Conflicts:
lib/simd/Grid_avx512.h
2016-11-02 10:40:49 +00:00
Guido Cossu
ae8561892e
Eliminating useless defines
2016-11-02 10:21:06 +00:00
paboyle
32375aca65
Semaphore sleep/wake up on remote processes.
2016-11-02 09:27:20 +00:00
paboyle
bb94ddd0eb
Tidy up of mpi3; also some cleaning of the dslash controls.
2016-11-02 08:07:09 +00:00
James Harrison
7f0fc0eff5
Remove explicit use of double-precision types in photon.h
2016-11-01 16:02:35 +00:00
Azusa Yamaguchi
164d3691db
Staggered
2016-11-01 14:24:22 +00:00
paboyle
791cb050c8
Comms improvements
2016-11-01 11:35:43 +00:00
d5e95bc350
Merge branch 'release/v0.6.0' into feature/feynman-rules
2016-10-31 18:36:21 +00:00
7a84906b5f
Merge branch 'release/v0.6.0' into feature/fft-opt
2016-10-31 18:31:49 +00:00
66d832c733
FFTW header fix
2016-10-31 16:39:29 +00:00
e74417ca12
big build system polish
2016-10-31 16:31:27 +00:00
Guido Cossu
e8c3174ae2
Small change in the defines
2016-10-30 12:23:11 +00:00
Guido Cossu
9b066e94d0
Compilation with both single and double precision
2016-10-30 12:04:06 +00:00
James Harrison
618abdf302
Add missing volume factor in stochastic QED field
2016-10-29 11:04:02 +01:00
Guido Cossu
e1042aef77
First version of the doube prec for testing purposes
...
It does not compile single and double version at the same time
2016-10-28 17:20:04 +01:00
paboyle
aa6a839c60
avx512 build fix; detect clang/gcc intrinsics vs. ICPC
2016-10-28 09:13:09 +01:00
b4d2af8c89
threaded FFT
2016-10-26 19:46:36 +01:00
434af6aeaa
Merge branch 'develop' into feature/fft-opt
2016-10-26 18:50:38 +01:00
e90f8ac841
Merge branch 'develop' into feature/feynman-rules
2016-10-26 18:50:21 +01:00
a1705a8d53
debug message removed
2016-10-26 18:50:07 +01:00
ca21003f01
Merge branch 'feature/fft-opt' into feature/feynman-rules
...
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
2016-10-26 18:44:47 +01:00
14ddf2c234
more FFT optimisations
2016-10-26 17:36:26 +01:00
Azusa Yamaguchi
bca861e112
Note:FFT shoud be GridFFT (Not change yet).
...
Gauge fix with FFt is added (tests/core)
2016-10-25 14:21:48 +01:00
33d199a0ad
temporary thread safety in FFT
2016-10-25 12:56:40 +01:00
paboyle
b820076b91
Merge branch 'develop' into feature/mpi3
2016-10-25 06:02:33 +01:00
paboyle
09f66100d3
MPI 3 compile on non-linux
2016-10-25 06:01:12 +01:00
azusayamaguchi
d7d92af09d
Travis fail fix attempt
2016-10-25 01:45:53 +01:00
azusayamaguchi
460d0753a1
Merge branch 'develop' into feature/mpi3
...
Conflicts:
lib/simd/Grid_avx512.h
2016-10-25 01:08:51 +01:00
azusayamaguchi
8f8058f8a5
More random bits on parallel seeding
2016-10-25 01:05:52 +01:00
azusayamaguchi
d97a27f483
Verbose
2016-10-25 01:05:31 +01:00
azusayamaguchi
7c3363b91e
Compiles all comms targets
2016-10-25 00:04:17 +01:00
azusayamaguchi
b94478fa51
mpi, mpi3, shmem all compile.
...
mpi, mpi3 pass single node multi-rank
2016-10-24 23:45:31 +01:00
13bf0482e3
FFT optimisation
2016-10-24 19:25:40 +01:00
a795b5705e
memory optimisation
2016-10-24 19:25:15 +01:00
392e064513
fast local peek-poke
2016-10-24 19:24:21 +01:00
azusayamaguchi
b6a65059a2
Update to use shared memory to contain the stencil comms buffers
...
Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions
2016-10-24 17:30:43 +01:00
azusayamaguchi
ea25a4d9ac
Works
2016-10-23 06:10:05 +01:00
azusayamaguchi
c190221fd3
Internal SHM comms in non-simd directions working
...
Need to fix simd directions
2016-10-22 18:14:27 +01:00
azusayamaguchi
0fcd2e7188
Simplify the comms structure prior to implementing Shared memory direct bouncs
2016-10-21 22:44:10 +01:00
azusayamaguchi
910b8dd6a1
use simd type
2016-10-21 22:35:29 +01:00
azusayamaguchi
75ebd3a0d1
Typo fixes and rotate for CLANG
2016-10-21 22:34:29 +01:00
7c8f79b147
more stochastic QED fixes
2016-10-21 15:20:12 +01:00
azusayamaguchi
09fd5c43a7
Reasonably fast version
2016-10-21 15:17:39 +01:00
462921e549
QED: fix stochastic field
2016-10-21 14:41:08 +01:00
azusayamaguchi
f22317748f
Merge branch 'feature/mpi3' of https://github.com/paboyle/Grid into feature/mpi3
2016-10-21 13:36:35 +01:00
azusayamaguchi
6a9eae6b6b
Reporting improvements
2016-10-21 13:36:18 +01:00
azusayamaguchi
fad96cf250
StencilBufs
2016-10-21 13:36:00 +01:00
azusayamaguchi
f331809c27
Use variable type for loop
2016-10-21 13:35:37 +01:00
bd6a228af6
Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules
2016-10-21 13:10:30 +01:00
63d219498b
first (dirty) implementation of Feynman stoctachtic EM field
2016-10-21 13:10:13 +01:00
paboyle
2c54a53d0a
Compile verbose reduce
2016-10-21 12:12:14 +01:00
paboyle
306160ad9a
bcopy threaded
2016-10-21 12:07:28 +01:00
azusayamaguchi
20a091c3ed
Intel vs. Clang intrinsics differences absorbed
2016-10-21 09:08:36 +01:00
azusayamaguchi
202078eb1b
Cray / OpenSHMEM ordering differs
2016-10-21 09:07:20 +01:00
paboyle
a762b1fb71
MPI3 working with a bounce through shared memory on my laptop.
...
Longer term plan: make the "u_comm_buf" in Stencil point to the shared region and avoid the
send between ranks on same node.
2016-10-21 09:03:26 +01:00
paboyle
5b5925b8e5
Forgot to add
2016-10-20 17:09:40 +01:00
paboyle
b58adc6a4b
commVector
2016-10-20 17:00:15 +01:00
paboyle
f9d5e95d72
allocator template typedefs moved to AlignedAllocator
2016-10-20 16:59:39 +01:00
paboyle
4f8e636a43
commVector
2016-10-20 16:59:16 +01:00
paboyle
9b39f35ae6
commVector different for SHMEM compat
2016-10-20 16:58:53 +01:00
paboyle
5fe2b85cbd
MPI3 and shared memory support
2016-10-20 16:58:01 +01:00
paboyle
c7cccaaa69
Comm vector for shmem
2016-10-20 16:57:31 +01:00
paboyle
cbcfea466f
MPI3
2016-10-20 16:57:14 +01:00
paboyle
4955672fc3
MPI3
2016-10-20 16:57:00 +01:00
paboyle
8c043da5b7
SHMEM and comms allocator made different
2016-10-20 16:56:05 +01:00
paboyle
3cbe974eb4
Layout
2016-10-20 16:55:21 +01:00
997fd882ff
Merge branch 'develop' into feature/feynman-rules
...
# Conflicts:
# lib/Threads.h
# lib/qcd/action/fermion/WilsonFermion.cc
# lib/qcd/action/fermion/WilsonFermion.h
# lib/qcd/utils/SUn.h
# lib/simd/Grid_avx.h
# lib/simd/Intel512common.h
2016-10-19 18:35:18 +01:00
paboyle
7af9b87318
Cache face tables to improve performance.
...
Extract merge now looking poor.
2016-10-18 09:51:37 +01:00
paboyle
811ca45473
GNU clang hack for AVX512 since there are missing reduce intrinsics in Clang 3.9 and GCC-6 AVX512 support
2016-10-17 16:23:21 +01:00
paboyle
bc1a4d40ba
Faster integer handling avoid push_back
2016-10-17 16:16:44 +01:00
paboyle
c8079e6621
Time the face gateher in x-dir more carefully
2016-10-13 22:28:50 +01:00
azusayamaguchi
8b0d171c9a
32bit issue on the KNL code variant where byte offsets were stored
2016-10-12 17:49:32 +01:00
azusayamaguchi
8bbd9ebc27
Reversing changes to Stencil class
2016-10-12 13:47:20 +01:00
azusayamaguchi
6472b431f0
__rdpmc needed for gcc, clang++
2016-10-12 12:29:08 +01:00
azusayamaguchi
bd205a3293
Fixing for non x86 and non KNL
2016-10-12 12:09:15 +01:00
azusayamaguchi
496beffa88
Fix non-KNL build
2016-10-12 12:06:08 +01:00
azusayamaguchi
9b63e97108
align not absolutely required and confuses clang++
2016-10-12 11:51:21 +01:00
azusayamaguchi
81f2aeaece
KNL streaming stores, and KNL performance coutners
2016-10-12 11:45:22 +01:00
paboyle
2d4a45c758
Typecast pointer
2016-10-12 09:14:15 +01:00
paboyle
a123dcd7e9
Static required for shmem. Reading same object twice requires csum reset
2016-10-12 00:29:57 +01:00
paboyle
6b27c42dfe
Cosmetic
2016-10-12 00:29:39 +01:00
paboyle
f7c2aa3ba5
runtime by default
2016-10-12 00:29:13 +01:00
paboyle
7240d73184
Parallelise the x faces; fix the segv on KNL with comms
2016-10-11 22:21:07 +01:00
paboyle
42cd148f5e
Base pointer for comms buffer under AVX512 assembly
2016-10-11 16:06:06 +01:00
paboyle
6e01264bb7
don't use static by default
2016-10-11 10:03:39 +01:00
paboyle
6f408256bc
FMA4 option moved on the align
2016-10-11 10:03:01 +01:00
paboyle
8d11681aac
verbose remove
2016-10-10 23:50:42 +01:00
paboyle
3d5c9a1ee9
No compile fix on clang++ 3.9
2016-10-10 23:50:13 +01:00
paboyle
dc389e467c
axpy_ssp for any coeff type via template
2016-10-10 23:48:05 +01:00
paboyle
3619167d62
Mass parameter
2016-10-10 23:47:33 +01:00
paboyle
96f1d1b828
Debugged Domain wall and Overlap feynman rules (infinite Ls, finite mass).
2016-10-10 23:46:45 +01:00
paboyle
657e0a8f4d
Mass parameter
2016-10-10 23:46:10 +01:00
paboyle
616e7cd83e
Mass parameter
2016-10-10 23:45:48 +01:00
paboyle
6f26d2e8d4
Overlap tree level feynman rule
2016-10-10 23:45:18 +01:00
paboyle
c014574504
A "please implement me" feynman rule. If this were abstract virtual it would
...
require/force implementation
2016-10-10 23:44:00 +01:00
paboyle
d7ce164e6e
Feynman rule for DWF
2016-10-10 23:43:36 +01:00
paboyle
c0d5b99016
Dminus
2016-10-10 23:43:19 +01:00
paboyle
09ca32d678
Dminus added for Cayley
2016-10-10 23:42:55 +01:00
paboyle
082ae350c6
static schedule by default
2016-10-10 23:42:30 +01:00
Guido Cossu
611b5d74ba
Fix for AVX+FMA3 compilation
2016-10-10 15:26:17 +01:00
Guido Cossu
b56c9ffa52
Fix for AVXFMA
2016-10-10 14:43:37 +01:00
cb02b7088f
Merge branch 'develop' into feature/doxygen
...
# Conflicts:
# configure.ac
2016-10-09 13:35:44 +01:00
Guido Cossu
2e453dfbf5
Added some instrumentation to benchmark the force computation
2016-10-06 17:52:45 +01:00
paboyle
4089984431
Timing hooks
2016-10-06 09:25:12 +01:00
Guido Cossu
c78bbd0f8c
Fix ASM compilation
2016-10-04 15:37:32 +01:00
536e2ff073
*.inc removed: please don't commit these files either!
2016-09-27 11:54:03 +01:00
paboyle
87acd06990
Use streaming stores
2016-09-26 10:11:34 +01:00
paboyle
9353b6edfe
Fenv out of grid namespace
2016-09-26 10:09:13 +01:00
paboyle
167cc2650e
GNU SOURCE problem on travis
2016-09-26 09:58:09 +01:00
paboyle
7089b6d5a5
Setting up but not implemented some QED rules
2016-09-26 09:43:40 +01:00
paboyle
2ba7d43ddd
Divide handling
2016-09-26 09:43:14 +01:00
paboyle
836e929565
Divide handling improved
2016-09-26 09:42:22 +01:00
paboyle
b6713ecb60
Momentum space rules for Overlap, DWF untested to date
2016-09-26 09:39:09 +01:00
paboyle
52a39f0fcd
Divide in ET
2016-09-26 09:38:38 +01:00
paboyle
81a7a03076
Integer <<
2016-09-26 09:38:17 +01:00
paboyle
16b37b956c
divide goes to ET
2016-09-26 09:37:59 +01:00
paboyle
567b6cf23f
demangle moves to logging
2016-09-26 09:36:51 +01:00
paboyle
296396646d
FPE's on macos set up
2016-09-26 09:36:14 +01:00
Guido Cossu
5c190a1b8c
Merge branch 'develop' into feature/hirep
2016-09-23 11:06:06 +01:00
Guido Cossu
c4ac6e7e8f
Consolidating HMC interface
...
Uniformed interface for standard action in fundamental rep and Hirep
2016-09-23 10:47:42 +01:00
Guido Cossu
510e340e16
Debugged last commit for the Two index representation
2016-09-22 22:16:21 +01:00
Guido Cossu
6ffadca153
Restored number of colours to 3
2016-09-22 14:22:54 +01:00
Guido Cossu
b6597b74e7
Added support for the Two index Symmetric and Antisymmetric representations
...
Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations
2016-09-22 14:17:37 +01:00
a034e9901b
Merge branch 'develop' into feature/hadrons
2016-09-20 13:49:33 +01:00
Antonin Portelli
0724f7af75
QPX single precision implementation
2016-09-19 18:09:12 +01:00
2e74520821
removed libtool use (BG/Q compatibility)
2016-09-16 15:25:49 +01:00
Antonin Portelli
6dd75ad9e5
Merge branch 'develop' of github.com:paboyle/Grid into feature/bgq
2016-09-16 15:07:54 +01:00
Guido Cossu
fda408ee6f
Added first lines for supporting Two Index representations
2016-09-13 10:43:30 +01:00
Guido Cossu
b9c80318a2
Merge branch 'develop' into feature/hirep
2016-09-13 10:01:51 +01:00
Guido Cossu
5df5d52d41
Fix for the Intel compiler
2016-09-12 17:17:20 +01:00
Guido Cossu
f76f281e58
Cleaning files after fix
2016-09-09 11:34:25 +01:00
Guido Cossu
aa20cc8b52
Fixing compilation error with AVX512 flag
2016-09-09 02:58:52 -07:00
Guido Cossu
0fd179fb33
Merge branch 'develop' into feature/hirep
2016-09-01 12:59:53 +01:00
Guido Cossu
f45ef8d114
Minor modification in ActionBase.h
2016-09-01 11:46:46 +01:00
paboyle
8535d433a7
Cold or hot must support any precisoin
2016-08-31 00:27:53 +01:00
paboyle
b573d1f35a
Wilson tree level added
2016-08-31 00:27:04 +01:00
paboyle
0c1d7e4daf
Mom space prop for Wilson action
2016-08-31 00:26:36 +01:00
paboyle
02e983a0cd
Momentum space prop and free prop convolution
2016-08-31 00:26:02 +01:00
paboyle
d15ab66aae
FFT moves higher in include order
2016-08-31 00:25:22 +01:00
paboyle
9005b82c6d
Multi dim FFT, and normalisation fix
2016-08-31 00:24:52 +01:00
paboyle
3475f45ce7
Demangle support for typeid stuff
2016-08-31 00:23:48 +01:00
paboyle
0744f38866
Demangle support is useful
2016-08-31 00:23:28 +01:00
Guido Cossu
fd5614738d
Merge branch 'develop' into feature/hirep
2016-08-30 18:21:36 +01:00
Guido Cossu
b0d3e4bb2c
Separating travis builds
2016-08-30 13:44:07 +01:00
Guido Cossu
b512ccbee6
HMC for Adjoint fermions works
...
Accepts and reproduces known results
Check initial instability of inverters
when starting from hot configurations
2016-08-30 11:31:25 +01:00
paboyle
8c89391c02
FFTW unresolved fixed when no fftw3.h
2016-08-24 16:41:47 +01:00
paboyle
bfac5195b8
tidy up
2016-08-24 16:38:36 +01:00
paboyle
744691097f
Printing
2016-08-24 15:05:56 +01:00
paboyle
ff6da364e8
FFT double and single precision gives good performance now in multithreaded code.
2016-08-24 15:05:00 +01:00
4d11a6f5f2
first commit for QPX intrinsics
2016-08-23 14:41:44 +01:00
paboyle
88be3b39bb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-08-22 18:29:36 +01:00
paboyle
356e7940fd
fftw can be switched off
2016-08-22 16:24:49 +01:00
paboyle
73ce476890
Include fftw headers
2016-08-22 16:24:21 +01:00
paboyle
e423a09974
FFT improved and test_FFT passing under MPI 8 processes, 8^4 for LatticeComplexD and LatticeSpinMatrixD
2016-08-18 02:23:21 +01:00
paboyle
17097a93ec
FFTW test ran over 4 mpi processes.
2016-08-17 01:33:55 +01:00
paboyle
4ab7dbfd57
Instantiate
2016-08-15 23:00:40 +01:00
paboyle
90e70790f3
Feature for z-Mobius prep
2016-08-15 22:31:29 +01:00
Guido Cossu
9c2e8d5e28
Nc=3 just to let all the test pass in Travis
2016-08-09 15:46:57 +01:00
Guido Cossu
147e2025b9
Added unit tests on the representation transformations
...
Status: Passing all tests
2016-08-08 16:54:22 +01:00
b1cfb4d661
first try at a nicer Doxygen implementation
2016-08-05 15:29:18 +01:00
paboyle
32bc7a6ab8
MPI back out of change that hangs
...
AVX2 for clang, gcc needs the -mfma flag.
2016-08-05 10:36:00 +01:00
7ff7c7d90d
Merge branch 'develop' into feature/hadrons
2016-08-04 16:22:10 +01:00
93d29bb699
build system improvements after discussion with Peter
2016-08-04 16:19:59 +01:00
2485ef9c9c
Merge branch 'feature/new-build' into feature/hadrons
...
# Conflicts:
# Makefile.am
# scripts/copyright
2016-08-03 16:49:16 +01:00
9e5b934d21
improved LAPACK configuration
2016-08-02 17:26:54 +01:00
Guido Cossu
49b5c49851
Checked the hermiticity of the op in derivative, ok
...
Still CG fails to converge
2016-07-31 12:37:33 +01:00
e9f30cab2c
first working version for the new build system
2016-07-30 17:53:18 +01:00
Guido Cossu
089f0ab582
Debugged HMC for Creutz relation
2016-07-28 16:44:41 +01:00
Guido Cossu
b93e18ed50
Modified the Dirac Kernel class to compile with different number of colours
...
Added the general push_back functionality to accomodate for all defined representations
Compiles, not tested
2016-07-18 16:36:28 +01:00
Guido Cossu
9c77bb69a5
Added all elements for Hirep HMC
...
TODO: Test and debug
2016-07-18 12:05:23 +01:00
paboyle
f9e90eeb1f
Sign error on the force for 4d fields fixed
2016-07-16 01:52:44 +01:00
paboyle
fad5c675eb
sign error on the 4d gparity force
2016-07-16 01:51:56 +01:00
paboyle
4908b77d46
Fixed conflicts. PLEASE avoid making wholesale cosmetic only changes, this created
...
a HUGE amount of difficult to resolve and understand conflicts .
Wholesale formatting, reordering functions etc... in a central file like Tensor_class
or Grid_vector_types while others are also editing without making substantial functionality
changes creates pain.
2016-07-15 20:59:07 +01:00
paboyle
f4dd5062d7
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-07-15 19:26:06 +01:00
paboyle
980ff18956
Solving the instantiation no compile issue
2016-07-15 17:19:44 +01:00
Guido Cossu
7edf4c6c04
Added HMC utitities for the higher representations
...
TODO: Inherit types for the pseudofermions, Debugging, testing
2016-07-15 13:39:47 +01:00
paboyle
1a6c7204ac
Disable instantiation; Use cache version instead
2016-07-15 00:34:39 +01:00
paboyle
49310fbab3
Done with red black change over
2016-07-15 00:08:43 +01:00
paboyle
5c0c8efb9e
Updated file list
2016-07-15 00:02:11 +01:00