paboyle
|
73cdf0fffe
|
Drop f16c from SSE because of a macos compile error on travis
|
2017-04-13 11:23:41 +01:00 |
|
paboyle
|
1c25773319
|
Trap illegal instructions
|
2017-04-13 10:51:40 +01:00 |
|
paboyle
|
94eb829d08
|
Align cast fixed for __m128i; gcc complained
|
2017-04-13 08:40:44 +01:00 |
|
paboyle
|
68392ddb5b
|
Exchange in generic
Precision change in AVX, SSE, AVX512, Generic. QPX still to do.
|
2017-04-13 08:38:12 +01:00 |
|
paboyle
|
cb6b81ae82
|
Half precision conversion
|
2017-04-12 19:32:37 +01:00 |
|
|
53e76b41d2
|
Merge branch 'develop' into feature/hadrons
|
2017-04-10 17:00:53 +01:00 |
|
|
8ef4300412
|
spurious .dirstamp files removed
|
2017-04-10 17:00:22 +01:00 |
|
|
98a24ebf31
|
The macro “magic” is very intensive for the preprocessor in the measurement code, which has numerous serialisable classes. Reducing the number of serialisable fields to 64 (instead of 1024) helps a lot; this is enough for now and can be extended trivially if needed in the future.
|
2017-04-10 16:58:54 +01:00 |
|
paboyle
|
b12dc89d26
|
Commenting and clean up
|
2017-04-10 20:38:20 +09:00 |
|
paboyle
|
d80d802f9d
|
MultiRHS solver test
|
2017-04-10 00:12:12 +09:00 |
|
paboyle
|
3d99b09dba
|
Start of blockCG
|
2017-04-09 23:42:10 +09:00 |
|
paboyle
|
db5f6d3ae3
|
Verbose fix
|
2017-04-09 23:41:30 +09:00 |
|
paboyle
|
683550f116
|
Const args improvement
|
2017-04-09 23:41:04 +09:00 |
|
Chulwoo Jung
|
f80a847aef
|
Merge branch 'develop' into bugfix/dminus
|
2017-04-06 23:49:10 -04:00 |
|
Chulwoo Jung
|
93cb5d4e97
|
Working version of Lanczos without the extra copy.
|
2017-04-06 23:35:30 -04:00 |
|
Chulwoo Jung
|
9e48b7dfda
|
MEM_SAVE in Lanczos seems to be working, but not pretty
|
2017-04-06 22:21:56 -04:00 |
|
paboyle
|
86aaa35294
|
Christoph needs SchurDiagTwoKappa which is mobius specific.
|
2017-04-07 11:07:40 +09:00 |
|
Guido Cossu
|
8c540333d5
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-04-05 14:41:04 +01:00 |
|
Chulwoo Jung
|
d0c2c9c71f
|
Merge branch 'develop' of https://github.com/paboyle/Grid into bugfix/dminus
|
2017-04-04 15:20:17 -04:00 |
|
Chulwoo Jung
|
c8cafa77ca
|
Checking in the latest Lanczos
|
2017-04-04 15:18:12 -04:00 |
|
paboyle
|
5592f7b8c1
|
Creation mode better implementation
|
2017-04-05 02:35:34 +09:00 |
|
paboyle
|
35da4ece0b
|
UID fix
|
2017-04-05 02:18:15 +09:00 |
|
|
ff4e54ef80
|
Merge branch 'develop' into feature/hadrons
|
2017-04-03 18:56:21 +01:00 |
|
paboyle
|
83f6fab8fa
|
Big/Small crush test, and fast SITMO rng init, faster but not ideal
MT and Ranlux init.
|
2017-04-02 12:10:51 +09:00 |
|
paboyle
|
9dc7ca4c3b
|
Sitmo fast init
|
2017-04-02 00:28:22 +09:00 |
|
paboyle
|
935d82f5b1
|
sanity checks
|
2017-04-02 00:27:28 +09:00 |
|
paboyle
|
9cbcdd65d7
|
No random device seed
|
2017-04-02 00:26:57 +09:00 |
|
paboyle
|
7e5faa0f34
|
Multiple RNGs
|
2017-04-02 00:25:44 +09:00 |
|
paboyle
|
1c4bc7ed38
|
Debugged staggered conventions
|
2017-03-31 14:41:48 +09:00 |
|
Chulwoo Jung
|
a3bcad3804
|
Added preconditioned SYM2 solver (SchurRedBlackDiagTwoSolve)
|
2017-03-30 20:33:27 -04:00 |
|
Chulwoo Jung
|
5a5b66292b
|
Merge branch 'develop' of https://github.com/paboyle/Grid into bugfix/dminus
|
2017-03-30 10:44:02 -04:00 |
|
paboyle
|
93ea5d9468
|
Pretty code
|
2017-03-30 15:00:03 +09:00 |
|
paboyle
|
9fd23faadf
|
Pretty layout
|
2017-03-30 13:44:45 +09:00 |
|
paboyle
|
10e4fa0dc8
|
Template instantiation improvements
|
2017-03-30 13:44:25 +09:00 |
|
paboyle
|
c4aca1dde4
|
Conjugate coefficients on adjoint
|
2017-03-30 13:44:05 +09:00 |
|
paboyle
|
b9e8ea3aaa
|
conjugate coefficient on the dagger
|
2017-03-30 13:43:13 +09:00 |
|
paboyle
|
077aa728b9
|
Fix the ZMobius (I think)
|
2017-03-30 13:42:09 +09:00 |
|
paboyle
|
a8d83d886e
|
Macro controls
|
2017-03-30 13:31:34 +09:00 |
|
paboyle
|
7fd46eeec4
|
Trailing whitespace removal
|
2017-03-30 13:31:10 +09:00 |
|
paboyle
|
2b115929dc
|
Small AVX512 asm ifdef patch
|
2017-03-29 18:51:23 +09:00 |
|
paboyle
|
417ec56cca
|
Release candidate
|
2017-03-29 05:45:33 -04:00 |
|
paboyle
|
756bc25008
|
Verbose header print by default
|
2017-03-29 04:44:17 -04:00 |
|
paboyle
|
35695ba57a
|
Bug fix in MPI3
|
2017-03-29 04:43:55 -04:00 |
|
paboyle
|
d805867e02
|
Better init
|
2017-03-28 13:25:05 -04:00 |
|
paboyle
|
98f9318279
|
Build on AVX2 and MPI passing with clang++
|
2017-03-28 23:16:04 +09:00 |
|
paboyle
|
4b17e8eba8
|
Merge branch 'develop' into feature/bgq-asm
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
|
2017-03-28 04:49:30 -04:00 |
|
Chulwoo Jung
|
e63be32ad2
|
zmobius Meooe5D fixed?
|
2017-03-28 03:48:50 -04:00 |
|
paboyle
|
75112a632a
|
IO improvements to fail on IO error
|
2017-03-28 02:28:04 -04:00 |
|
paboyle
|
18bde08d1b
|
Merge branch 'feature/staggering' into develop
|
2017-03-28 15:25:55 +09:00 |
|
Chulwoo Jung
|
33d59c8869
|
Adding Zmobius prec test
|
2017-03-27 21:40:27 -04:00 |
|
Chulwoo Jung
|
a833fd8dbf
|
Merge branch 'develop' of https://github.com/paboyle/Grid into bugfix/dminus
|
2017-03-27 21:37:26 -04:00 |
|
Guido Cossu
|
4c1ea8677e
|
Small cosmetic changes and vscode gitignore
|
2017-03-23 14:09:35 +09:00 |
|
paboyle
|
fc93f0b2ec
|
Save some code for static huge tlb's. It is ifdef'ed out but an interesting root only experiment.
No gain from it.
|
2017-03-21 22:30:29 -04:00 |
|
paboyle
|
8c8473998d
|
Average the comm time over the whole cluster.
|
2017-03-21 22:29:51 -04:00 |
|
Guido Cossu
|
120fb59978
|
Adding tests for WilsonFlow classes
|
2017-03-21 16:11:35 +09:00 |
|
Guido Cossu
|
fd56b3ff38
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-03-21 13:33:41 +09:00 |
|
Guido Cossu
|
0ec6829edc
|
Fixing compilation errors for the WilsonFlow
|
2017-03-21 13:06:32 +09:00 |
|
Guido Cossu
|
18b7845b7b
|
Adding WilsonFlow smearing
|
2017-03-21 11:52:05 +09:00 |
|
Guido Cossu
|
3d0fe15374
|
Added topological charge measurement
|
2017-03-17 16:14:57 +09:00 |
|
Guido Cossu
|
91886068fe
|
Fixed seg fault for observable modules
|
2017-03-17 13:59:31 +09:00 |
|
Guido Cossu
|
6d1e9e5f92
|
Small cleanup of the observables
|
2017-03-17 11:42:55 +09:00 |
|
Guido Cossu
|
b640230b1e
|
Moving hmc observables in a different directory
|
2017-03-17 11:40:17 +09:00 |
|
paboyle
|
e7c36771ed
|
ZMobius prep for asm
|
2017-03-15 14:23:33 -04:00 |
|
paboyle
|
8dc57a1e25
|
Layout change
|
2017-03-13 11:11:46 +00:00 |
|
paboyle
|
f57bd770b0
|
Merge branch 'bugfix/dminus' into feature/bgq-asm
|
2017-03-13 11:11:03 +00:00 |
|
paboyle
|
4ed10a3d06
|
Merge branch 'develop' into feature/bgq-asm
|
2017-03-13 11:10:10 +00:00 |
|
Chulwoo Jung
|
33edde245d
|
Changing Dminus(Dag) to use full vectors to work correctly
|
2017-03-12 23:02:42 -04:00 |
|
paboyle
|
447c5e6cd7
|
Z mobius hermiticity correction
|
2017-03-13 01:30:43 +00:00 |
|
paboyle
|
8b99d80d8c
|
Merge branch 'bgq-asm-shmemfixes' into feature/bgq-asm
|
2017-03-12 23:30:09 +00:00 |
|
Guido Cossu
|
b3dede4dd3
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-03-10 23:57:37 +09:00 |
|
Guido Cossu
|
4e34132f4d
|
Correcting modules use in test files
|
2017-03-10 23:54:53 +09:00 |
|
Guido Cossu
|
c07cb10247
|
Merge branch 'feature/hmc_generalise' of https://github.com/paboyle/Grid into feature/hmc_generalise
|
2017-03-10 22:37:25 +09:00 |
|
Guido Cossu
|
d7767a2a62
|
Few more tests
|
2017-03-10 22:33:48 +09:00 |
|
Guido Cossu
|
ec035983fd
|
Fixing the implicit integration
|
2017-03-01 11:56:35 +00:00 |
|
paboyle
|
af230a1fb8
|
Average the time across the whole machine for outliers
|
2017-02-28 17:05:22 -05:00 |
|
Christopher Kelly
|
06a132e3f9
|
Fixes to SHMEM comms
|
2017-02-28 13:31:54 -08:00 |
|
Guido Cossu
|
596dcd85b2
|
Auxiliary fields
|
2017-02-27 13:16:38 +00:00 |
|
paboyle
|
96d44d5c55
|
Header fix
|
2017-02-24 19:12:11 -05:00 |
|
Guido Cossu
|
7270c6a150
|
Integrator works now
|
2017-02-24 17:03:42 +00:00 |
|
Lanny91
|
7fe797daf8
|
SIMD vector length sanity checks
|
2017-02-23 16:49:44 +00:00 |
|
Lanny91
|
486a01294a
|
Corrected QPX SIMD width
|
2017-02-23 16:47:56 +00:00 |
|
paboyle
|
586a7c90b7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:26:59 +00:00 |
|
paboyle
|
e099dcdae7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:25:29 +00:00 |
|
paboyle
|
4e7ab3166f
|
Refactoring header layout
|
2017-02-22 18:09:33 +00:00 |
|
paboyle
|
aac80cbb44
|
Bug fix from Chris K
|
2017-02-22 12:19:09 -05:00 |
|
Lanny91
|
c80948411b
|
Added tRotate function and MaddRealPart struct for generic SIMD, bugfix in MultRealPart and minor cosmetic changes.
|
2017-02-22 14:57:10 +00:00 |
|
Lanny91
|
95625a7bd1
|
Use Grid Integer type
|
2017-02-22 13:09:32 +00:00 |
|
Lanny91
|
0796696733
|
Emulated integer vector type for QPX and generic SIMD instruction sets.
|
2017-02-22 12:01:36 +00:00 |
|
azusayamaguchi
|
1c30e9a961
|
Verified
|
2017-02-21 23:01:25 +00:00 |
|
Francesco Sanfilippo
|
93cc270016
|
making public some serializable parameters in HMC Module
RNGModuleParameters
GridModuleParameters
|
2017-02-21 23:11:56 +01:00 |
|
Francesco Sanfilippo
|
15e668eef1
|
now it is possible to pass {coords list} to a peek or poke
|
2017-02-21 22:48:38 +01:00 |
|
azusayamaguchi
|
bf7e3f20d4
|
Staggered fermion optimised version
|
2017-02-21 14:35:42 +00:00 |
|
Guido Cossu
|
902afcfbaf
|
Adding metric and the implicit steps
|
2017-02-21 11:30:57 +00:00 |
|
paboyle
|
3ae92fa2e6
|
Global changes to parallel_for structure.
Move the comms flags to more sensible names
|
2017-02-21 05:24:27 -05:00 |
|
paboyle
|
3906cd2149
|
Stencil fix on BNL KNL system
|
2017-02-20 17:51:31 -05:00 |
|
paboyle
|
661fc4d3d1
|
Debug AVX512 exchange code paths
|
2017-02-20 17:48:36 -05:00 |
|
paboyle
|
41009cc142
|
Move exchange into the stencil only; keep Cshift fully general
|
2017-02-20 17:48:04 -05:00 |
|
paboyle
|
37720c4db7
|
Count bytes off node only
|
2017-02-20 17:47:40 -05:00 |
|
Guido Cossu
|
97a6b61551
|
Covariant laplacian and implicit integration
|
2017-02-20 11:17:27 +00:00 |
|
paboyle
|
cd0da81196
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2017-02-16 18:52:30 -05:00 |
|
paboyle
|
f246fe3304
|
Improvements to avx for invertible to avoid latent bug
|
2017-02-16 23:52:44 +00:00 |
|
paboyle
|
8a29c16bde
|
Faster gather exchange
|
2017-02-16 23:52:22 +00:00 |
|
paboyle
|
d68907fc3e
|
Debug temp
|
2017-02-16 18:51:35 -05:00 |
|
paboyle
|
5c0adf7bf2
|
Make clang happy with parenthesis
|
2017-02-16 23:51:33 +00:00 |
|
paboyle
|
be3a8249c6
|
Faster gather
|
2017-02-16 23:51:15 +00:00 |
|
paboyle
|
bd600702cf
|
Vectorise the XYZT face gathering better.
Hard coded for simd_layout <= 2 in any given spread out direction; full generality is inconsistent
with efficiency.
|
2017-02-15 11:11:04 +00:00 |
|
Guido Cossu
|
bafb101e4f
|
Testing different versions of the Laplacian
|
2017-02-13 15:38:11 +00:00 |
|
Guido Cossu
|
08fdf05528
|
Added and tested the covariant laplacian + CG solver
|
2017-02-13 15:05:01 +00:00 |
|
paboyle
|
aca7a3ef0a
|
Optimisation control improvements
|
2017-02-10 18:22:31 -05:00 |
|
Guido Cossu
|
c3d7ec65fa
|
All tests compile.
|
2017-02-10 10:27:51 +00:00 |
|
Guido Cossu
|
e0571c872b
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-02-09 16:12:00 +00:00 |
|
Guido Cossu
|
84687ccf1f
|
Handling an Intel compiler warning for Json class
|
2017-02-09 15:33:33 +00:00 |
|
Guido Cossu
|
3274561cf8
|
Cleanup
|
2017-02-09 15:18:38 +00:00 |
|
paboyle
|
2c246551d0
|
Overlap comms and compute options in wilson kernels
|
2017-02-07 01:37:10 -05:00 |
|
paboyle
|
71ac2e7940
|
Faster RNG init
|
2017-02-07 01:33:23 -05:00 |
|
paboyle
|
a48ee6f0f2
|
Don't use MPI3_leader any more. No real gain and complex
|
2017-02-07 01:31:24 -05:00 |
|
paboyle
|
73547cca66
|
MPI3 working, I think
|
2017-02-07 01:30:02 -05:00 |
|
paboyle
|
123c673db7
|
Policy to control async or sync SendRecv
|
2017-02-07 01:24:54 -05:00 |
|
paboyle
|
61f82216e2
|
Communicator Policy, NodeCount distinct from Rank count
|
2017-02-07 01:22:53 -05:00 |
|
paboyle
|
8e7ca92278
|
Debugged cshift case
|
2017-02-07 01:21:32 -05:00 |
|
paboyle
|
485ad6fde0
|
Stencil working in SHM MPI3
|
2017-02-07 01:20:39 -05:00 |
|
paboyle
|
6ea2184e18
|
OMP define change
|
2017-02-07 01:17:16 -05:00 |
|
paboyle
|
fdc170b8a3
|
Parallel fors in lattice transfer
|
2017-02-07 01:16:39 -05:00 |
|
paboyle
|
85c7bc4321
|
Bug fixes for cases that physics code couldn't hit but latent
and discovered on KNL (long vector, y SIMD dir) and checker dir set to y.
Remove the assertions on these code paths now they are tested.
|
2017-02-07 01:01:15 -05:00 |
|
paboyle
|
0883d6a7ce
|
Overlap comms compute support; make reg naming consistent with bgq asm
|
2017-02-07 00:59:32 -05:00 |
|
paboyle
|
b5e9c900a4
|
Better printing and signal handling options
|
2017-02-07 00:57:55 -05:00 |
|
paboyle
|
4bbdfb434c
|
Overlap comms compute modifications
|
2017-02-07 00:57:01 -05:00 |
|
Lanny91
|
b7cd1a19e3
|
Utilities for reading and writing "pair" objects.
|
2017-02-06 14:08:59 +00:00 |
|
Christopher Kelly
|
c94133af49
|
Added iteration reporting to CG and mixed CG
Added ability to manually change the initial CG inner tolerance in mixed CG
Added .hpp files to filelist script
|
2017-02-02 17:04:42 -05:00 |
|
|
e7d8030a64
|
operator>> for serialisable enums
|
2017-02-01 15:51:08 -08:00 |
|
|
d775fbb2f9
|
Gammas: code cleaning and gamma_L implementation & test
|
2017-02-01 15:45:05 -08:00 |
|
|
863855f46f
|
header fix
|
2017-02-01 11:59:44 -08:00 |
|
|
419af7610d
|
New gamma matrices tidying: generated code is confined to Gamma.* for readability
|
2017-02-01 11:23:12 -08:00 |
|
|
1140573027
|
Gamma adj fix: now in Grid namespace to avoid collisions
|
2017-01-30 10:53:04 -08:00 |
|
|
a0cfbb6e88
|
Merge branch 'feature/gammas' into feature/hadrons
# Conflicts:
# .gitignore
# lib/qcd/spin/Dirac.cc
# scripts/filelist
|
2017-01-30 09:10:49 -08:00 |
|
|
515a26b3c6
|
gammas: copyright update
|
2017-01-30 09:07:09 -08:00 |
|
Guido Cossu
|
16be6d378c
|
Action factory now supports different Fields (templated)
|
2017-01-30 14:22:41 +00:00 |
|
Guido Cossu
|
f05d0565aa
|
Adding ScalarField theory
|
2017-01-30 10:59:28 +00:00 |
|
Guido Cossu
|
899e685627
|
Merge branch 'feature/sitmo_rng' into develop
|
2017-01-27 14:15:56 +00:00 |
|
Guido Cossu
|
6929a84c70
|
Reformatting files
|
2017-01-27 11:54:44 +00:00 |
|
Guido Cossu
|
5c779a789b
|
Moving registrations in an independent file
|
2017-01-27 11:23:51 +00:00 |
|
|
fad743fbb1
|
Build system sanity check: corrected several headers not in the <Grid/*> format
|
2017-01-26 17:00:41 -08:00 |
|
Guido Cossu
|
e863a948e3
|
Cleaning up files and directories
|
2017-01-26 15:24:49 +00:00 |
|
Guido Cossu
|
7996f06335
|
Commented out registrations.
Move to an independent file that is linked only for the factory managed HMC
|
2017-01-25 18:27:45 +00:00 |
|
Guido Cossu
|
ef8d3831eb
|
Temporary patch for the threading error in InsertSlice and ExtractSlice
Find source and fix the error
|
2017-01-25 18:12:04 +00:00 |
|
Guido Cossu
|
70ed9fc40c
|
Updating the engine to the latest version
|
2017-01-25 18:10:41 +00:00 |
|
Guido Cossu
|
7b40a3e3e5
|
Reorganizing files
|
2017-01-25 18:09:46 +00:00 |
|
Guido Cossu
|
677757cfeb
|
Added and tested SITMO PRNG
|
2017-01-25 12:47:22 +00:00 |
|
Guido Cossu
|
f7fbbaaca3
|
Compiles after merging
|
2017-01-25 12:11:58 +00:00 |
|
Guido Cossu
|
17629b8d9e
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-01-25 11:33:53 +00:00 |
|