paboyle
|
7e5faa0f34
|
Multiple RNGs
|
2017-04-02 00:25:44 +09:00 |
|
paboyle
|
1c4bc7ed38
|
Debugged staggered conventions
|
2017-03-31 14:41:48 +09:00 |
|
paboyle
|
93ea5d9468
|
Pretty code
|
2017-03-30 15:00:03 +09:00 |
|
paboyle
|
9fd23faadf
|
Pretty layout
|
2017-03-30 13:44:45 +09:00 |
|
paboyle
|
10e4fa0dc8
|
Template instantiation improvements
|
2017-03-30 13:44:25 +09:00 |
|
paboyle
|
c4aca1dde4
|
Conjugate coefficients on adjoint
|
2017-03-30 13:44:05 +09:00 |
|
paboyle
|
b9e8ea3aaa
|
conjugate coefficient on the dagger
|
2017-03-30 13:43:13 +09:00 |
|
paboyle
|
077aa728b9
|
Fix the ZMobius (I think)
|
2017-03-30 13:42:09 +09:00 |
|
paboyle
|
a8d83d886e
|
Macro controls
|
2017-03-30 13:31:34 +09:00 |
|
paboyle
|
7fd46eeec4
|
Trailing whitespace removal
|
2017-03-30 13:31:10 +09:00 |
|
paboyle
|
2b115929dc
|
Small AVX512 asm ifdef patch
|
2017-03-29 18:51:23 +09:00 |
|
paboyle
|
417ec56cca
|
Release candidate
|
2017-03-29 05:45:33 -04:00 |
|
paboyle
|
756bc25008
|
Verbose header print by default
|
2017-03-29 04:44:17 -04:00 |
|
paboyle
|
35695ba57a
|
Bug fix in MPI3
|
2017-03-29 04:43:55 -04:00 |
|
paboyle
|
d805867e02
|
Better init
|
2017-03-28 13:25:05 -04:00 |
|
paboyle
|
98f9318279
|
Build on AVX2 and MPI passing with clang++
|
2017-03-28 23:16:04 +09:00 |
|
paboyle
|
4b17e8eba8
|
Merge branch 'develop' into feature/bgq-asm
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
|
2017-03-28 04:49:30 -04:00 |
|
paboyle
|
75112a632a
|
IO improvements to fail on IO error
|
2017-03-28 02:28:04 -04:00 |
|
paboyle
|
18bde08d1b
|
Merge branch 'feature/staggering' into develop
|
2017-03-28 15:25:55 +09:00 |
|
Guido Cossu
|
4c1ea8677e
|
Small cosmetic changes and vscode gitignore
|
2017-03-23 14:09:35 +09:00 |
|
paboyle
|
fc93f0b2ec
|
Save some code for static huge tlb's. It is ifdef'ed out but an interesting root only experiment.
No gain from it.
|
2017-03-21 22:30:29 -04:00 |
|
paboyle
|
8c8473998d
|
Average over whole cluster the comm time.
|
2017-03-21 22:29:51 -04:00 |
|
Guido Cossu
|
120fb59978
|
Adding tests for WilsonFlow classes
|
2017-03-21 16:11:35 +09:00 |
|
Guido Cossu
|
fd56b3ff38
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-03-21 13:33:41 +09:00 |
|
Guido Cossu
|
0ec6829edc
|
Fixing compilation errors for the WilsonFlow
|
2017-03-21 13:06:32 +09:00 |
|
Guido Cossu
|
18b7845b7b
|
Adding WilsonFlow smearing
|
2017-03-21 11:52:05 +09:00 |
|
Guido Cossu
|
3d0fe15374
|
Added topological charge measurement
|
2017-03-17 16:14:57 +09:00 |
|
Guido Cossu
|
91886068fe
|
Fixed seg fault for observable modules
|
2017-03-17 13:59:31 +09:00 |
|
Guido Cossu
|
6d1e9e5f92
|
Small cleanup of the observables
|
2017-03-17 11:42:55 +09:00 |
|
Guido Cossu
|
b640230b1e
|
Moving hmc observables in a different directory
|
2017-03-17 11:40:17 +09:00 |
|
paboyle
|
e7c36771ed
|
ZMobius prep for asm
|
2017-03-15 14:23:33 -04:00 |
|
Guido Cossu
|
038b6ee9cd
|
Fixing JSON compilation error
|
2017-03-16 01:09:24 +09:00 |
|
Guido Cossu
|
38806343a8
|
Improving efficiency of the force term
|
2017-03-15 15:16:16 +09:00 |
|
Guido Cossu
|
831ca4e3bf
|
Added Scalar action for fields in the adjoint representation
|
2017-03-14 14:55:18 +09:00 |
|
paboyle
|
8dc57a1e25
|
Layout change
|
2017-03-13 11:11:46 +00:00 |
|
paboyle
|
f57bd770b0
|
Merge branch 'bugfix/dminus' into feature/bgq-asm
|
2017-03-13 11:11:03 +00:00 |
|
paboyle
|
4ed10a3d06
|
Merge branch 'develop' into feature/bgq-asm
|
2017-03-13 11:10:10 +00:00 |
|
Chulwoo Jung
|
33edde245d
|
Changing Dminus(Dag) to use full vectors to work correctly
|
2017-03-12 23:02:42 -04:00 |
|
paboyle
|
447c5e6cd7
|
Z mobius hermiticity correction
|
2017-03-13 01:30:43 +00:00 |
|
paboyle
|
8b99d80d8c
|
Merge branch 'bgq-asm-shmemfixes' into feature/bgq-asm
|
2017-03-12 23:30:09 +00:00 |
|
Guido Cossu
|
b3dede4dd3
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-03-10 23:57:37 +09:00 |
|
Guido Cossu
|
4e34132f4d
|
Correcting modules use in test files
|
2017-03-10 23:54:53 +09:00 |
|
Guido Cossu
|
c07cb10247
|
Merge branch 'feature/hmc_generalise' of https://github.com/paboyle/Grid into feature/hmc_generalise
|
2017-03-10 22:37:25 +09:00 |
|
Guido Cossu
|
d7767a2a62
|
Few more tests
|
2017-03-10 22:33:48 +09:00 |
|
Guido Cossu
|
ec035983fd
|
Fixing the implicit integration
|
2017-03-01 11:56:35 +00:00 |
|
paboyle
|
af230a1fb8
|
Average the time across the whole machine for outliers
|
2017-02-28 17:05:22 -05:00 |
|
Christopher Kelly
|
06a132e3f9
|
Fixes to SHMEM comms
|
2017-02-28 13:31:54 -08:00 |
|
Guido Cossu
|
596dcd85b2
|
Auxiliary fields
|
2017-02-27 13:16:38 +00:00 |
|
paboyle
|
96d44d5c55
|
Header fix
|
2017-02-24 19:12:11 -05:00 |
|
Guido Cossu
|
7270c6a150
|
Integrator works now
|
2017-02-24 17:03:42 +00:00 |
|
Lanny91
|
7fe797daf8
|
SIMD vector length sanity checks
|
2017-02-23 16:49:44 +00:00 |
|
Lanny91
|
486a01294a
|
Corrected QPX SIMD width
|
2017-02-23 16:47:56 +00:00 |
|
paboyle
|
586a7c90b7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:26:59 +00:00 |
|
paboyle
|
e099dcdae7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:25:29 +00:00 |
|
paboyle
|
4e7ab3166f
|
Refactoring header layout
|
2017-02-22 18:09:33 +00:00 |
|
paboyle
|
aac80cbb44
|
Bug fix from Chris K
|
2017-02-22 12:19:09 -05:00 |
|
Lanny91
|
c80948411b
|
Added tRotate function and MaddRealPart struct for generic SIMD, bugfix in MultRealPart and minor cosmetic changes.
|
2017-02-22 14:57:10 +00:00 |
|
Lanny91
|
95625a7bd1
|
Use Grid Integer type
|
2017-02-22 13:09:32 +00:00 |
|
Lanny91
|
0796696733
|
Emulated integer vector type for QPX and generic SIMD instruction sets.
|
2017-02-22 12:01:36 +00:00 |
|
azusayamaguchi
|
1c30e9a961
|
Verified
|
2017-02-21 23:01:25 +00:00 |
|
Francesco Sanfilippo
|
93cc270016
|
making public same serializable parameters in HMC Module
RNGModuleParameters
GridModuleParameters
|
2017-02-21 23:11:56 +01:00 |
|
Francesco Sanfilippo
|
15e668eef1
|
now it is possible to pass {coords list} to a peek or poke
|
2017-02-21 22:48:38 +01:00 |
|
azusayamaguchi
|
bf7e3f20d4
|
Staggaered fermion optimised version
|
2017-02-21 14:35:42 +00:00 |
|
Guido Cossu
|
902afcfbaf
|
Adding metric and the implicit steps
|
2017-02-21 11:30:57 +00:00 |
|
paboyle
|
3ae92fa2e6
|
Global changes to parallel_for structure.
Move the comms flags to more sensible names
|
2017-02-21 05:24:27 -05:00 |
|
paboyle
|
3906cd2149
|
Stencil fix on BNL KNL system
|
2017-02-20 17:51:31 -05:00 |
|
paboyle
|
661fc4d3d1
|
Debug AVX512 exchange code paths
|
2017-02-20 17:48:36 -05:00 |
|
paboyle
|
41009cc142
|
Move excange into the stencil only; keep Cshift fully general
|
2017-02-20 17:48:04 -05:00 |
|
paboyle
|
37720c4db7
|
Count bytes off node only
|
2017-02-20 17:47:40 -05:00 |
|
Guido Cossu
|
97a6b61551
|
Covariant laplacian and implicit integration
|
2017-02-20 11:17:27 +00:00 |
|
paboyle
|
cd0da81196
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2017-02-16 18:52:30 -05:00 |
|
paboyle
|
f246fe3304
|
Improvements to avx for invertible to avoid latent bug
|
2017-02-16 23:52:44 +00:00 |
|
paboyle
|
8a29c16bde
|
Faster gather exchange
|
2017-02-16 23:52:22 +00:00 |
|
paboyle
|
d68907fc3e
|
Debug temp
|
2017-02-16 18:51:35 -05:00 |
|
paboyle
|
5c0adf7bf2
|
Make clang happy with parenthesis
|
2017-02-16 23:51:33 +00:00 |
|
paboyle
|
be3a8249c6
|
Faster gather
|
2017-02-16 23:51:15 +00:00 |
|
paboyle
|
bd600702cf
|
Vectorise the XYZT face gathering better.
Hard coded for simd_layout <= 2 in any given spread out direction; full generality is inconsistent
with efficiency.
|
2017-02-15 11:11:04 +00:00 |
|
Guido Cossu
|
bafb101e4f
|
Testing different versions of the Laplacian
|
2017-02-13 15:38:11 +00:00 |
|
Guido Cossu
|
08fdf05528
|
Added and tested the covariant laplacian + CG solver
|
2017-02-13 15:05:01 +00:00 |
|
paboyle
|
aca7a3ef0a
|
Optimisation control improvements
|
2017-02-10 18:22:31 -05:00 |
|
Guido Cossu
|
c3d7ec65fa
|
All tests compile.
|
2017-02-10 10:27:51 +00:00 |
|
Guido Cossu
|
e0571c872b
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-02-09 16:12:00 +00:00 |
|
Guido Cossu
|
84687ccf1f
|
Handling an Intel compiler warning for Json class
|
2017-02-09 15:33:33 +00:00 |
|
Guido Cossu
|
3274561cf8
|
Cleanup
|
2017-02-09 15:18:38 +00:00 |
|
paboyle
|
2c246551d0
|
Overlap comms and compute options in wilson kernels
|
2017-02-07 01:37:10 -05:00 |
|
paboyle
|
71ac2e7940
|
Faster RNG init
|
2017-02-07 01:33:23 -05:00 |
|
paboyle
|
a48ee6f0f2
|
Don't use MPI3_leader any more. No real gain and complex
|
2017-02-07 01:31:24 -05:00 |
|
paboyle
|
73547cca66
|
MPI3 working i think
|
2017-02-07 01:30:02 -05:00 |
|
paboyle
|
123c673db7
|
Policy to control async or sync SendRecv
|
2017-02-07 01:24:54 -05:00 |
|
paboyle
|
61f82216e2
|
Communicator Policy, NodeCount distinct from Rank count
|
2017-02-07 01:22:53 -05:00 |
|
paboyle
|
8e7ca92278
|
Debugged cshift case
|
2017-02-07 01:21:32 -05:00 |
|
paboyle
|
485ad6fde0
|
Stencil working in SHM MPI3
|
2017-02-07 01:20:39 -05:00 |
|
paboyle
|
6ea2184e18
|
OMP define change
|
2017-02-07 01:17:16 -05:00 |
|
paboyle
|
fdc170b8a3
|
Parallel fors in lattice transfer
|
2017-02-07 01:16:39 -05:00 |
|
paboyle
|
85c7bc4321
|
Bug fixes for cases that physics code couldn't hit but latent
and discovered on KNL (long vector, y SIMD dir) and checker dir set to y.
Remove the assertions on these code paths now they are tested.
|
2017-02-07 01:01:15 -05:00 |
|
paboyle
|
0883d6a7ce
|
Overlap comms compute support; make reg naming consistent with bgq aasm
|
2017-02-07 00:59:32 -05:00 |
|
paboyle
|
b5e9c900a4
|
Better printing and signal handling options
|
2017-02-07 00:57:55 -05:00 |
|
paboyle
|
4bbdfb434c
|
Overlap comms compute modifications
|
2017-02-07 00:57:01 -05:00 |
|
Lanny91
|
b7cd1a19e3
|
Utilities for reading and writing "pair" objects.
|
2017-02-06 14:08:59 +00:00 |
|
Christopher Kelly
|
c94133af49
|
Added iteration reporting to CG and mixed CG
Added ability to manually change the initial CG inner tolerance in mixed CG
Added .hpp files to filelist script
|
2017-02-02 17:04:42 -05:00 |
|
|
eedcaf6470
|
Merge branch 'feature/hadrons' into feature/qed-fvol
|
2017-02-01 15:53:10 -08:00 |
|
|
e7d8030a64
|
operator>> for serialisable enums
|
2017-02-01 15:51:08 -08:00 |
|
|
d775fbb2f9
|
Gammas: code cleaning and gamma_L implementation & test
|
2017-02-01 15:45:05 -08:00 |
|
|
863855f46f
|
header fix
|
2017-02-01 11:59:44 -08:00 |
|
|
419af7610d
|
New gamma matrices tidying: generated code is confined to Gamma.* for readability
|
2017-02-01 11:23:12 -08:00 |
|
|
1140573027
|
Gamma adj fix: now in Grid namespace to avoid collisions
|
2017-01-30 10:53:04 -08:00 |
|
|
a0cfbb6e88
|
Merge branch 'feature/gammas' into feature/hadrons
# Conflicts:
# .gitignore
# lib/qcd/spin/Dirac.cc
# scripts/filelist
|
2017-01-30 09:10:49 -08:00 |
|
|
515a26b3c6
|
gammas: copyright update
|
2017-01-30 09:07:09 -08:00 |
|
Guido Cossu
|
16be6d378c
|
Now action factory support different Fields (templated)
|
2017-01-30 14:22:41 +00:00 |
|
Guido Cossu
|
f05d0565aa
|
Adding ScalarField theory
|
2017-01-30 10:59:28 +00:00 |
|
|
28d99b5297
|
Merge branch 'develop' into feature/qed-fvol
|
2017-01-27 16:59:53 -08:00 |
|
Guido Cossu
|
899e685627
|
Merge branch 'feature/sitmo_rng' into develop
|
2017-01-27 14:15:56 +00:00 |
|
Guido Cossu
|
6929a84c70
|
Reformatting files
|
2017-01-27 11:54:44 +00:00 |
|
Guido Cossu
|
5c779a789b
|
Moving registrations in an independent file
|
2017-01-27 11:23:51 +00:00 |
|
|
fad743fbb1
|
Build system sanity check: corrected several headers not in the <Grid/*> format
|
2017-01-26 17:00:41 -08:00 |
|
Guido Cossu
|
e863a948e3
|
Cleaning up files and directories
|
2017-01-26 15:24:49 +00:00 |
|
Guido Cossu
|
7996f06335
|
Commented out registrations.
Move to an independent file that is linked only for the factory managed HMC
|
2017-01-25 18:27:45 +00:00 |
|
Guido Cossu
|
ef8d3831eb
|
Temporary patch the threading error in InsertSlice and ExtractSlice
Find source and fix the error
|
2017-01-25 18:12:04 +00:00 |
|
Guido Cossu
|
70ed9fc40c
|
Updating the engine to the last version
|
2017-01-25 18:10:41 +00:00 |
|
Guido Cossu
|
7b40a3e3e5
|
Reorganizing files
|
2017-01-25 18:09:46 +00:00 |
|
Guido Cossu
|
677757cfeb
|
Added and tested SITMO PRNG
|
2017-01-25 12:47:22 +00:00 |
|
Guido Cossu
|
f7fbbaaca3
|
Compiles after merging
|
2017-01-25 12:11:58 +00:00 |
|
Guido Cossu
|
17629b8d9e
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-01-25 11:33:53 +00:00 |
|
Guido Cossu
|
0baa20d292
|
Againg fixing compilation on Travis, no LIME lib present
|
2017-01-25 11:18:44 +00:00 |
|
Guido Cossu
|
4571c918a4
|
Fixing compilation error when compiling without LIME
|
2017-01-25 11:14:43 +00:00 |
|
Guido Cossu
|
5251ea4d30
|
Adding more fermion action modules, generalised DWF
|
2017-01-25 11:10:44 +00:00 |
|
|
05cb6d318a
|
gammas: adjoint implemented as a symbolic operation
|
2017-01-24 18:07:43 -08:00 |
|
|
0432e30256
|
Gamma right multiply code fix (now passes consistency check)
|
2017-01-24 17:36:23 -08:00 |
|
|
f7db342f49
|
Serialisable enums can be converted to int
|
2017-01-24 17:33:26 -08:00 |
|
Guido Cossu
|
7f456b4173
|
👷 Added all pseudofermion actions to the serialiser
|
2017-01-24 13:57:32 +00:00 |
|
|
a37e71f362
|
New automatic implementation of gamma matrices, Meson and SeqGamma are broken
|
2017-01-23 19:13:43 -08:00 |
|
Guido Cossu
|
244f8fb6dc
|
Added JSON parser (without NextElement)
|
2017-01-23 14:57:38 +00:00 |
|
|
37988221a8
|
Merge branch 'feature/serialisation-hdf5' into feature/qed-fvol
|
2017-01-20 14:04:20 -08:00 |
|
|
4c75095c61
|
HDF5: header fix
|
2017-01-20 12:14:01 -08:00 |
|
|
afa095d33d
|
HDF5: better complex number support
|
2017-01-20 12:10:41 -08:00 |
|
|
6b5259cc10
|
HDF5 detects if a name is a dataset or not without using exception catching
|
2017-01-20 11:03:19 -08:00 |
|
Guido Cossu
|
27dfe816fa
|
Added TwoFlavorsEO
Had to remove a conformability check in the Derivative of SchurDiff,
see the comments in the file
|
2017-01-20 16:59:31 +00:00 |
|
Guido Cossu
|
f96fac0aee
|
All functionalities ready.
Todo: add all the fermion action modules
|
2017-01-20 12:56:20 +00:00 |
|
|
7423a352c5
|
HDF5: typos
|
2017-01-19 18:33:04 -08:00 |
|
|
81e66d6631
|
HDF5: revert back to native types
|
2017-01-19 18:24:53 -08:00 |
|
|
ade1058e5f
|
Hdf5Type does not need to be a pointer anymore
|
2017-01-19 18:23:55 -08:00 |
|
|
6eea9e4da7
|
HDF5 types static initialisation is mysteriously buggy on BG/Q, changing strategy
|
2017-01-19 18:02:53 -08:00 |
|
|
2c673666da
|
Standardisation of HDF5 types
|
2017-01-19 17:19:12 -08:00 |
|
|
7a327a3f28
|
Merge branch 'develop' into feature/qed-fvol
|
2017-01-19 14:22:36 -08:00 |
|
Guido Cossu
|
851f2ad8ef
|
Adding fermions actions support in the factories
|
2017-01-19 10:00:02 +00:00 |
|
|
5405526424
|
Code typo
|
2017-01-18 22:42:19 -08:00 |
|
|
654e0b0fd0
|
Serialisable object are now comparable with ==
|
2017-01-18 17:40:32 -08:00 |
|
|
4be08ebccc
|
debug code cleaning
|
2017-01-18 17:39:59 -08:00 |
|
|
f599cb5b17
|
HDF5 serial IO implemented and tested
|
2017-01-18 16:50:21 -08:00 |
|
Guido Cossu
|
23e0561dd6
|
Added all required functionalities, time for cleaning
All actions to be added
|
2017-01-18 16:31:51 +00:00 |
|
|
5803933aea
|
First implementation of HDF5 serial IO writer, reader is still empty
|
2017-01-17 16:21:18 -08:00 |
|
Guido Cossu
|
924130833e
|
Moved more parameters to serialization
|
2017-01-17 13:22:18 +00:00 |
|
Guido Cossu
|
0157274762
|
HMC factories
|
2017-01-17 10:46:49 +00:00 |
|
Guido Cossu
|
87e8aad5a0
|
Added support for input file HMC modules (missing the actions yet)
|
2017-01-16 16:07:12 +00:00 |
|
Guido Cossu
|
c6f59c2933
|
Adding factories
|
2017-01-16 10:18:09 +00:00 |
|
|
91a3534054
|
Lattice slice utilities now thread safe
|
2017-01-16 06:32:25 +00:00 |
|
Guido Cossu
|
0dfda4bb90
|
Working on the RNGModule
|
2017-01-09 11:06:18 +00:00 |
|
Guido Cossu
|
1189ebc8b5
|
Cleaning up the checkpointers interface
|
2017-01-05 15:52:52 +00:00 |
|
|
82b3f54697
|
scalar free propagator fix
|
2017-01-05 14:58:07 +00:00 |
|
Guido Cossu
|
1bb8578173
|
Added module for checkpointers
|
2017-01-05 13:09:32 +00:00 |
|
Peter Boyle
|
c3b6d573b9
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2016-12-30 22:42:17 +00:00 |
|
|
afbf7d4c37
|
QED Gimpl moved in Photon.h
|
2016-12-29 22:43:38 +01:00 |
|
|
8c3cc32364
|
Scalar action
|
2016-12-29 22:42:58 +01:00 |
|
Peter Boyle
|
1e179c903d
|
Worried about integer; suspect where statements are broken
|
2016-12-27 17:46:38 +00:00 |
|
Peter Boyle
|
669cfca9b7
|
No inline
|
2016-12-27 17:45:40 +00:00 |
|
Peter Boyle
|
ff2f559a57
|
Remove inline on gather optimised path
|
2016-12-27 17:45:19 +00:00 |
|
Peter Boyle
|
03c81bd902
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2016-12-27 11:25:35 +00:00 |
|
Peter Boyle
|
a869addef1
|
Stats switch off
|
2016-12-27 11:25:22 +00:00 |
|
Peter Boyle
|
1caa3fbc2d
|
LOCK UNLOCK only
|
2016-12-27 11:24:45 +00:00 |
|
Peter Boyle
|
3d21297bbb
|
Call the fast path compressor for wilson kernels to avoid if else on projector
|
2016-12-27 11:23:13 +00:00 |
|
Peter Boyle
|
25efefc5b4
|
Back to original thread policy post test
|
2016-12-23 09:49:04 +00:00 |
|
Peter Boyle
|
eabf316ed9
|
BGQ performance ASM
|
2016-12-22 21:56:08 +00:00 |
|
Peter Boyle
|
04ae7929a3
|
BGQ or KNL assembler now
|
2016-12-22 17:53:22 +00:00 |
|
Peter Boyle
|
caba0d42a5
|
L1p controls
|
2016-12-22 17:52:55 +00:00 |
|
Peter Boyle
|
9ae81c06d2
|
L1p controls for BG/Q
|
2016-12-22 17:52:21 +00:00 |
|
Peter Boyle
|
7dc36628a1
|
QPX finishing
|
2016-12-22 17:50:48 +00:00 |
|
Peter Boyle
|
b8cdb3e90a
|
Debug hack; raises from 62GF/s to 72 GF/s per node on BG/Q
|
2016-12-22 17:50:14 +00:00 |
|
Peter Boyle
|
5241245534
|
Default to static scheduling
|
2016-12-22 17:49:21 +00:00 |
|
Dr Peter Boyle
|
960316e207
|
type conversion in printf
|
2016-12-22 17:27:01 +00:00 |
|
Guido Cossu
|
5214846341
|
Adding a resource manager
|
2016-12-22 12:41:56 +00:00 |
|
|
17b3a10d46
|
stochastic QED: function to cache 1/sqrt(khat^2)
|
2016-12-22 00:29:19 +01:00 |
|
Guido Cossu
|
ce1a115e0b
|
Removing redundant arguments for integrator functions, step 1
|
2016-12-20 17:51:30 +00:00 |
|
|
9ac3ac41df
|
serialisable Photon parameters
|
2016-12-20 12:41:01 +01:00 |
|
|
6f1ea96293
|
Merge branch 'develop' into feature/qed-fvol
|
2016-12-20 12:33:02 +01:00 |
|
|
f8d11ff673
|
better serialisable enums (can be encapsulated into classes)
|
2016-12-20 12:31:49 +01:00 |
|
paboyle
|
3f2d53a994
|
BGQ assembler beginning
|
2016-12-20 10:21:26 +00:00 |
|
paboyle
|
a59f5374d7
|
Evade warning
|
2016-12-18 02:23:55 +00:00 |
|
paboyle
|
4b220972ac
|
Warning fix
|
2016-12-18 02:14:17 +00:00 |
|
paboyle
|
629f43e36c
|
Return statement needed
|
2016-12-18 02:09:37 +00:00 |
|
paboyle
|
a3172b3455
|
Precision error
|
2016-12-18 02:07:45 +00:00 |
|
paboyle
|
3e6945cd65
|
Fixing AVX Z-mobius
|
2016-12-18 02:05:11 +00:00 |
|
paboyle
|
87be03006a
|
AVX 512 code broke other compiles; fixing
|
2016-12-18 01:45:09 +00:00 |
|
paboyle
|
f17436fec2
|
Bad commit fixed
|
2016-12-18 01:27:34 +00:00 |
|
Peter Boyle
|
4d8b01b7ed
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2016-12-18 00:56:57 +00:00 |
|
Peter Boyle
|
fa6acccf55
|
Zmobius asm
|
2016-12-18 00:56:19 +00:00 |
|
azusayamaguchi
|
df9108154d
|
Debugged 2 versions of assembler; ls vectorised, xyzt vectorised
|
2016-12-17 23:47:51 +00:00 |
|
azusayamaguchi
|
b3e7f600da
|
Partial implementation of 4d vectorisation assembler
|
2016-12-16 23:50:30 +00:00 |
|
azusayamaguchi
|
d4071daf2a
|
Template specialise
|
2016-12-16 22:28:29 +00:00 |
|
azusayamaguchi
|
a2a6329094
|
AVX512 only for ASM compilation
|
2016-12-16 22:03:29 +00:00 |
|
azusayamaguchi
|
eabc577940
|
Assembler possibly working
|
2016-12-16 16:55:36 +00:00 |
|
|
2e3c5890b6
|
qed-fvol: build fix
|
2016-12-15 20:06:46 +00:00 |
|
|
bc6678732f
|
Merge branch 'feature/hadrons' into feature/qed-fvol
# Conflicts:
# Makefile.am
# configure.ac
# lib/qcd/action/gauge/Photon.h
|
2016-12-15 19:53:00 +00:00 |
|
|
91e98b1dd5
|
Merge branch 'feature/hadrons' into develop
|
2016-12-15 18:15:56 +00:00 |
|
|
b791c274b0
|
Revert "AVX: uninitialised variable fix"
This reverts commit c22c3db9ad .
|
2016-12-15 18:15:35 +00:00 |
|
Guido Cossu
|
0bd296dda4
|
Adding check of the Dag part in the benchmark
|
2016-12-14 03:15:09 +00:00 |
|
|
c22c3db9ad
|
AVX: uninitialised variable fix
|
2016-12-13 19:05:58 +00:00 |
|
Guido Cossu
|
2fb92dbc6e
|
Cleaning up previous debug lines
|
2016-12-13 07:53:43 +00:00 |
|
Guido Cossu
|
5c74b6028b
|
Commit for debugging, lot of IO
|
2016-12-13 06:35:30 +00:00 |
|
Guido Cossu
|
ef72f322d2
|
consistency of tests
|
2016-12-13 02:24:20 +00:00 |
|
Azusa Yamaguchi
|
426197e446
|
Nc=3
|
2016-12-12 09:10:54 +00:00 |
|
Azusa Yamaguchi
|
99e2c1e666
|
Kernels options
|
2016-12-12 09:08:53 +00:00 |
|
Azusa Yamaguchi
|
1440565a10
|
Decrease verbosity
|
2016-12-12 09:08:04 +00:00 |
|
Azusa Yamaguchi
|
e9f0c0ea39
|
Staggered kernels options
|
2016-12-12 09:07:38 +00:00 |
|
Peter Boyle
|
fe187e9ed3
|
Compiles and passes under ZMobius with assembler
|
2016-12-10 00:47:48 +00:00 |
|
Peter Boyle
|
0091b50f49
|
Zmobius working -- not asm yet
|
2016-12-09 22:51:32 +00:00 |
|
Peter Boyle
|
fb8d4b2357
|
Lots of debug on performance Mobius
|
2016-12-08 17:28:28 +00:00 |
|
Peter Boyle
|
83fa038bdf
|
Streaming stores
|
2016-12-08 16:58:42 +00:00 |
|
Peter Boyle
|
7a61feb6d3
|
Allocator added with caching for Linux VM subsystem optimisation
|
2016-12-08 16:58:01 +00:00 |
|
Peter Boyle
|
69ae817d1c
|
Updates for supporting Mobius better
|
2016-12-08 16:43:28 +00:00 |
|
Guido Cossu
|
2bd4233919
|
Completed testing of the HMC for Ls vectorised version (on AVX2)
|
2016-12-07 04:56:37 +00:00 |
|
Guido Cossu
|
143c70e29f
|
Debugged the threaded version. Cleaning up
|
2016-12-07 04:40:25 +00:00 |
|
|
51322da6f8
|
Hadrons: genetic scheduler improvement
|
2016-12-07 09:00:45 +09:00 |
|
|
c56707e003
|
useless debug message removed
|
2016-12-07 08:59:20 +09:00 |
|
Guido Cossu
|
b812d5e39c
|
Added single threaded version of the derivative for the Ls vectorised DWF
|
2016-12-06 16:31:13 +00:00 |
|
Guido Cossu
|
01480da0a8
|
Merge branch 'develop' into feature/hmc_generalise
|
2016-12-05 05:10:27 +00:00 |
|
Peter Boyle
|
e27c6b217c
|
Updating
|
2016-12-01 12:42:53 +00:00 |
|
|
9ad3d3453e
|
Hadrons is now a library, the previous XML driven program is now a test
|
2016-12-01 21:36:29 +09:00 |
|
paboyle
|
6adf35da54
|
Faster Mobius
|
2016-12-01 11:39:04 +00:00 |
|
paboyle
|
bd0430b34f
|
Serialisation in malloc fixed
|
2016-11-29 22:27:55 +00:00 |
|
Azusa Yamaguchi
|
c097fd041a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-29 13:44:17 +00:00 |
|
Azusa Yamaguchi
|
77fb25fb29
|
Push 5d tests
|
2016-11-29 13:43:56 +00:00 |
|
Azusa Yamaguchi
|
389e0a77bd
|
Staggerd Fermion 5D
|
2016-11-29 13:13:56 +00:00 |
|
paboyle
|
4704f2d009
|
Actions updated
|
2016-11-29 00:14:36 +00:00 |
|
Guido Cossu
|
ae9688e343
|
Reporting also the total mflops
|
2016-11-28 11:37:02 +00:00 |
|
|
43928846f2
|
first steps to make Hadrons a library
|
2016-11-28 16:02:15 +09:00 |
|
|
fabcd4179d
|
Hadrons: propagator type coming from the fermion implementation
|
2016-11-28 14:02:10 +09:00 |
|
|
a8843c9af6
|
Code cleaning, the fermion implementation can be sepcified using the macro FIMPL
|
2016-11-27 16:47:22 +09:00 |
|
|
7a1a7a685e
|
Merge branch 'feature/fft-opt' into feature/hadrons
|
2016-11-27 15:32:03 +09:00 |
|
Lanny91
|
b18950f776
|
Added simd real divide test with QPX divide fixes
|
2016-11-25 13:21:33 +00:00 |
|
Lanny91
|
0acbf77bc6
|
Add QPX Div structure
|
2016-11-24 13:24:12 +00:00 |
|
|
5833f247fa
|
more FFt optimisations
|
2016-11-24 09:09:48 +09:00 |
|
Azusa Yamaguchi
|
95f43d27ae
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-22 13:49:22 +00:00 |
|
Azusa Yamaguchi
|
668ca57702
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-22 13:49:11 +00:00 |
|
|
a2cffb0304
|
AVXFMA target fixed
|
2016-11-21 17:47:18 +01:00 |
|
|
97cddda49e
|
Merge branch 'feature/gen-simd' into feature/doxygen
# Conflicts:
# Makefile.am
# configure.ac
|
2016-11-19 13:11:13 +01:00 |
|
|
b873504b90
|
fully generic SIMD
|
2016-11-19 01:32:39 +01:00 |
|
Guido Cossu
|
62749d05a6
|
Naming the scalar action
|
2016-11-17 12:26:20 +00:00 |
|
Guido Cossu
|
3834feb4b7
|
Adding action names
|
2016-11-16 16:46:49 +00:00 |
|
James Harrison
|
6b8ee7bae0
|
Merge branch 'feature/feynman-rules' into feature/qed-fvol
|
2016-11-15 13:08:08 +00:00 |
|
James Harrison
|
739c2308b5
|
Set imaginary part of stochastic QED field to zero using real() instead of conjugate().
|
2016-11-15 13:07:52 +00:00 |
|
|
042ae5b87c
|
generic 256bits SIMD
|
2016-11-15 12:16:15 +00:00 |
|
James Harrison
|
d49e502f53
|
Merge branch 'feature/feynman-rules' into feature/qed-fvol
|
2016-11-14 18:00:33 +00:00 |
|
James Harrison
|
92ec3404f8
|
Set imaginary part of stochastic QED field to zero after FFT into position space
|
2016-11-14 17:59:02 +00:00 |
|
Guido Cossu
|
a783282b8b
|
Merge branch 'develop' into feature/hmc_generalise
|
2016-11-10 18:13:07 +00:00 |
|
paboyle
|
604f0ea2f6
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-09 04:13:01 -08:00 |
|
paboyle
|
33dc1f51b5
|
Final sign off commits from Cori-1
|
2016-11-09 04:11:03 -08:00 |
|
James Harrison
|
c30d96ea50
|
QedFVol: x86intrin.h namespace fix
|
2016-11-09 11:06:20 +00:00 |
|
|
13a8997789
|
Merge branch 'release/v0.6.0' into feature/hadrons
# Conflicts:
# Makefile.am
|
2016-11-08 20:43:39 +00:00 |
|
|
9576f0903d
|
namespace fix
|
2016-11-08 19:07:47 +00:00 |
|
|
8a5e3a917c
|
Merge branch 'develop' into release/v0.6.0
# Conflicts:
# tests/core/Test_fft_gfix.cc
|
2016-11-08 16:53:42 +00:00 |
|
|
3d2a22a14d
|
include fix for MKL
|
2016-11-08 15:31:47 +00:00 |
|
azusayamaguchi
|
f85b35314d
|
Fix a routine for single node processor coor from rank
|
2016-11-08 11:49:13 +00:00 |
|
azusayamaguchi
|
0cff8754d1
|
Usecs
|
2016-11-08 11:35:41 +00:00 |
|
azusayamaguchi
|
692b44dac1
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-04 22:48:11 +00:00 |
|
azusayamaguchi
|
96ba42a297
|
omm buf
|
2016-11-04 22:47:25 +00:00 |
|
azusayamaguchi
|
f7b60004f3
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-04 16:08:07 +00:00 |
|
|
ad971ca07b
|
fftw3.h is now expected to be an external header
|
2016-11-04 13:12:35 +00:00 |
|
|
f2f16eb972
|
fftw3.h removed, please don't commit this file back
|
2016-11-04 13:11:05 +00:00 |
|
azusayamaguchi
|
b7d55f7dfb
|
Fix a typo in reorg of the --dslash-asm
|
2016-11-04 11:35:08 +00:00 |
|
azusayamaguchi
|
6e548a8ad5
|
Linux compile needed
|
2016-11-04 11:34:16 +00:00 |
|
Azusa Yamaguchi
|
ee686a7d85
|
Compiles now
|
2016-11-03 16:58:23 +00:00 |
|
Azusa Yamaguchi
|
1c5b7a6be5
|
Staggered phases first cut, c1, c2, u0
|
2016-11-03 16:26:56 +00:00 |
|
|
a5dd4a9bab
|
Merge branch 'feature/fft-opt' into develop
|
2016-11-03 14:34:46 +00:00 |
|
|
ec232af851
|
Photon.h references removed
|
2016-11-03 14:34:16 +00:00 |
|
|
17e30281e9
|
Merge branch 'develop' into feature/fft-opt
# Conflicts:
# lib/FFT.h
|
2016-11-03 14:14:03 +00:00 |
|
|
aee44dc694
|
Photon.h removed from develop branch
|
2016-11-03 13:54:15 +00:00 |
|
|
75bbf6a0af
|
Merge branch 'develop' into feature/feynman-rules
|
2016-11-03 13:52:11 +00:00 |
|
paboyle
|
111bfbc6bc
|
notimestamp by default
|
2016-11-03 11:40:26 +00:00 |
|
paboyle
|
f41a230b32
|
Decrease mpi3l verbose
|
2016-11-02 19:54:03 +00:00 |
|
paboyle
|
c067051d5f
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-02 13:59:18 +00:00 |
|
paboyle
|
9e2ec2719b
|
Merge branch 'develop' into feature/mpi3-master-slave
|
2016-11-02 13:02:56 +00:00 |
|
paboyle
|
757a928f9a
|
Improvement to use own SHM_OPEN call to avoid openmpi bug.
|
2016-11-02 12:37:46 +00:00 |
|
Guido Cossu
|
bc248b6948
|
Merge branch 'release/v0.6.0' into feature/KNL_double_prec
Conflicts:
lib/simd/Grid_avx512.h
|
2016-11-02 10:40:49 +00:00 |
|
Guido Cossu
|
ae8561892e
|
Eliminating useless defines
|
2016-11-02 10:21:06 +00:00 |
|
paboyle
|
32375aca65
|
Semaphore sleep/wake up on remote processes.
|
2016-11-02 09:27:20 +00:00 |
|
paboyle
|
bb94ddd0eb
|
Tidy up of mpi3; also some cleaning of the dslash controls.
|
2016-11-02 08:07:09 +00:00 |
|
James Harrison
|
7f0fc0eff5
|
Remove explicit use of double-precision types in photon.h
|
2016-11-01 16:02:35 +00:00 |
|
Azusa Yamaguchi
|
164d3691db
|
Staggered
|
2016-11-01 14:24:22 +00:00 |
|
paboyle
|
791cb050c8
|
Comms improvements
|
2016-11-01 11:35:43 +00:00 |
|
|
d5e95bc350
|
Merge branch 'release/v0.6.0' into feature/feynman-rules
|
2016-10-31 18:36:21 +00:00 |
|
|
7a84906b5f
|
Merge branch 'release/v0.6.0' into feature/fft-opt
|
2016-10-31 18:31:49 +00:00 |
|
|
66d832c733
|
FFTW header fix
|
2016-10-31 16:39:29 +00:00 |
|
|
e74417ca12
|
big build system polish
|
2016-10-31 16:31:27 +00:00 |
|
Guido Cossu
|
e8c3174ae2
|
Small change in the defines
|
2016-10-30 12:23:11 +00:00 |
|
Guido Cossu
|
9b066e94d0
|
Compilation with both single and double precision
|
2016-10-30 12:04:06 +00:00 |
|
James Harrison
|
618abdf302
|
Add missing volume factor in stochastic QED field
|
2016-10-29 11:04:02 +01:00 |
|
Guido Cossu
|
e1042aef77
|
First version of the doube prec for testing purposes
It does not compile single and double version at the same time
|
2016-10-28 17:20:04 +01:00 |
|
paboyle
|
aa6a839c60
|
avx512 build fix; detect clang/gcc intrinsics vs. ICPC
|
2016-10-28 09:13:09 +01:00 |
|
|
b4d2af8c89
|
threaded FFT
|
2016-10-26 19:46:36 +01:00 |
|
|
434af6aeaa
|
Merge branch 'develop' into feature/fft-opt
|
2016-10-26 18:50:38 +01:00 |
|
|
e90f8ac841
|
Merge branch 'develop' into feature/feynman-rules
|
2016-10-26 18:50:21 +01:00 |
|
|
a1705a8d53
|
debug message removed
|
2016-10-26 18:50:07 +01:00 |
|
|
ca21003f01
|
Merge branch 'feature/fft-opt' into feature/feynman-rules
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
|
2016-10-26 18:44:47 +01:00 |
|
|
14ddf2c234
|
more FFT optimisations
|
2016-10-26 17:36:26 +01:00 |
|
Guido Cossu
|
1d666771f9
|
Debugging the RNG, eliminate the barrier after broadcast
|
2016-10-26 16:08:23 +01:00 |
|
Guido Cossu
|
d50055cd96
|
Making the ILDG support optional
|
2016-10-26 09:48:01 +01:00 |
|
Azusa Yamaguchi
|
bca861e112
|
Note:FFT shoud be GridFFT (Not change yet).
Gauge fix with FFt is added (tests/core)
|
2016-10-25 14:21:48 +01:00 |
|
|
33d199a0ad
|
temporary thread safety in FFT
|
2016-10-25 12:56:40 +01:00 |
|
paboyle
|
b820076b91
|
Merge branch 'develop' into feature/mpi3
|
2016-10-25 06:02:33 +01:00 |
|
paboyle
|
09f66100d3
|
MPI 3 compile on non-linux
|
2016-10-25 06:01:12 +01:00 |
|
azusayamaguchi
|
d7d92af09d
|
Travis fail fix attempt
|
2016-10-25 01:45:53 +01:00 |
|
azusayamaguchi
|
460d0753a1
|
Merge branch 'develop' into feature/mpi3
Conflicts:
lib/simd/Grid_avx512.h
|
2016-10-25 01:08:51 +01:00 |
|
azusayamaguchi
|
8f8058f8a5
|
More random bits on parallel seeding
|
2016-10-25 01:05:52 +01:00 |
|
azusayamaguchi
|
d97a27f483
|
Verbose
|
2016-10-25 01:05:31 +01:00 |
|
azusayamaguchi
|
7c3363b91e
|
Compiles all comms targets
|
2016-10-25 00:04:17 +01:00 |
|
azusayamaguchi
|
b94478fa51
|
mpi, mpi3, shmem all compile.
mpi, mpi3 pass single node multi-rank
|
2016-10-24 23:45:31 +01:00 |
|
Guido Cossu
|
47c7159177
|
ILDG reader/writer works
Fill the xml header with the required information, todo.
|
2016-10-24 21:57:54 +01:00 |
|
|
13bf0482e3
|
FFT optimisation
|
2016-10-24 19:25:40 +01:00 |
|
|
a795b5705e
|
memory optimisation
|
2016-10-24 19:25:15 +01:00 |
|
|
392e064513
|
fast local peek-poke
|
2016-10-24 19:24:21 +01:00 |
|
azusayamaguchi
|
b6a65059a2
|
Update to use shared memory to contain the stencil comms buffers
Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions
|
2016-10-24 17:30:43 +01:00 |
|
Guido Cossu
|
f415db583a
|
Adding ILDG format
|
2016-10-24 15:48:22 +01:00 |
|
Guido Cossu
|
f55c16f984
|
Adding a barrier in the RNG save
|
2016-10-24 11:02:14 +01:00 |
|
azusayamaguchi
|
ea25a4d9ac
|
Works
|
2016-10-23 06:10:05 +01:00 |
|
azusayamaguchi
|
c190221fd3
|
Internal SHM comms in non-simd directions working
Need to fix simd directions
|
2016-10-22 18:14:27 +01:00 |
|
Guido Cossu
|
df67e013ca
|
More debug output for the RNG
|
2016-10-22 13:34:17 +01:00 |
|
Guido Cossu
|
3e990c9d0a
|
Reverting the broadcast change
|
2016-10-22 13:26:43 +01:00 |
|
Guido Cossu
|
4b740fc8fd
|
Debugging the RNG state save
|
2016-10-22 13:06:00 +01:00 |
|
azusayamaguchi
|
0fcd2e7188
|
Simplify the comms structure prior to implementing Shared memory direct bouncs
|
2016-10-21 22:44:10 +01:00 |
|
azusayamaguchi
|
910b8dd6a1
|
use simd type
|
2016-10-21 22:35:29 +01:00 |
|
azusayamaguchi
|
75ebd3a0d1
|
Typo fixes and rotate for CLANG
|
2016-10-21 22:34:29 +01:00 |
|
Guido Cossu
|
cccd14b09e
|
Small cleanup
|
2016-10-21 17:20:54 +01:00 |
|
Guido Cossu
|
e6acffdfc2
|
Fixing the plaquette computation
|
2016-10-21 16:06:34 +01:00 |
|
|
7c8f79b147
|
more stochastic QED fixes
|
2016-10-21 15:20:12 +01:00 |
|
azusayamaguchi
|
09fd5c43a7
|
Reasonably fast version
|
2016-10-21 15:17:39 +01:00 |
|
|
462921e549
|
QED: fix stochastic field
|
2016-10-21 14:41:08 +01:00 |
|
Guido Cossu
|
392130a537
|
Working on the 5d
|
2016-10-21 14:22:25 +01:00 |
|
azusayamaguchi
|
f22317748f
|
Merge branch 'feature/mpi3' of https://github.com/paboyle/Grid into feature/mpi3
|
2016-10-21 13:36:35 +01:00 |
|
azusayamaguchi
|
6a9eae6b6b
|
Reporting improvements
|
2016-10-21 13:36:18 +01:00 |
|
azusayamaguchi
|
fad96cf250
|
StencilBufs
|
2016-10-21 13:36:00 +01:00 |
|
azusayamaguchi
|
f331809c27
|
Use variable type for loop
|
2016-10-21 13:35:37 +01:00 |
|
|
bd6a228af6
|
Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules
|
2016-10-21 13:10:30 +01:00 |
|
|
63d219498b
|
first (dirty) implementation of Feynman stoctachtic EM field
|
2016-10-21 13:10:13 +01:00 |
|
paboyle
|
2c54a53d0a
|
Compile verbose reduce
|
2016-10-21 12:12:14 +01:00 |
|
paboyle
|
306160ad9a
|
bcopy threaded
|
2016-10-21 12:07:28 +01:00 |
|
azusayamaguchi
|
20a091c3ed
|
Intel vs. Clang intrinsics differences absorbed
|
2016-10-21 09:08:36 +01:00 |
|
azusayamaguchi
|
202078eb1b
|
Cray / OpenSHMEM ordering differs
|
2016-10-21 09:07:20 +01:00 |
|
paboyle
|
a762b1fb71
|
MPI3 working with a bounce through shared memory on my laptop.
Longer term plan: make the "u_comm_buf" in Stencil point to the shared region and avoid the
send between ranks on same node.
|
2016-10-21 09:03:26 +01:00 |
|
Guido Cossu
|
deef2673b2
|
Separating the Lattice theories stub from the QCD.h file
|
2016-10-20 17:24:08 +01:00 |
|
paboyle
|
5b5925b8e5
|
Forgot to add
|
2016-10-20 17:09:40 +01:00 |
|
Guido Cossu
|
977b0a6dd9
|
Merge branch 'develop' into feature/hmc_generalise
|
2016-10-20 17:04:41 +01:00 |
|
Guido Cossu
|
977d844394
|
Few modifications on stdout messages
|
2016-10-20 17:01:59 +01:00 |
|
paboyle
|
b58adc6a4b
|
commVector
|
2016-10-20 17:00:15 +01:00 |
|
paboyle
|
f9d5e95d72
|
allocator template typedefs moved to AlignedAllocator
|
2016-10-20 16:59:39 +01:00 |
|
paboyle
|
4f8e636a43
|
commVector
|
2016-10-20 16:59:16 +01:00 |
|
paboyle
|
9b39f35ae6
|
commVector different for SHMEM compat
|
2016-10-20 16:58:53 +01:00 |
|
paboyle
|
5fe2b85cbd
|
MPI3 and shared memory support
|
2016-10-20 16:58:01 +01:00 |
|
paboyle
|
c7cccaaa69
|
Comm vector for shmem
|
2016-10-20 16:57:31 +01:00 |
|
paboyle
|
cbcfea466f
|
MPI3
|
2016-10-20 16:57:14 +01:00 |
|
paboyle
|
4955672fc3
|
MPI3
|
2016-10-20 16:57:00 +01:00 |
|
paboyle
|
8c043da5b7
|
SHMEM and comms allocator made different
|
2016-10-20 16:56:05 +01:00 |
|
paboyle
|
3cbe974eb4
|
Layout
|
2016-10-20 16:55:21 +01:00 |
|
|
997fd882ff
|
Merge branch 'develop' into feature/feynman-rules
# Conflicts:
# lib/Threads.h
# lib/qcd/action/fermion/WilsonFermion.cc
# lib/qcd/action/fermion/WilsonFermion.h
# lib/qcd/utils/SUn.h
# lib/simd/Grid_avx.h
# lib/simd/Intel512common.h
|
2016-10-19 18:35:18 +01:00 |
|
Guido Cossu
|
590675e2ca
|
Csum in hex format
|
2016-10-19 17:26:25 +01:00 |
|
Guido Cossu
|
8c65bdf6d3
|
Printing checksum for the RNG file
|
2016-10-19 16:56:11 +01:00 |
|
Guido Cossu
|
74f1ed3bc5
|
Adding some documentation for HMC
|
2016-10-19 10:51:13 +01:00 |
|
paboyle
|
7af9b87318
|
Cache face tables to improve performance.
Extract merge now looking poor.
|
2016-10-18 09:51:37 +01:00 |
|
paboyle
|
811ca45473
|
GNU clang hack for AVX512 since there are missing reduce intrinsics in Clang 3.9 and GCC-6 AVX512 support
|
2016-10-17 16:23:21 +01:00 |
|
paboyle
|
bc1a4d40ba
|
Faster integer handling avoid push_back
|
2016-10-17 16:16:44 +01:00 |
|
Guido Cossu
|
e250e6b7bb
|
Moving parameters outside of the HMCrunner
|
2016-10-14 17:22:32 +01:00 |
|
paboyle
|
c8079e6621
|
Time the face gateher in x-dir more carefully
|
2016-10-13 22:28:50 +01:00 |
|
azusayamaguchi
|
8b0d171c9a
|
32bit issue on the KNL code variant where byte offsets were stored
|
2016-10-12 17:49:32 +01:00 |
|
azusayamaguchi
|
8bbd9ebc27
|
Reversing changes to Stencil class
|
2016-10-12 13:47:20 +01:00 |
|
azusayamaguchi
|
6472b431f0
|
__rdpmc needed for gcc, clang++
|
2016-10-12 12:29:08 +01:00 |
|
azusayamaguchi
|
bd205a3293
|
Fixing for non x86 and non KNL
|
2016-10-12 12:09:15 +01:00 |
|
azusayamaguchi
|
496beffa88
|
Fix non-KNL build
|
2016-10-12 12:06:08 +01:00 |
|
azusayamaguchi
|
9b63e97108
|
align not absolutely required and confuses clang++
|
2016-10-12 11:51:21 +01:00 |
|
azusayamaguchi
|
81f2aeaece
|
KNL streaming stores, and KNL performance coutners
|
2016-10-12 11:45:22 +01:00 |
|
paboyle
|
2d4a45c758
|
Typecast pointer
|
2016-10-12 09:14:15 +01:00 |
|
paboyle
|
a123dcd7e9
|
Static required for shmem. Reading same object twice requires csum reset
|
2016-10-12 00:29:57 +01:00 |
|
paboyle
|
6b27c42dfe
|
Cosmetic
|
2016-10-12 00:29:39 +01:00 |
|
paboyle
|
f7c2aa3ba5
|
runtime by default
|
2016-10-12 00:29:13 +01:00 |
|
paboyle
|
7240d73184
|
Parallelise the x faces; fix the segv on KNL with comms
|
2016-10-11 22:21:07 +01:00 |
|
paboyle
|
42cd148f5e
|
Base pointer for comms buffer under AVX512 assembly
|
2016-10-11 16:06:06 +01:00 |
|
Guido Cossu
|
eda4dd622e
|
Some more edit
|
2016-10-11 15:45:20 +01:00 |
|
paboyle
|
6e01264bb7
|
don't use static by default
|
2016-10-11 10:03:39 +01:00 |
|
paboyle
|
6f408256bc
|
FMA4 option moved on the align
|
2016-10-11 10:03:01 +01:00 |
|
paboyle
|
8d11681aac
|
verbose remove
|
2016-10-10 23:50:42 +01:00 |
|
paboyle
|
3d5c9a1ee9
|
No compile fix on clang++ 3.9
|
2016-10-10 23:50:13 +01:00 |
|
paboyle
|
dc389e467c
|
axpy_ssp for any coeff type via template
|
2016-10-10 23:48:05 +01:00 |
|
paboyle
|
3619167d62
|
Mass parameter
|
2016-10-10 23:47:33 +01:00 |
|
paboyle
|
96f1d1b828
|
Debugged Domain wall and Overlap feynman rules (infinite Ls, finite mass).
|
2016-10-10 23:46:45 +01:00 |
|
paboyle
|
657e0a8f4d
|
Mass parameter
|
2016-10-10 23:46:10 +01:00 |
|
paboyle
|
616e7cd83e
|
Mass parameter
|
2016-10-10 23:45:48 +01:00 |
|
paboyle
|
6f26d2e8d4
|
Overlap tree level feynman rule
|
2016-10-10 23:45:18 +01:00 |
|
paboyle
|
c014574504
|
A "please implement me" feynman rule. If this were abstract virtual it would
require/force implementation
|
2016-10-10 23:44:00 +01:00 |
|
paboyle
|
d7ce164e6e
|
Feynman rule for DWF
|
2016-10-10 23:43:36 +01:00 |
|
paboyle
|
c0d5b99016
|
Dminus
|
2016-10-10 23:43:19 +01:00 |
|
paboyle
|
09ca32d678
|
Dminus added for Cayley
|
2016-10-10 23:42:55 +01:00 |
|
paboyle
|
082ae350c6
|
static schedule by default
|
2016-10-10 23:42:30 +01:00 |
|