paboyle
|
e099dcdae7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:25:29 +00:00 |
|
paboyle
|
4e7ab3166f
|
Refactoring header layout
|
2017-02-22 18:09:33 +00:00 |
|
paboyle
|
aac80cbb44
|
Bug fix from Chris K
|
2017-02-22 12:19:09 -05:00 |
|
paboyle
|
3ae92fa2e6
|
Global changes to parallel_for structure.
Move the comms flags to more sensible names
|
2017-02-21 05:24:27 -05:00 |
|
paboyle
|
3906cd2149
|
Stencil fix on BNL KNL system
|
2017-02-20 17:51:31 -05:00 |
|
paboyle
|
5a1fb29db7
|
Useful debug code info to preserve
|
2017-02-20 17:49:23 -05:00 |
|
paboyle
|
661fc4d3d1
|
Debug AVX512 exchange code paths
|
2017-02-20 17:48:36 -05:00 |
|
paboyle
|
41009cc142
|
Move excange into the stencil only; keep Cshift fully general
|
2017-02-20 17:48:04 -05:00 |
|
paboyle
|
37720c4db7
|
Count bytes off node only
|
2017-02-20 17:47:40 -05:00 |
|
paboyle
|
1a30455a10
|
1000 iters on bmark for more accurate timing
|
2017-02-20 17:47:01 -05:00 |
|
paboyle
|
cd0da81196
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2017-02-16 18:52:30 -05:00 |
|
paboyle
|
f246fe3304
|
Improvements to avx for invertible to avoid latent bug
|
2017-02-16 23:52:44 +00:00 |
|
paboyle
|
8a29c16bde
|
Faster gather exchange
|
2017-02-16 23:52:22 +00:00 |
|
paboyle
|
d68907fc3e
|
Debug temp
|
2017-02-16 18:51:35 -05:00 |
|
paboyle
|
5c0adf7bf2
|
Make clang happy with parenthesis
|
2017-02-16 23:51:33 +00:00 |
|
paboyle
|
be3a8249c6
|
Faster gather
|
2017-02-16 23:51:15 +00:00 |
|
paboyle
|
bd600702cf
|
Vectorise the XYZT face gathering better.
Hard coded for simd_layout <= 2 in any given spread out direction; full generality is inconsistent
with efficiency.
|
2017-02-15 11:11:04 +00:00 |
|
paboyle
|
aca7a3ef0a
|
Optimisation control improvements
|
2017-02-10 18:22:31 -05:00 |
|
paboyle
|
2c246551d0
|
Overlap comms and compute options in wilson kernels
|
2017-02-07 01:37:10 -05:00 |
|
paboyle
|
71ac2e7940
|
Faster RNG init
|
2017-02-07 01:33:23 -05:00 |
|
paboyle
|
2bf4688e83
|
Running on BNL KNL
|
2017-02-07 01:32:10 -05:00 |
|
paboyle
|
a48ee6f0f2
|
Don't use MPI3_leader any more. No real gain and complex
|
2017-02-07 01:31:24 -05:00 |
|
paboyle
|
73547cca66
|
MPI3 working i think
|
2017-02-07 01:30:02 -05:00 |
|
paboyle
|
123c673db7
|
Policy to control async or sync SendRecv
|
2017-02-07 01:24:54 -05:00 |
|
paboyle
|
61f82216e2
|
Communicator Policy, NodeCount distinct from Rank count
|
2017-02-07 01:22:53 -05:00 |
|
paboyle
|
8e7ca92278
|
Debugged cshift case
|
2017-02-07 01:21:32 -05:00 |
|
paboyle
|
485ad6fde0
|
Stencil working in SHM MPI3
|
2017-02-07 01:20:39 -05:00 |
|
paboyle
|
6ea2184e18
|
OMP define change
|
2017-02-07 01:17:16 -05:00 |
|
paboyle
|
fdc170b8a3
|
Parallel fors in lattice transfer
|
2017-02-07 01:16:39 -05:00 |
|
paboyle
|
060da786e9
|
Comms benchmark improvements
|
2017-02-07 01:07:39 -05:00 |
|
paboyle
|
85c7bc4321
|
Bug fixes for cases that physics code couldn't hit but latent
and discovered on KNL (long vector, y SIMD dir) and checker dir set to y.
Remove the assertions on these code paths now they are tested.
|
2017-02-07 01:01:15 -05:00 |
|
paboyle
|
0883d6a7ce
|
Overlap comms compute support; make reg naming consistent with bgq aasm
|
2017-02-07 00:59:32 -05:00 |
|
paboyle
|
9ff97b4711
|
Improved stencil tests passing all on KNL multinode
|
2017-02-07 00:58:34 -05:00 |
|
paboyle
|
b5e9c900a4
|
Better printing and signal handling options
|
2017-02-07 00:57:55 -05:00 |
|
paboyle
|
4bbdfb434c
|
Overlap comms compute modifications
|
2017-02-07 00:57:01 -05:00 |
|
Christopher Kelly
|
c94133af49
|
Added iteration reporting to CG and mixed CG
Added ability to manually change the initial CG inner tolerance in mixed CG
Added .hpp files to filelist script
|
2017-02-02 17:04:42 -05:00 |
|
|
e7d8030a64
|
operator>> for serialisable enums
|
2017-02-01 15:51:08 -08:00 |
|
|
d775fbb2f9
|
Gammas: code cleaning and gamma_L implementation & test
|
2017-02-01 15:45:05 -08:00 |
|
|
863855f46f
|
header fix
|
2017-02-01 11:59:44 -08:00 |
|
|
419af7610d
|
New gamma matrices tidying: generated code is confined to Gamma.* for readability
|
2017-02-01 11:23:12 -08:00 |
|
|
7da7d263c4
|
typo
|
2017-01-30 10:53:13 -08:00 |
|
|
1140573027
|
Gamma adj fix: now in Grid namespace to avoid collisions
|
2017-01-30 10:53:04 -08:00 |
|
|
a0cfbb6e88
|
Merge branch 'feature/gammas' into feature/hadrons
# Conflicts:
# .gitignore
# lib/qcd/spin/Dirac.cc
# scripts/filelist
|
2017-01-30 09:10:49 -08:00 |
|
|
515a26b3c6
|
gammas: copyright update
|
2017-01-30 09:07:09 -08:00 |
|
Guido Cossu
|
899e685627
|
Merge branch 'feature/sitmo_rng' into develop
|
2017-01-27 14:15:56 +00:00 |
|
|
3bf993d81a
|
gitignore update
|
2017-01-26 17:00:59 -08:00 |
|
|
fad743fbb1
|
Build system sanity check: corrected several headers not in the <Grid/*> format
|
2017-01-26 17:00:41 -08:00 |
|
Guido Cossu
|
ef8d3831eb
|
Temporary patch the threading error in InsertSlice and ExtractSlice
Find source and fix the error
|
2017-01-25 18:12:04 +00:00 |
|
Guido Cossu
|
70ed9fc40c
|
Updating the engine to the last version
|
2017-01-25 18:10:41 +00:00 |
|
|
4d3787db65
|
Hadrons fixed for new gammas, Meson only does one contraction but this’ll change in the future
|
2017-01-25 09:59:00 -08:00 |
|