6aeaf6f568
Parallel IO worked on. I'm puzzled because I already thought I shook this out on MacOS + OpenMPI and then
...
turned up problems on the BlueWaters Cray.
Gets 75MB/s from home filesystem on parallel configuration read. Need to make the RNG IO parallel,
and also to look at aggregating bigger writes for the parallel write.
Not sure what the home filesystem is.
2016-02-21 08:03:21 -06:00
3425751cb8
Missing return value
2016-02-19 01:06:03 +00:00
22422a84d9
Small problem in compressor fix
2016-02-17 19:03:09 -06:00
c9fadf97a5
Simplify the compressor interface again.
2016-02-17 18:16:45 -06:00
81395e85d1
Regressing to not overlap comms and compute becasue bluewaters, edison, and cori are so rubbish at it.
2016-02-16 13:56:44 -06:00
a0fc47c6f9
Cheaper implementation
2016-02-15 16:02:36 -06:00
e2f73e3ead
Updates for shmem
2016-02-10 16:50:32 -08:00
6371676a75
Correcting some compilation errors for clang-sse
2016-02-10 11:37:03 +09:00
fc6ad65751
Pushed the overlap comms tweaks
2016-01-11 06:34:22 -08:00
dafc74020c
Overlap comms compute improvements in hand op kernels, and better timing from Edison and Cori
2016-01-10 16:54:27 -08:00
d19321dfde
Overlap comms compute changes
2016-01-10 19:20:16 +00:00
c99d748da6
Timing reports in benchmarks now reflect the asynch comms thread statistics
2016-01-04 14:42:16 +00:00
02452afd36
Optional overlap of comms with compute
2016-01-04 14:18:40 +00:00
331768dcff
Added overlap comms compute mode
2016-01-03 01:38:11 +00:00
aae8bf31a7
Global edit adding copyright and license info to every source file.
2016-01-02 14:51:32 +00:00
5a80930dd2
Charge conjugation boundary conditions for gauge fields implemented as a policy
...
class, changing the nature of covariant Cshifts used in
plaquettes, rectangles and staples.
As a result same code is used for the plaq and rect action independent of the BC type.
Should probably isolate the BC in a separate class that Gimpl takes as a template param.
Do the same with smearing policies.
This would then allow composition of BC with smearing etc....
2016-01-02 13:37:25 +00:00
841a37f941
Fix to WilsonCompressor that fixes a bug in comms phase due to the sign change on gamma
...
matrix in hopping term.
Add logging of time spent in CG.
2015-12-29 23:49:41 +00:00
e108e708a3
Wilson TM tests and compiles in
2015-12-17 23:06:33 +00:00
67ccb043f1
Added TM fermions for DSDR etc..
2015-12-17 22:34:28 +00:00
34a0fde2ad
Fixes to fermion force terms after sign of gamma_mu (0...3) change.
...
Thought I had already committed these.
Believe I have got the Gparity fermion force working.
* tests/Test_gpdwf_force.cc -- correctly predicts dS for two flavour pseudofermion
based on a small dt update of U field.
* tests/Test_hmc_EODWFRatio_Gparity.cc -- ran 1 trajectory on 8^4 with dH=0.21.
Need to accumulate a full plaquette log to believe fully which will take some hours of run time.
2015-12-15 23:14:12 +00:00
3ce10aa975
Fix a regression failure on Mobius; chroma regression added
2015-12-10 22:55:00 +00:00
b2c02a6106
Runs fastst on cori
2015-11-28 16:58:16 -08:00
e9ff25b06b
Small threading change makes a difference on Cori.
2015-11-07 00:07:05 -08:00
05a7029600
Stencil change
2015-11-07 00:06:31 -08:00
899ca41cb8
Merge branch 'master' of github.com:paboyle/Grid
...
Conflicts:
lib/qcd/action/fermion/WilsonFermion5D.cc
2015-11-06 03:50:04 -08:00
d29b4c1dee
Assembler files
2015-11-06 03:48:48 -08:00
a2ff068e29
Asm and threading for many core
2015-11-06 03:47:14 -08:00
17af18dcab
Changes for AVX512 assembler
2015-11-06 03:45:51 -08:00
28022755ae
Stencil class name global change to StencilImpl typedef
2015-11-06 05:30:17 -06:00
1159de165c
Asm option for AVX512
2015-11-05 22:04:51 -08:00
16c7993434
Merge branch 'master' of github.com:paboyle/Grid
...
Conflicts:
lib/simd/Grid_avx512.h
lib/simd/Grid_imci.h
2015-11-04 03:32:10 -08:00
4e65ad21ac
Adding a routine for AVX512 / IMCI with explicit assembly implementations
2015-11-04 03:15:08 -08:00
abb23df83f
formatting only
2015-11-04 10:00:27 +00:00
1878bf97d0
Babbage fix
2015-09-30 16:04:01 -07:00
64d64d1ab6
Updating to modify non-inlining permute routines and hopefully get better reg use and
...
enhance performance.
2015-09-25 08:55:04 -07:00
2f38ebc446
Reintroducing the hand unrolled loops
2015-09-08 17:45:30 +01:00
a842a6c94d
One flavour rational unprec added; untested but does compile.
...
Moving param structs into a single header for later connection to file I/O using
macromagic.h
2015-08-18 14:40:08 +01:00
f0e32f12cf
Merge branch 'master' of https://github.com/paboyle/Grid
2015-08-15 23:59:04 +01:00
55cfc89459
* Finished the template/policy style introduction of gparity, except the gparity force terms.
...
So valence sector looks ok.
FermionOperatorImpl.h provides the policy classes.
Expect HMC will introduce a smearing policy and a fermion representation change policy template
param. Will also probably need multi-precision work.
* HMC is running even-odd and non-checkerboarded (checked 4^4 wilson fermion/wilson gauge).
There appears to be a bug in the multi-level integrator -- <e-dH> passes with single level but
not with multi-level.
In any case there looks to be quite a bit to clean up.
This is the "const det" style implementation that is not appropriate yet for clover since
it assumes that Mee is indept of the gauge fields. Easily fixed in future.
2015-08-15 23:25:49 +01:00
ba8c09a58e
Reorganising the Fermion interface
2015-08-14 14:16:45 +01:00
59d66eb17a
Gparity works now even if simd distributed in a Gparity twist direction.
...
Tested by doubling lattice in t-direction.
2015-08-14 12:57:42 +01:00
4dc7c36aa8
Gparity works now even if simd distributed in a Gparity twist direction.
...
Tested by doubling lattice in t-direction.
2015-08-14 12:57:42 +01:00
028e2061e0
Gparity valence test now working.
...
Interface in FermionOperator will change a lot in future
2015-08-14 00:01:04 +01:00
7d3512ab21
Gparity valence test now working.
...
Interface in FermionOperator will change a lot in future
2015-08-14 00:01:04 +01:00
8a0be42080
Gparity test added; partial implementation -- this is Chris K's doubled lattice only
...
and have to regress this with the 2 flavour implementation.
2015-08-12 09:49:33 +01:00
9183380946
Gparity test added; partial implementation -- this is Chris K's doubled lattice only
...
and have to regress this with the 2 flavour implementation.
2015-08-12 09:49:33 +01:00
26f5ee0621
Header
2015-08-11 06:23:38 +01:00
f165b1a120
Header
2015-08-11 06:23:38 +01:00
881acaa065
Gparity modifications in the Gparity compressor variant.
2015-08-11 06:22:20 +01:00
0a9ebac514
Gparity modifications in the Gparity compressor variant.
2015-08-11 06:22:20 +01:00