paboyle
7828887604
Namespace, indent
2018-01-14 22:51:18 +00:00
paboyle
e6efc93a7c
Namespace
2018-01-14 22:50:35 +00:00
paboyle
ff7e773d5e
Namesapce
2018-01-14 22:49:48 +00:00
paboyle
a0380fad72
Namespace
2018-01-14 22:48:57 +00:00
paboyle
61e9a33777
Namesapce
2018-01-14 22:48:08 +00:00
paboyle
3e139b52d3
Namespace
2018-01-14 22:47:24 +00:00
paboyle
fd6031b005
Namespace
2018-01-14 22:46:17 +00:00
paboyle
fe44fc50d9
Namespace
2018-01-14 22:45:29 +00:00
paboyle
2dd88cf3f8
Namespace
2018-01-14 22:44:41 +00:00
paboyle
6b7e82f1a9
Namespace, indentation
2018-01-14 22:44:06 +00:00
paboyle
be612b3931
Namespace, indentation
2018-01-14 22:43:27 +00:00
paboyle
f5e74033f9
Namespace
2018-01-14 22:42:31 +00:00
paboyle
8d52e0a349
Namespace
2018-01-14 22:41:23 +00:00
paboyle
a60f6d353e
Namespace
2018-01-14 22:40:29 +00:00
paboyle
5d3b574325
Missing banner; should recreate globally
2018-01-14 22:39:24 +00:00
paboyle
6ee5ea6b32
Namespace QCD gone
2018-01-14 22:38:22 +00:00
paboyle
43e48542ab
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-01-08 11:34:45 +00:00
paboyle
6ecf280723
Simplify comms layer proliferation
2018-01-08 11:28:04 +00:00
2401360784
Merge pull request #138 from guelpers/feature/hadrons
...
bug fix in sequential insertion of conserved vector current
2017-12-11 18:53:41 +01:00
Vera Guelpers
2cfb50cbe5
bug fix in sequential insertion of conserved vector current
2017-12-08 11:13:39 +00:00
0a038ea15a
Merge branch 'develop' into feature/hadrons
2017-12-06 16:49:10 +01:00
62eb1f0e59
FermionOperator virtual destructor needed for polymorphism
2017-12-06 16:48:17 +01:00
682e7d7839
Merge branch 'develop' into feature/hadrons
2017-11-01 19:24:38 +00:00
paboyle
bf58557fb1
Block compressed Lanczos
2017-10-10 14:15:11 +01:00
63b2bc1936
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/fermion/FermionOperatorImpl.h
2017-10-05 14:16:23 +01:00
paboyle
d54807b8c0
MPIT works with split grid now
2017-10-02 23:14:56 +01:00
Peter Boyle
946a8671b9
Merge pull request #129 from djm2131/feature/eofa
...
Add support for DWF with the exact one flavor algorithm
2017-09-21 10:15:21 +01:00
Peter Boyle
bfb68e6f02
Merge pull request #130 from giltirn/gparity-handunroll
...
Gparity handunroll
2017-09-21 10:11:00 +01:00
Christopher Kelly
59bd1fe21b
Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel
2017-08-29 13:07:37 -07:00
a56e3b40c4
Merge branch 'develop' into feature/hadrons
2017-08-29 11:03:53 -06:00
Christopher Kelly
74af885d4e
Removed some no-longer-needed associated with G-parity hand unrolled kernel
2017-08-29 09:50:37 -04:00
paboyle
80c5bce5bb
Merge branch 'develop' into feature/multi-communicator
2017-08-25 20:21:26 +01:00
paboyle
f68b5de9c8
No compile fix on Clang
2017-08-25 19:35:21 +01:00
Christopher Kelly
f365a83fae
In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions
2017-08-25 14:24:11 -04:00
Peter Boyle
c289699d9a
updated from cambridge mpi3 shakeout
2017-08-25 11:41:01 +01:00
Peter Boyle
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
Christopher Kelly
34a9aeb331
Reduced number of if-statement evaluations in G-parity unrolled kernel
2017-08-24 13:53:50 -07:00
21b02760c3
Merge branch 'develop' into feature/hadrons
2017-08-24 17:05:45 +01:00
Christopher Kelly
ce5df177ee
Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl
2017-08-23 15:05:22 -04:00
Christopher Kelly
a0bb8e5b46
Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity
2017-08-23 14:44:40 -04:00
Christopher Kelly
46f88e6d72
G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary
2017-08-23 13:21:10 -04:00
David Murphy
dd8f1ea189
Vectorized Mobius EOFA Dperp + shift operation
2017-08-23 13:17:26 -04:00
Christopher Kelly
b61835c1a5
Added inplace version of intrinsic G-parity twist to hand-unrolled kernel
2017-08-23 12:33:48 -04:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
David Murphy
459f70e8d4
Check-in of working Mobius EOFA class and tests
2017-08-22 22:38:30 -04:00
Christopher Kelly
061e48fd73
Replaced slow unpack-repack in G-parity BC twist with intrinsics version
2017-08-22 18:12:12 -04:00
Christopher Kelly
ab50145001
Implemented first, unoptimized version of hand-unrolled G-parity kernels
...
Improved Test_gparity
2017-08-22 17:12:25 -04:00
paboyle
a446d95c33
Trying to pass TeamCity and Travis
2017-08-20 01:10:50 +01:00
Peter Boyle
7d88198387
Merge branch 'develop' into feature/multi-communicator
2017-08-19 13:03:35 -04:00
David Murphy
9d45fca8bc
Implement MobiusEOFAFermioncache.cc
2017-08-17 23:45:36 -04:00