mirror of https://github.com/paboyle/Grid.git synced 2026-04-20 02:31:01 +01:00
Commit Graph

601 Commits

Author SHA1 Message Date
paboyle 641a28aa1d Namespace 2018-01-14 22:53:50 +00:00
paboyle 75207fa010 Format 2018-01-14 22:53:13 +00:00
paboyle c2b0e0269a Namespace 2018-01-14 22:52:22 +00:00
paboyle 7828887604 Namespace, indent 2018-01-14 22:51:18 +00:00
paboyle e6efc93a7c Namespace 2018-01-14 22:50:35 +00:00
paboyle ff7e773d5e Namespace 2018-01-14 22:49:48 +00:00
paboyle a0380fad72 Namespace 2018-01-14 22:48:57 +00:00
paboyle 61e9a33777 Namespace 2018-01-14 22:48:08 +00:00
paboyle 3e139b52d3 Namespace 2018-01-14 22:47:24 +00:00
paboyle fd6031b005 Namespace 2018-01-14 22:46:17 +00:00
paboyle fe44fc50d9 Namespace 2018-01-14 22:45:29 +00:00
paboyle 2dd88cf3f8 Namespace 2018-01-14 22:44:41 +00:00
paboyle 6b7e82f1a9 Namespace, indentation 2018-01-14 22:44:06 +00:00
paboyle be612b3931 Namespace, indentation 2018-01-14 22:43:27 +00:00
paboyle f5e74033f9 Namespace 2018-01-14 22:42:31 +00:00
paboyle 8d52e0a349 Namespace 2018-01-14 22:41:23 +00:00
paboyle a60f6d353e Namespace 2018-01-14 22:40:29 +00:00
paboyle 5d3b574325 Missing banner; should recreate globally 2018-01-14 22:39:24 +00:00
paboyle 6ee5ea6b32 Namespace QCD gone 2018-01-14 22:38:22 +00:00
paboyle 43e48542ab Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2018-01-08 11:34:45 +00:00
paboyle 6ecf280723 Simplify comms layer proliferation 2018-01-08 11:28:04 +00:00
portelli 2401360784 Merge pull request #138 from guelpers/feature/hadrons
bug fix in sequential insertion of conserved vector current
2017-12-11 18:53:41 +01:00
Vera Guelpers 2cfb50cbe5 bug fix in sequential insertion of conserved vector current 2017-12-08 11:13:39 +00:00
portelli 0a038ea15a Merge branch 'develop' into feature/hadrons 2017-12-06 16:49:10 +01:00
portelli 62eb1f0e59 FermionOperator virtual destructor needed for polymorphism 2017-12-06 16:48:17 +01:00
portelli 682e7d7839 Merge branch 'develop' into feature/hadrons 2017-11-01 19:24:38 +00:00
paboyle bf58557fb1 Block compressed Lanczos 2017-10-10 14:15:11 +01:00
portelli 63b2bc1936 Merge branch 'develop' into feature/hadrons
# Conflicts:
#	lib/qcd/action/fermion/FermionOperatorImpl.h
2017-10-05 14:16:23 +01:00
paboyle d54807b8c0 MPIT works with split grid now 2017-10-02 23:14:56 +01:00
Peter Boyle 946a8671b9 Merge pull request #129 from djm2131/feature/eofa
Add support for DWF with the exact one flavor algorithm
2017-09-21 10:15:21 +01:00
Peter Boyle bfb68e6f02 Merge pull request #130 from giltirn/gparity-handunroll
Gparity handunroll
2017-09-21 10:11:00 +01:00
Christopher Kelly 59bd1fe21b Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel 2017-08-29 13:07:37 -07:00
portelli a56e3b40c4 Merge branch 'develop' into feature/hadrons 2017-08-29 11:03:53 -06:00
Christopher Kelly 74af885d4e Removed some no-longer-needed associated with G-parity hand unrolled kernel 2017-08-29 09:50:37 -04:00
paboyle 80c5bce5bb Merge branch 'develop' into feature/multi-communicator 2017-08-25 20:21:26 +01:00
paboyle f68b5de9c8 No compile fix on Clang 2017-08-25 19:35:21 +01:00
Christopher Kelly f365a83fae In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions 2017-08-25 14:24:11 -04:00
Peter Boyle c289699d9a updated from cambridge mpi3 shakeout 2017-08-25 11:41:01 +01:00
Peter Boyle c3b1263e75 Benchmark prep 2017-08-25 09:25:54 +01:00
Christopher Kelly 34a9aeb331 Reduced number of if-statement evaluations in G-parity unrolled kernel 2017-08-24 13:53:50 -07:00
portelli 21b02760c3 Merge branch 'develop' into feature/hadrons 2017-08-24 17:05:45 +01:00
Christopher Kelly ce5df177ee Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl 2017-08-23 15:05:22 -04:00
Christopher Kelly a0bb8e5b46 Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity 2017-08-23 14:44:40 -04:00
Christopher Kelly 46f88e6d72 G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary 2017-08-23 13:21:10 -04:00
David Murphy dd8f1ea189 Vectorized Mobius EOFA Dperp + shift operation 2017-08-23 13:17:26 -04:00
Christopher Kelly b61835c1a5 Added inplace version of intrinsic G-parity twist to hand-unrolled kernel 2017-08-23 12:33:48 -04:00
Azusa Yamaguchi d9cd4f0273 Staggered multinode block cg debugged. Missing global sum.
Code stalls and resumes on KNL at cambridge. Curious.

CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
David Murphy 459f70e8d4 Check-in of working Mobius EOFA class and tests 2017-08-22 22:38:30 -04:00
Christopher Kelly 061e48fd73 Replaced slow unpack-repack in G-parity BC twist with intrinsics version 2017-08-22 18:12:12 -04:00
Christopher Kelly ab50145001 Implemented first, unoptimized version of hand-unrolled G-parity kernels
Improved Test_gparity
2017-08-22 17:12:25 -04:00