Christopher Kelly
|
59bd1fe21b
|
Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel
|
2017-08-29 13:07:37 -07:00 |
|
Christopher Kelly
|
74af885d4e
|
Removed some no-longer-needed associated with G-parity hand unrolled kernel
|
2017-08-29 09:50:37 -04:00 |
|
Christopher Kelly
|
f365a83fae
|
In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions
|
2017-08-25 14:24:11 -04:00 |
|
Christopher Kelly
|
34a9aeb331
|
Reduced number of if-statement evaluations in G-parity unrolled kernel
|
2017-08-24 13:53:50 -07:00 |
|
Christopher Kelly
|
ce5df177ee
|
Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl
|
2017-08-23 15:05:22 -04:00 |
|
Christopher Kelly
|
a0bb8e5b46
|
Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity
|
2017-08-23 14:44:40 -04:00 |
|
Christopher Kelly
|
46f88e6d72
|
G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary
|
2017-08-23 13:21:10 -04:00 |
|
Christopher Kelly
|
b61835c1a5
|
Added inplace version of intrinsic G-parity twist to hand-unrolled kernel
|
2017-08-23 12:33:48 -04:00 |
|
Christopher Kelly
|
061e48fd73
|
Replaced slow unpack-repack in G-parity BC twist with intrinsics version
|
2017-08-22 18:12:12 -04:00 |
|
Christopher Kelly
|
ab50145001
|
Implemented first, unoptimized version of hand-unrolled G-parity kernels
Improved Test_gparity
|
2017-08-22 17:12:25 -04:00 |
|
Guido Cossu
|
fd367d8bfd
|
Debugging the PointerCache
|
2017-08-16 09:42:57 +01:00 |
|
Guido Cossu
|
8a3fe60a27
|
Added more asserts at grid creation time
|
2017-08-08 11:36:20 +01:00 |
|
Guido Cossu
|
44051aecd1
|
Checking for integer divisions in cartesian full
|
2017-08-08 10:31:12 +01:00 |
|
Guido Cossu
|
06e6f8de00
|
Check that the reduced dim is an integer
|
2017-08-08 10:22:12 +01:00 |
|
Guido Cossu
|
4fe182e5a7
|
Added high level HMC support for overriding default SIMD lane decomposition
|
2017-08-06 10:46:19 +01:00 |
|
Guido Cossu
|
175f393f9d
|
Binary IO error checking
|
2017-08-04 12:14:10 +01:00 |
|
Guido Cossu
|
8bd869da37
|
Correcting a bug in the IO routines
|
2017-07-27 15:12:50 +01:00 |
|
Guido Cossu
|
c0485d799d
|
Explicit parameter declaration in the WilsonGauge test
|
2017-07-26 16:26:04 +01:00 |
|
Guido Cossu
|
7abc5613bd
|
Added smearing to the topological charge observable
|
2017-07-26 16:21:17 +01:00 |
|
Guido Cossu
|
a4b7dddb67
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-07-26 12:07:38 +01:00 |
|
Guido Cossu
|
5696781862
|
Debug error in Tensor mult
|
2017-07-26 12:07:34 +01:00 |
|
Christopher Kelly
|
0f214ad427
|
Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header
|
2017-07-21 11:13:51 -04:00 |
|
azusayamaguchi
|
659d7d1a40
|
For test/solver
Fixed
|
2017-07-12 15:01:48 +01:00 |
|
azusayamaguchi
|
dc6f078246
|
fixed the header file for mpi3
|
2017-07-11 14:15:08 +01:00 |
|
Peter Boyle
|
40e119c61c
|
NUMA improvements worth preserving from AMD EPYC tests
|
2017-07-08 22:27:11 -04:00 |
|
Peter Boyle
|
a0be3f7330
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-06-30 10:53:50 +01:00 |
|
Peter Boyle
|
b5a6e4f1fd
|
Best option for Xeon cache blocking set
|
2017-06-30 10:53:22 +01:00 |
|
Peter Boyle
|
7a788db3dc
|
Guard first touch
|
2017-06-30 10:49:08 +01:00 |
|
Peter Boyle
|
f20eceb6cd
|
First touch once per page in a threaded loop
|
2017-06-30 10:48:27 +01:00 |
|
Peter Boyle
|
38325ebbc6
|
Interleave code path; not enabled
|
2017-06-30 10:23:51 +01:00 |
|
Peter Boyle
|
ac1f1838bc
|
KNL only
|
2017-06-30 10:15:32 +01:00 |
|
Guido Cossu
|
8859a151cc
|
Small corrections to the NEON port
|
2017-06-29 11:30:29 +01:00 |
|
Guido Cossu
|
688a39cfd9
|
Merge pull request #114 from nmeyer-ur/feature/arm-neon
ARM neon intrinsics support
Guido: checked and approved
|
2017-06-29 09:57:17 +01:00 |
|
Nils Meyer
|
0933aeefd4
|
corrected Grid_neon.h
|
2017-06-28 20:22:22 +02:00 |
|
|
07de925127
|
minor scalar action fixes
|
2017-06-28 12:45:44 +01:00 |
|
Nils Meyer
|
a9c816a268
|
moved file to correct folder
|
2017-06-27 21:39:15 +02:00 |
|
Nils Meyer
|
bf729766dd
|
removed collision with QPX implementation
|
2017-06-27 20:32:24 +02:00 |
|
|
0b707b861c
|
Merge branch 'develop' into feature/scalar-hmc-update
|
2017-06-27 14:40:05 +01:00 |
|
|
15e87a4607
|
HDF5 IO fix
|
2017-06-27 14:39:27 +01:00 |
|
|
7d7220cbd7
|
scalar: lambda/4! convention
|
2017-06-27 14:38:45 +01:00 |
|
|
0af740dc15
|
minor scalar HMC code improvement
|
2017-06-24 23:04:05 +01:00 |
|
|
d2e8372df3
|
SU(N) algebra fix (was not working)
|
2017-06-24 23:03:39 +01:00 |
|
Lanny91
|
56abbdf4c2
|
AVX512 integer reduce fix (for non-intel compiler)
|
2017-06-23 11:09:14 +02:00 |
|
Lanny91
|
af71c63f4c
|
AVX2 fix
|
2017-06-23 11:03:12 +02:00 |
|
Lanny91
|
0440d4ce66
|
Merge branch 'develop' of https://github.com/paboyle/Grid into hotfix/bgq
|
2017-06-22 17:09:42 +02:00 |
|
|
b22eab8c8b
|
Merge commit 'a7d56523abee6c9030fdd9303c79954897b1086f' into feature/hadrons
|
2017-06-21 18:32:48 +01:00 |
|
paboyle
|
e8b95bd35b
|
Clean up finished. Could shrink Lanczos to around 400 lines at a push
|
2017-06-21 02:50:09 +01:00 |
|
paboyle
|
7e35286860
|
Simplified lanczos, added Eigen diagonalisation.
Curious if we can deprecate dependencly on BLAS.
Will see when we get 48^3 running on our BG/Q port
|
2017-06-21 02:26:03 +01:00 |
|
paboyle
|
0486ff8e79
|
Improved the lancos
|
2017-06-20 18:46:01 +01:00 |
|
|
1e8a2e1621
|
various compatibility fixes after merge
|
2017-06-20 17:24:55 +01:00 |
|