Christopher Kelly
|
d36d2fb40d
|
Added ability to override default Ls in Benchmark_dwf
|
2017-08-28 06:53:56 -07:00 |
|
Christopher Kelly
|
f365a83fae
|
In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions
|
2017-08-25 14:24:11 -04:00 |
|
Christopher Kelly
|
34a9aeb331
|
Reduced number of if-statement evaluations in G-parity unrolled kernel
|
2017-08-24 13:53:50 -07:00 |
|
Christopher Kelly
|
edabb3577f
|
Imported Benchmark_gparity
|
2017-08-23 16:54:06 -04:00 |
|
Christopher Kelly
|
ce5df177ee
|
Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl
|
2017-08-23 15:05:22 -04:00 |
|
Christopher Kelly
|
a0bb8e5b46
|
Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity
|
2017-08-23 14:44:40 -04:00 |
|
Christopher Kelly
|
46f88e6d72
|
G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary
|
2017-08-23 13:21:10 -04:00 |
|
Christopher Kelly
|
b61835c1a5
|
Added inplace version of intrinsic G-parity twist to hand-unrolled kernel
|
2017-08-23 12:33:48 -04:00 |
|
Christopher Kelly
|
061e48fd73
|
Replaced slow unpack-repack in G-parity BC twist with intrinsics version
|
2017-08-22 18:12:12 -04:00 |
|
Christopher Kelly
|
ab50145001
|
Implemented first, unoptimized version of hand-unrolled G-parity kernels
Improved Test_gparity
|
2017-08-22 17:12:25 -04:00 |
|
paboyle
|
383ca7d392
|
Switch off comms for now until feature/multi-communicator is merged
|
2017-08-20 01:27:48 +01:00 |
|
paboyle
|
6d0d064a6c
|
Update TODO
|
2017-08-19 23:11:30 +01:00 |
|
paboyle
|
bfef525ed2
|
New benchmark prep
|
2017-08-19 23:10:12 +01:00 |
|
Guido Cossu
|
fd367d8bfd
|
Debugging the PointerCache
|
2017-08-16 09:42:57 +01:00 |
|
Guido Cossu
|
8a3fe60a27
|
Added more asserts at grid creation time
|
2017-08-08 11:36:20 +01:00 |
|
Guido Cossu
|
44051aecd1
|
Checking for integer divisions in cartesian full
|
2017-08-08 10:31:12 +01:00 |
|
Guido Cossu
|
06e6f8de00
|
Check that the reduced dim is an integer
|
2017-08-08 10:22:12 +01:00 |
|
Guido Cossu
|
dbe4d7850c
|
Make a test file compatible with all architectures
|
2017-08-06 10:49:45 +01:00 |
|
Guido Cossu
|
4fe182e5a7
|
Added high level HMC support for overriding default SIMD lane decomposition
|
2017-08-06 10:46:19 +01:00 |
|
Guido Cossu
|
175f393f9d
|
Binary IO error checking
|
2017-08-04 12:14:10 +01:00 |
|
Guido Cossu
|
8bd869da37
|
Correcting a bug in the IO routines
|
2017-07-27 15:12:50 +01:00 |
|
Guido Cossu
|
c7036f6717
|
Adding checks for libm and libstdc++
|
2017-07-27 11:15:09 +01:00 |
|
Guido Cossu
|
c0485d799d
|
Explicit parameter declaration in the WilsonGauge test
|
2017-07-26 16:26:04 +01:00 |
|
Guido Cossu
|
7abc5613bd
|
Added smearing to the topological charge observable
|
2017-07-26 16:21:17 +01:00 |
|
Guido Cossu
|
237cfd11ab
|
Solving the spurious O2 flags
|
2017-07-26 12:08:51 +01:00 |
|
Guido Cossu
|
a4b7dddb67
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-07-26 12:07:38 +01:00 |
|
Guido Cossu
|
5696781862
|
Debug error in Tensor mult
|
2017-07-26 12:07:34 +01:00 |
|
|
c3f0889eda
|
Merge pull request #123 from giltirn/develop
Fix for 'using namespace' in lib/qcd/utils/GaugeFix.h
|
2017-07-25 11:32:02 -03:00 |
|
Christopher Kelly
|
0f214ad427
|
Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header
|
2017-07-21 11:13:51 -04:00 |
|
Peter Boyle
|
fe4912880d
|
Update README.md
|
2017-07-17 09:53:07 +01:00 |
|
Peter Boyle
|
f038c6babe
|
Update README.md
|
2017-07-14 22:59:16 +01:00 |
|
Peter Boyle
|
169f4b2711
|
Update README.md
|
2017-07-14 22:56:06 +01:00 |
|
Peter Boyle
|
2d8aff36fe
|
Update README.md
|
2017-07-14 22:52:16 +01:00 |
|
azusayamaguchi
|
659d7d1a40
|
For test/solver
Fixed
|
2017-07-12 15:01:48 +01:00 |
|
azusayamaguchi
|
dc6f078246
|
fixed the header file for mpi3
|
2017-07-11 14:15:08 +01:00 |
|
Peter Boyle
|
8a4714a4a6
|
Update README.md
|
2017-07-09 00:11:54 +01:00 |
|
Peter Boyle
|
40e119c61c
|
NUMA improvements worth preserving from AMD EPYC tests
|
2017-07-08 22:27:11 -04:00 |
|
Peter Boyle
|
7b0237b081
|
Update README.md
|
2017-07-01 10:24:41 +01:00 |
|
Peter Boyle
|
b68ad0cc0b
|
Update README.md
|
2017-07-01 10:20:07 +01:00 |
|
Peter Boyle
|
37263fd9b1
|
Update README.md
|
2017-07-01 10:06:24 +01:00 |
|
Peter Boyle
|
3d09e3e9e0
|
Update README.md
|
2017-07-01 10:05:46 +01:00 |
|
Peter Boyle
|
1354b46338
|
Update README.md
|
2017-07-01 10:04:32 +01:00 |
|
Peter Boyle
|
251a97fe1b
|
Update README.md
|
2017-07-01 09:55:36 +01:00 |
|
Peter Boyle
|
e18929eaa0
|
Update README.md
|
2017-07-01 09:53:15 +01:00 |
|
Peter Boyle
|
f3b0a92e71
|
Update README.md
|
2017-07-01 09:48:00 +01:00 |
|
Peter Boyle
|
a0be3f7330
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-06-30 10:53:50 +01:00 |
|
Peter Boyle
|
b5a6e4f1fd
|
Best option for Xeon cache blocking set
|
2017-06-30 10:53:22 +01:00 |
|
Peter Boyle
|
7a788db3dc
|
Guard first touch
|
2017-06-30 10:49:08 +01:00 |
|
Peter Boyle
|
f20eceb6cd
|
First touch once per page in a threaded loop
|
2017-06-30 10:48:27 +01:00 |
|
Peter Boyle
|
38325ebbc6
|
Interleave code path; not enabled
|
2017-06-30 10:23:51 +01:00 |
|