paboyle
25becc9324
GPU tweaks for benchmarking; really necessary?
2018-06-13 20:26:07 +01:00
paboyle
aead94e9a7
View introduced
2018-03-04 16:39:29 +00:00
paboyle
36ea5f6b77
gpu friendly coordinates ; no std::vector on GPU
2018-02-24 22:20:14 +00:00
paboyle
ce4da83bc2
Zero changes, literally
2018-01-27 23:51:10 +00:00
paboyle
2a4a0e43c1
Hide internals
2018-01-26 23:08:27 +00:00
paboyle
f4010023ca
Warning fixes
2018-01-25 23:46:47 +00:00
paboyle
d74c21a386
GLobal edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
Peter Boyle
40e119c61c
NUMA improvements worth preserving from AMD EPYC tests
2017-07-08 22:27:11 -04:00
Peter Boyle
08e04b9676
Better benchmarks
2017-06-28 15:30:06 +01:00
paboyle
3bfd1f13e6
I/O improvements
2017-06-11 23:14:10 +01:00
paboyle
f18f5ed926
Drop random device
2017-04-02 00:26:26 +09:00
paboyle
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00
paboyle
aae8bf31a7
Global edit adding copyright and license info to every source file.
2016-01-02 14:51:32 +00:00
paboyle
32762346ad
Better run time on KNC
2015-11-04 03:25:34 -08:00
paboyle
af89c40462
Better timing tweaks to give sensible results on 24 threads on Edison dual ivybridge nodes.
2015-09-28 16:09:04 -07:00
Peter Boyle
d1afebf71e
Sizable improvement in multigrid for unsquared.
...
6000 matmuls CG unprec
2000 matmuls CG prec (4000 eo muls)
1050 matmuls PGCR on 16^3 x 32 x 8 m=.01
Substantial effort on timing and logging infrastructure
2015-07-24 01:31:13 +09:00
paboyle
39271b02dd
Modified memory bw test to display word size
2015-07-01 22:46:53 +01:00
Peter Boyle
84b5c7217d
CG test written and passes i.e. converges with small true residual
...
in RedBlack MpcDagMpc, Unprec MdagM and Schur red black solver for
each of.
DomainWallFermion
MobiusFermion
MobiusZolotarevFermion
ScaledShamirFermion
ScaledShamirZolotarevFermion
2015-06-03 10:54:03 +01:00