paboyle
36ea5f6b77
gpu friendly coordinates ; no std::vector on GPU
2018-02-24 22:20:14 +00:00
paboyle
604c05f4b8
parallel_for elimination -> thread_loop
2018-01-28 01:01:36 +00:00
paboyle
f4010023ca
Warning fixes
2018-01-25 23:46:47 +00:00
paboyle
d74c21a386
Global edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
paboyle
9b32d51cd1
Simplify comms layer proliferation
2018-01-08 11:27:14 +00:00
Peter Boyle
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
Peter Boyle
9e658de238
Use Vector
2017-08-19 12:52:44 -04:00
paboyle
6f5a5cd9b3
Improved threaded comms benchmark
2017-06-28 23:27:02 +01:00
paboyle
54e94360ad
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
2017-06-24 23:10:24 +01:00
Peter Boyle
725c513d94
Better MPI3 benchmarking
2017-05-29 16:47:32 -04:00
Guido Cossu
0ffc235741
Adding more statistics to the Benchmark_comms. Min and max
2017-05-19 10:55:04 +01:00
Guido Cossu
8e19c99c7d
Adding more statistical info in the Benchmark_comms
2017-05-18 19:07:35 +01:00
Guido Cossu
a0bc0ad06f
Reverting change in Benchmark_comms. Keeping 300 iterations
2017-05-18 17:48:11 +01:00
Guido Cossu
bc862ce3ab
Fixing an allocation issue in Benchmark_comms
2017-05-18 14:44:56 +01:00
paboyle
060da786e9
Comms benchmark improvements
2017-02-07 01:07:39 -05:00
paboyle
33dc1f51b5
Final sign off commits from Cori-1
2016-11-09 04:11:03 -08:00
paboyle
757a928f9a
Improvement to use own SHM_OPEN call to avoid openmpi bug.
2016-11-02 12:37:46 +00:00
paboyle
bb94ddd0eb
Tidy up of mpi3; also some cleaning of the dslash controls.
2016-11-02 08:07:09 +00:00
paboyle
791cb050c8
Comms improvements
2016-11-01 11:35:43 +00:00
paboyle
32bc7a6ab8
MPI back out of change that hangs
...
AVX2 for clang, gcc needs the -mfma flag.
2016-08-05 10:36:00 +01:00
paboyle
ef97e32152
Adding persistent communicators
2016-07-08 17:16:08 +01:00
paboyle
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00
paboyle
aae8bf31a7
Global edit adding copyright and license info to every source file.
2016-01-02 14:51:32 +00:00
Peter Boyle
d1afebf71e
Sizable improvement in multigrid for unsquared.
...
6000 matmuls CG unprec
2000 matmuls CG prec (4000 eo muls)
1050 matmuls PGCR on 16^3 x 32 x 8 m=.01
Substantial effort on timing and logging infrastructure
2015-07-24 01:31:13 +09:00
Peter Boyle
93916f400d
Update Benchmark_comms.cc
2015-06-25 10:59:53 +01:00
Peter Boyle
84b5c7217d
CG test written and passes i.e. converges with small true residual
...
in RedBlack MpcDagMpc, Unprec MdagM and Schur red black solver for
each of.
DomainWallFermion
MobiusFermion
MobiusZolotarevFermion
ScaledShamirFermion
ScaledShamirZolotarevFermion
2015-06-03 10:54:03 +01:00