fe6e8f5ac6
Benchmark_comms fix
2022-11-15 17:00:49 -05:00
653039695b
Partial dirichlet changes
2022-11-15 16:37:15 -05:00
aab3bcb46f
Dirichlet first cut - wrong answers on dagger multiply.
...
Struggling to get a compute node so changing systems
2022-02-22 19:58:33 +00:00
f16c2665f5
Host memory explict
2020-11-12 20:29:58 +01:00
8244caff25
Remove the asynchronous non-Stencil calls.
2020-09-03 18:52:55 -04:00
e279b2be29
Merge develop
2019-08-14 23:01:59 +01:00
48e6efc7c9
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
2019-08-14 18:56:54 +01:00
263dcbabab
Simplify the comms benchmark
2019-07-30 22:51:04 +01:00
d85dcc72df
Multinode fix
2019-07-20 07:13:28 +01:00
0561c2edeb
Benchmarks modified for new GPU constructs
2019-06-15 12:52:56 +01:00
4e9df9e93c
GPU patches
2019-05-18 17:43:11 +01:00
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
bfbf2f1fa0
no threaded stencil benchmark if OpenMP is not supported
2018-05-03 16:20:01 +01:00
1dddd17e3c
Benchmark improvements from tesseract
2018-04-27 11:44:46 +01:00
36ea5f6b77
gpu friendly coordinates ; no std::vector on GPU
2018-02-24 22:20:14 +00:00
604c05f4b8
parallel_for elimination -> thread_loop
2018-01-28 01:01:36 +00:00
f4010023ca
Warning fixes
2018-01-25 23:46:47 +00:00
d74c21a386
GLobal edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
9b32d51cd1
Simplify comms layer proliferatoin
2018-01-08 11:27:14 +00:00
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
9e658de238
Use Vector
2017-08-19 12:52:44 -04:00
6f5a5cd9b3
Improved threaded comms benchmark
2017-06-28 23:27:02 +01:00
54e94360ad
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
2017-06-24 23:10:24 +01:00
725c513d94
Better MPI3 benchmarking
2017-05-29 16:47:32 -04:00
0ffc235741
Adding more statistics to the Benchmark_comms. Min and max
2017-05-19 10:55:04 +01:00
8e19c99c7d
Adding more statistical info in the Benchmark_comms
2017-05-18 19:07:35 +01:00
a0bc0ad06f
Reverting change in Bechmark_comms. Keeping 300 iterations
2017-05-18 17:48:11 +01:00
bc862ce3ab
Fixing an allocation issue in Benchmark_comms
2017-05-18 14:44:56 +01:00
060da786e9
Comms benchmark improvements
2017-02-07 01:07:39 -05:00
33dc1f51b5
Final sign off commits from Cori-1
2016-11-09 04:11:03 -08:00
757a928f9a
Improvement to use own SHM_OPEN call to avoid openmpi bug.
2016-11-02 12:37:46 +00:00
bb94ddd0eb
Tidy up of mpi3; also some cleaning of the dslash controls.
2016-11-02 08:07:09 +00:00
791cb050c8
Comms improvements
2016-11-01 11:35:43 +00:00
32bc7a6ab8
MPI back out of change that hangs
...
AVX2 for clang, gcc needs the -mfma flag.
2016-08-05 10:36:00 +01:00
ef97e32152
Adding persistent communicators
2016-07-08 17:16:08 +01:00
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00
aae8bf31a7
Global edit adding copyright and license info to every source file.
2016-01-02 14:51:32 +00:00
d1afebf71e
Sizable improvement in multigrid for unsquared.
...
6000 matmuls CG unprec
2000 matmuls CG prec (4000 eo muls)
1050 matmuls PGCR on 16^3 x 32 x 8 m=.01
Substantial effort on timing and logging infrastructure
2015-07-24 01:31:13 +09:00
93916f400d
Update Benchmark_comms.cc
2015-06-25 10:59:53 +01:00
84b5c7217d
CG test written and passes i.e. converges with small true residual
...
in RedBlack MpcDagMpc, Unprec MdagM and Schur red black solver for
each of.
DomainWallFermion
MobiusFermion
MobiusZolotarevFermion
ScaledShamirFermion
ScaledShamirZolotarevFermion
2015-06-03 10:54:03 +01:00