portelli/Grid - DiRAC Tursa git server

mirror of https://github.com/paboyle/Grid.git synced 2025-06-14 22:07:05 +01:00

Author	SHA1	Message	Date
Peter Boyle	fe6e8f5ac6	Benchmark_comms fix	2022-11-15 17:00:49 -05:00
Peter Boyle	653039695b	Partial dirichlet changes	2022-11-15 16:37:15 -05:00
Peter Boyle	aab3bcb46f	Dirichlet first cut - wrong answers on dagger multiply. Struggling to get a compute node so changing systems	2022-02-22 19:58:33 +00:00
Peter Boyle	f16c2665f5	Host memory explict	2020-11-12 20:29:58 +01:00
Peter Boyle	8244caff25	Remove the asynchronous non-Stencil calls.	2020-09-03 18:52:55 -04:00
Peter Boyle	e279b2be29	Merge develop	2019-08-14 23:01:59 +01:00
Peter Boyle	48e6efc7c9	Merge branch 'develop' into feature/gpu-port Conflicts: Grid/qcd/action/fermion/WilsonKernelsAsm.cc Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h benchmarks/Benchmark_comms.cc	2019-08-14 18:56:54 +01:00
Peter Boyle	263dcbabab	Simplify the comms benchmark	2019-07-30 22:51:04 +01:00
Peter Boyle	d85dcc72df	Multinode fix	2019-07-20 07:13:28 +01:00
Peter Boyle	0561c2edeb	Benchmarks modified for new GPU constructs	2019-06-15 12:52:56 +01:00
Peter Boyle	4e9df9e93c	GPU patches	2019-05-18 17:43:11 +01:00
Peter Boyle	b57a4d32aa	Merge branch 'develop' into feature/gpu-port	2018-12-13 05:11:34 +00:00
Antonin Portelli	bfbf2f1fa0	no threaded stencil benchmark if OpenMP is not supported	2018-05-03 16:20:01 +01:00
Dr Peter Boyle	1dddd17e3c	Benchmark improvements from tesseract	2018-04-27 11:44:46 +01:00
paboyle	36ea5f6b77	gpu friendly coordinates ; no std::vector on GPU	2018-02-24 22:20:14 +00:00
paboyle	604c05f4b8	parallel_for elimination -> thread_loop	2018-01-28 01:01:36 +00:00
paboyle	f4010023ca	Warning fixes	2018-01-25 23:46:47 +00:00
paboyle	d74c21a386	GLobal edit for QCD namespace removal & NAMESPACE macros	2018-01-15 09:37:58 +00:00
paboyle	9b32d51cd1	Simplify comms layer proliferatoin	2018-01-08 11:27:14 +00:00
Peter Boyle	c3b1263e75	Benchmark prep	2017-08-25 09:25:54 +01:00
Peter Boyle	9e658de238	Use Vector	2017-08-19 12:52:44 -04:00
paboyle	6f5a5cd9b3	Improved threaded comms benchmark	2017-06-28 23:27:02 +01:00
paboyle	54e94360ad	Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit	2017-06-24 23:10:24 +01:00
Peter Boyle	725c513d94	Better MPI3 benchmarking	2017-05-29 16:47:32 -04:00
Guido Cossu	0ffc235741	Adding more statistics to the Benchmark_comms. Min and max	2017-05-19 10:55:04 +01:00
Guido Cossu	8e19c99c7d	Adding more statistical info in the Benchmark_comms	2017-05-18 19:07:35 +01:00
Guido Cossu	a0bc0ad06f	Reverting change in Bechmark_comms. Keeping 300 iterations	2017-05-18 17:48:11 +01:00
Guido Cossu	bc862ce3ab	Fixing an allocation issue in Benchmark_comms	2017-05-18 14:44:56 +01:00
paboyle	060da786e9	Comms benchmark improvements	2017-02-07 01:07:39 -05:00
paboyle	33dc1f51b5	Final sign off commits from Cori-1	2016-11-09 04:11:03 -08:00
paboyle	757a928f9a	Improvement to use own SHM_OPEN call to avoid openmpi bug.	2016-11-02 12:37:46 +00:00
paboyle	bb94ddd0eb	Tidy up of mpi3; also some cleaning of the dslash controls.	2016-11-02 08:07:09 +00:00
paboyle	791cb050c8	Comms improvements	2016-11-01 11:35:43 +00:00
paboyle	32bc7a6ab8	MPI back out of change that hangs AVX2 for clang, gcc needs the -mfma flag.	2016-08-05 10:36:00 +01:00
paboyle	ef97e32152	Adding persistent communicators	2016-07-08 17:16:08 +01:00
paboyle	a0676beeb1	Open up dependency on Eigen and FFTW	2016-07-07 22:31:07 +01:00
paboyle	aae8bf31a7	Global edit adding copyright and license info to every source file.	2016-01-02 14:51:32 +00:00
Peter Boyle	d1afebf71e	Sizable improvement in multigrid for unsquared. 6000 matmuls CG unprec 2000 matmuls CG prec (4000 eo muls) 1050 matmuls PGCR on 16^3 x 32 x 8 m=.01 Substantial effort on timing and logging infrastructure	2015-07-24 01:31:13 +09:00
Peter Boyle	93916f400d	Update Benchmark_comms.cc	2015-06-25 10:59:53 +01:00
Peter Boyle	84b5c7217d	CG test written and passes i.e. converges with small true residual in RedBlack MpcDagMpc, Unprec MdagM and Schur red black solver for each of. DomainWallFermion MobiusFermion MobiusZolotarevFermion ScaledShamirFermion ScaledShamirZolotarevFermion	2015-06-03 10:54:03 +01:00

40 Commits