portelli/Grid - Grid - DiRAC Tursa git server

mirror of https://github.com/paboyle/Grid.git synced 2026-07-20 00:53:27 +01:00

Author	SHA1	Message	Date
portelli	b4d2af8c89	threaded FFT	2016-10-26 19:46:36 +01:00
portelli	434af6aeaa	Merge branch 'develop' into feature/fft-opt	2016-10-26 18:50:38 +01:00
portelli	e90f8ac841	Merge branch 'develop' into feature/feynman-rules	2016-10-26 18:50:21 +01:00
portelli	a1705a8d53	debug message removed	2016-10-26 18:50:07 +01:00
portelli	ca21003f01	Merge branch 'feature/fft-opt' into feature/feynman-rules # Conflicts: # lib/FFT.h # lib/qcd/action/fermion/WilsonFermion5D.h # tests/core/Test_fft.cc	2016-10-26 18:44:47 +01:00
portelli	14ddf2c234	more FFT optimisations	2016-10-26 17:36:26 +01:00
Azusa Yamaguchi	bca861e112	Note:FFT shoud be GridFFT (Not change yet). Gauge fix with FFt is added (tests/core)	2016-10-25 14:21:48 +01:00
portelli	33d199a0ad	temporary thread safety in FFT	2016-10-25 12:56:40 +01:00
paboyle	b820076b91	Merge branch 'develop' into feature/mpi3	2016-10-25 06:02:33 +01:00
paboyle	09f66100d3	MPI 3 compile on non-linux	2016-10-25 06:01:12 +01:00
azusayamaguchi	d7d92af09d	Travis fail fix attempt	2016-10-25 01:45:53 +01:00
azusayamaguchi	460d0753a1	Merge branch 'develop' into feature/mpi3 Conflicts: lib/simd/Grid_avx512.h	2016-10-25 01:08:51 +01:00
azusayamaguchi	8f8058f8a5	More random bits on parallel seeding	2016-10-25 01:05:52 +01:00
azusayamaguchi	d97a27f483	Verbose	2016-10-25 01:05:31 +01:00
azusayamaguchi	7c3363b91e	Compiles all comms targets	2016-10-25 00:04:17 +01:00
azusayamaguchi	b94478fa51	mpi, mpi3, shmem all compile. mpi, mpi3 pass single node multi-rank	2016-10-24 23:45:31 +01:00
portelli	13bf0482e3	FFT optimisation	2016-10-24 19:25:40 +01:00
portelli	a795b5705e	memory optimisation	2016-10-24 19:25:15 +01:00
portelli	392e064513	fast local peek-poke	2016-10-24 19:24:21 +01:00
azusayamaguchi	b6a65059a2	Update to use shared memory to contain the stencil comms buffers Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions	2016-10-24 17:30:43 +01:00
azusayamaguchi	ea25a4d9ac	Works	2016-10-23 06:10:05 +01:00
azusayamaguchi	c190221fd3	Internal SHM comms in non-simd directions working Need to fix simd directions	2016-10-22 18:14:27 +01:00
azusayamaguchi	0fcd2e7188	Simplify the comms structure prior to implementing Shared memory direct bouncs	2016-10-21 22:44:10 +01:00
azusayamaguchi	910b8dd6a1	use simd type	2016-10-21 22:35:29 +01:00
azusayamaguchi	75ebd3a0d1	Typo fixes and rotate for CLANG	2016-10-21 22:34:29 +01:00
portelli	7c8f79b147	more stochastic QED fixes	2016-10-21 15:20:12 +01:00
azusayamaguchi	09fd5c43a7	Reasonably fast version	2016-10-21 15:17:39 +01:00
portelli	462921e549	QED: fix stochastic field	2016-10-21 14:41:08 +01:00
azusayamaguchi	f22317748f	Merge branch 'feature/mpi3' of https://github.com/paboyle/Grid into feature/mpi3	2016-10-21 13:36:35 +01:00
azusayamaguchi	6a9eae6b6b	Reporting improvements	2016-10-21 13:36:18 +01:00
azusayamaguchi	fad96cf250	StencilBufs	2016-10-21 13:36:00 +01:00
azusayamaguchi	f331809c27	Use variable type for loop	2016-10-21 13:35:37 +01:00
portelli	bd6a228af6	Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules	2016-10-21 13:10:30 +01:00
portelli	63d219498b	first (dirty) implementation of Feynman stoctachtic EM field	2016-10-21 13:10:13 +01:00
paboyle	2c54a53d0a	Compile verbose reduce	2016-10-21 12:12:14 +01:00
paboyle	306160ad9a	bcopy threaded	2016-10-21 12:07:28 +01:00
azusayamaguchi	20a091c3ed	Intel vs. Clang intrinsics differences absorbed	2016-10-21 09:08:36 +01:00
azusayamaguchi	202078eb1b	Cray / OpenSHMEM ordering differs	2016-10-21 09:07:20 +01:00
paboyle	a762b1fb71	MPI3 working with a bounce through shared memory on my laptop. Longer term plan: make the "u_comm_buf" in Stencil point to the shared region and avoid the send between ranks on same node.	2016-10-21 09:03:26 +01:00
paboyle	5b5925b8e5	Forgot to add	2016-10-20 17:09:40 +01:00
paboyle	b58adc6a4b	commVector	2016-10-20 17:00:15 +01:00
paboyle	f9d5e95d72	allocator template typedefs moved to AlignedAllocator	2016-10-20 16:59:39 +01:00
paboyle	4f8e636a43	commVector	2016-10-20 16:59:16 +01:00
paboyle	9b39f35ae6	commVector different for SHMEM compat	2016-10-20 16:58:53 +01:00
paboyle	5fe2b85cbd	MPI3 and shared memory support	2016-10-20 16:58:01 +01:00
paboyle	c7cccaaa69	Comm vector for shmem	2016-10-20 16:57:31 +01:00
paboyle	cbcfea466f	MPI3	2016-10-20 16:57:14 +01:00
paboyle	4955672fc3	MPI3	2016-10-20 16:57:00 +01:00
paboyle	8c043da5b7	SHMEM and comms allocator made different	2016-10-20 16:56:05 +01:00
paboyle	3cbe974eb4	Layout	2016-10-20 16:55:21 +01:00

... 4 5 6 7 8 ...