mirror of https://github.com/paboyle/Grid.git synced 2025-06-17 07:17:06 +01:00
Commit Graph

75 Commits

Author SHA1 Message Date
e4deea4b94 Weird bug appears with Vector<Vector<>>.
"fix" with std::vector<Vector<>>

Lies in the face table code. But think there is some latent problem.
Possibly in my allocator since it is caching, but could simplify or eliminate the caching
option and retest. One to look at later.
2018-09-11 04:36:57 +01:00
7bf82f5b37 Offload the face handling to GPU 2018-09-10 11:28:42 +01:00
704ca162c1 Offloadable compression 2018-09-10 11:20:50 +01:00
da17a015c7 Pack the stencil smaller for 128 bit access 2018-07-23 06:12:45 -04:00
e6be7416f4 Use managed memory 2018-06-13 20:14:00 +01:00
26863b6d95 User Managed memory 2018-06-13 20:13:42 +01:00
d4ce7d9905 GPU friendly Stencil needs a view 2018-03-19 07:11:21 -04:00
ad739f042a Introduce views for passing lattice indexing to accelerators. 2018-03-04 15:56:14 +00:00
bf5fb89aff Coordinate handling GPU friendly 2018-02-24 22:21:36 +00:00
51eb2c5dfc Make referencing the stencil and all info required to evaluate the kernel
accelerator marked up
2018-02-02 11:37:13 +00:00
79b50feacf fixme updates 2018-01-29 16:00:40 +00:00
70e276e1ab parallel_for elimination -> thread_loop 2018-01-28 01:01:14 +00:00
c4f82e072b _grid becomes private ; use Grid() 2018-01-27 00:04:12 +00:00
912b50f6fa Hiding lattice internals 2018-01-26 23:08:45 +00:00
063603b1ea Warning elimination 2018-01-24 13:12:14 +00:00
5e48b701ec Formatting 2018-01-13 00:11:53 +00:00
7f6bffe5ad NAMESPACE 2018-01-13 00:11:30 +00:00
0a68470f9a Simplify comms layers 2018-01-08 11:28:30 +00:00
5918769f97 Subtle Naik term bug updated in Stencil; less on logical && with a function call on right 2017-09-16 12:51:26 +01:00
c289699d9a updated from cambridge mpi3 shakeout 2017-08-25 11:41:01 +01:00
c3b1263e75 Benchmark prep 2017-08-25 09:25:54 +01:00
a446d95c33 Trying to pass TeamCity and Travis 2017-08-20 01:10:50 +01:00
7d88198387 Merge branch 'develop' into feature/multi-communicator 2017-08-19 13:03:35 -04:00
14d53e1c9e Threaded MPI calls patches 2017-07-29 13:08:10 -04:00
b5a6e4f1fd Best option for Xeon cache blocking set 2017-06-30 10:53:22 +01:00
38325ebbc6 Interleave code path; not enabled 2017-06-30 10:23:51 +01:00
54e94360ad Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit 2017-06-24 23:10:24 +01:00
35fa3d1dfd Merge branch 'master' into feature/scalar_adjointFT 2017-05-12 10:41:39 +01:00
2439999ec8 Warning elimination; drop to -O2 on G++ bad versions 2017-05-06 14:44:49 +01:00
741bc836f6 Exposing support for Ncolours and Ndimensions and JSON input file for the ScalarAction 2017-05-05 17:36:43 +01:00
99220f6531 Fixes and better timing 2017-04-26 17:24:11 -04:00
ab66bac4e6 Think I'm getting on top of the reduced cost exterior precomputed list of links 2017-04-25 08:50:26 +01:00
56277a11c8 Build a list of what's on the surface 2017-04-24 17:06:15 +01:00
3703b718aa Mark up a table if a given site only receives from itself; including MPI3 splitting info. 2017-04-22 19:28:37 +01:00
736bf3c866 Major rework of stencil. Half precision and MPI3 now working. 2017-04-22 11:33:50 +01:00
e1a2319d01 Simple compressor moved out of cshift into stencil 2017-04-20 13:18:15 +01:00
d2312e9874 Drop compressor entirely from Cshift to only Stencil. 2017-04-20 13:16:55 +01:00
fc4ab9ccd5 Working half precision comms 2017-04-20 11:20:26 +01:00
4a340aa5ca Massive compressor rework to support reduced precision comms 2017-04-20 09:28:27 +01:00
8ef4300412 spurious .dirstamp files removed 2017-04-10 17:00:22 +01:00
8c8473998d Average over whole cluster the comm time. 2017-03-21 22:29:51 -04:00
4e7ab3166f Refactoring header layout 2017-02-22 18:09:33 +00:00
aca7a3ef0a Optimisation control improvements 2017-02-10 18:22:31 -05:00
c56707e003 useless debug message removed 2016-12-07 08:59:20 +09:00
bb94ddd0eb Tidy up of mpi3; also some cleaning of the dslash controls. 2016-11-02 08:07:09 +00:00
680645f849 Merge branch 'release/v0.5.0' 2016-06-30 15:15:03 -07:00
6d58cb2a68 Enable reordering of the loops in the assembler for cache friendly.
This gets in the way of L2 prefetching however. Do next next link in stencil
prefetching.
2016-06-30 14:35:01 -07:00
5e02392f9c Fixed compilation error for benchmark_dwf
Some parts were assuming floating point precision
2016-06-20 12:30:51 +01:00
f78d89bcbe Update Lebesgue.cc
kill verbose
2016-06-03 13:33:42 +01:00
139cc5f1ae Large change with KNL preparation 2016-06-03 03:24:26 -07:00