Grid/lib/qcd/utils at dc814f30daf0c80571389e7a9b6938c31ae6c1e2 - Grid

mirror of https://github.com/paboyle/Grid.git synced 2026-06-18 18:03:44 +01:00

Files

T

Peter Boyle dc814f30da Binary IO file for generic Grid array parallel I/O.

Number of IO MPI tasks can be varied by selecting which
dimensions use parallel IO and which dimensions use Serial send to boss
I/O.

Thus can neck down from, say 1024 nodes = 4x4x8x8 to {1,8,32,64,128,256,1024} nodes
doing the I/O.

Interpolates nicely between ALL nodes write their data, a single boss per time-plane
in processor space [old UKQCD fortran code did this], and a single node doing all I/O.

Not sure I have the transfer sizes big enough and am not overly convinced fstream
is guaranteed to not give buffer inconsistencies unless I set streambuf size to zero.

Practically it has worked on 8 tasks, 2x1x2x2 writing /cloning NERSC configurations
on my MacOS + OpenMPI and Clang environment.

It is VERY easy to switch to pwrite at a later date, and also easy to send x-strips around from
each node in order to gather bigger chunks at the syscall level.

That would push us up to the circa 8x 18*4*8 == 4KB size write chunk, and by taking, say, x/y non
parallel we get to 16MB contiguous chunks written in multi 4KB transactions
per IOnode in 64^3 lattices for configuration I/O.

I suspect this is fine for system performance.

2015-08-26 13:40:29 +01:00

.dirstamp

Experimental support for ARM

2015-06-09 15:46:21 +09:00

CovariantCshift.h

Endif terminated

2015-06-05 10:19:42 +01:00

LinalgUtils.h

Gamma5 mult direct

2015-08-13 10:51:29 +01:00

SpaceTimeGrid.cc

Conjugate residual added

2015-06-05 18:16:25 +01:00

SpaceTimeGrid.h

Conjugate residual added

2015-06-05 18:16:25 +01:00

SUn.h

Two flavour HMC for Wilson/Wilson is conserving energy.

2015-07-29 17:53:39 +09:00

WilsonLoops.h

Binary IO file for generic Grid array parallel I/O.

2015-08-26 13:40:29 +01:00