Peter Boyle
cb0d2a1b03
threaded rng init; I thought this was on
2018-03-24 19:28:17 -04:00
paboyle
0e6197fbed
Introduce accelerator friendly expression template rewrite.
...
Must obtain and access lattice indexing through a view object that is safe
to copy construct in copy to GPU (without copying the lattice).
2018-03-04 16:03:19 +00:00
paboyle
c1fc947bb8
Coordinate handling GPU friendly + some GPU merge/extract improvements
2018-02-24 22:26:10 +00:00
paboyle
c4f82e072b
_grid becomes private ; use Grid()§
2018-01-27 00:04:12 +00:00
paboyle
43cea62855
Hide internal data
2018-01-26 23:06:03 +00:00
paboyle
d70709a8e8
Thread construct changes
2018-01-24 13:39:06 +00:00
paboyle
7e00f643f8
Namespace indent
2018-01-14 23:55:44 +00:00
paboyle
9579c9c327
Threading improvement
2017-12-05 14:12:22 +00:00
paboyle
c24d4c8d0e
Improved parallel RNG init
2017-12-05 13:01:10 +00:00
paboyle
3e560b9462
Faster RNG init
2017-12-05 11:42:05 +00:00
paboyle
697c0603ce
SITMO I/O for NERSC working now bit repro
2017-05-05 16:54:44 +01:00
Guido Cossu
8c540333d5
Merge branch 'develop' into feature/hmc_generalise
2017-04-05 14:41:04 +01:00
paboyle
83f6fab8fa
Big/Small crush test, and fast SITMO rng init, faster but not ideal
...
MT and Ranlux init.
2017-04-02 12:10:51 +09:00
paboyle
9dc7ca4c3b
Sitmo fast init
2017-04-02 00:28:22 +09:00
paboyle
e099dcdae7
Merge branch 'develop' into feature/bgq-asm
2017-02-23 00:25:29 +00:00
paboyle
3ae92fa2e6
Global changes to parallel_for structure.
...
Move the comms flags to more sensible names
2017-02-21 05:24:27 -05:00
Guido Cossu
e0571c872b
Merge branch 'develop' into feature/hmc_generalise
2017-02-09 16:12:00 +00:00
paboyle
71ac2e7940
Faster RNG init
2017-02-07 01:33:23 -05:00
Guido Cossu
677757cfeb
Added and tested SITMO PRNG
2017-01-25 12:47:22 +00:00
Guido Cossu
851f2ad8ef
Adding fermions actions support in the factories
2017-01-19 10:00:02 +00:00
Guido Cossu
2bd4233919
Completed testing of the HMC for Ls vectorised version (on AVX2)
2016-12-07 04:56:37 +00:00
Guido Cossu
b812d5e39c
Added single threaded version of the derivative for the Ls vectorised DWF
2016-12-06 16:31:13 +00:00
Guido Cossu
a783282b8b
Merge branch 'develop' into feature/hmc_generalise
2016-11-10 18:13:07 +00:00
ca21003f01
Merge branch 'feature/fft-opt' into feature/feynman-rules
...
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
2016-10-26 18:44:47 +01:00
azusayamaguchi
8f8058f8a5
More random bits on parallel seeding
2016-10-25 01:05:52 +01:00
Guido Cossu
f55c16f984
Adding a barrier in the RNG save
2016-10-24 11:02:14 +01:00
Guido Cossu
3e990c9d0a
Reverting the broadcast change
2016-10-22 13:26:43 +01:00
Guido Cossu
590675e2ca
Csum in hex format
2016-10-19 17:26:25 +01:00
paboyle
f7ca6ca889
Bernoulli reenabled -- using integral type for the discrete_distribution, but
...
then casts in the fill
2016-04-30 03:48:28 -07:00
paboyle
ec4a9b7f6c
The Bernoulli gives a no compile due to a static assertion that the type be integral
...
in 4.7 random.h
Probably need to go through an Integer type, and then conver to real after the random draw
to make clean.
2016-04-30 03:42:24 -07:00
23b6172c31
Bernoulli RNG
2016-04-30 00:14:13 -07:00
paboyle
aae8bf31a7
Global edit adding copyright and license info to every source file.
2016-01-02 14:51:32 +00:00
paboyle
08edbb5cbe
HMC bit repro across checkpoints. Fixed parallel RNG issue with threading.
...
Conclusion: c++11 distributions not thread safe and must us distinct dist as well as distinct engine
per site. Makes sense when you think of box muller. Also added a reset of dist on fill to ensure
repro across checkpoints.
2015-12-22 08:54:40 +00:00
paboyle
5710966324
Options to use mersenne twister OR ranlux48 via --enable-rng flag at configure time.
...
Can save and restore RNG state via new (serial) I/O routines in a NERSC header style file.
Store a Parallel (one per site) and a single serial RNG file.
2015-12-19 18:32:25 +00:00
Peter Boyle
fdfe194c41
Threading bug in RNG fill fixed.
2015-08-19 14:41:05 +01:00
Peter Boyle
4e085dd0ed
Domain wall even-odd 2f HMC with wilson gauge and PV 2f ratio now running and giving small dH.
...
Azusa is working hard on the rectangle term and we'll hopefully start reproducing plaquettes
from RBC-UKQCD parameters soon !
My new laptop is pretty warm and is starting to groan ;)
2015-08-19 10:26:07 +01:00
Peter Boyle
d1afebf71e
Sizable improvement in multigrid for unsquared.
...
6000 matmuls CG unprec
2000 matmuls CG prec (4000 eo muls)
1050 matmuls PGCR on 16^3 x 32 x 8 m=.01
Substantial effort on timing and logging infrastructure
2015-07-24 01:31:13 +09:00
Peter Boyle
1d0df449e8
Reorganise of file naming
2015-06-03 12:47:05 +01:00