portelli/Grid - Grid - DiRAC Tursa git server

mirror of https://github.com/paboyle/Grid.git synced 2025-08-17 03:31:54 +01:00

Author	SHA1	Message	Date
paboyle	0494feec98	Libz dependency	2017-06-13 12:00:23 +01:00
paboyle	a16b1e134e	gcc 4.9 fix	2017-06-13 10:48:43 +01:00
paboyle	eaac0044b5	Compile fixes	2017-06-12 00:20:49 +01:00
paboyle	56042f002c	New files	2017-06-11 23:19:20 +01:00
paboyle	3bfd1f13e6	I/O improvements	2017-06-11 23:14:10 +01:00
paboyle	092dcd4e04	MPI I/O only if MPI compiled	2017-06-02 22:50:25 +01:00
paboyle	094c3d091a	Improved and RNG's now survive checkpoint	2017-06-02 00:38:58 +01:00
Peter Boyle	1a1f6d55f9	Roll over to MPI IO for parallel IO	2017-06-01 17:37:26 -04:00
Peter Boyle	21421656ab	Big changes improving the code to use MPI IO	2017-06-01 17:36:53 -04:00
paboyle	1e429a0d57	Added MPI version	2017-05-30 23:41:07 +01:00
paboyle	d38a4de36c	Beginning move to MPI IO	2017-05-30 23:40:39 +01:00
paboyle	b8b5934193	Attempts to speed up the parallel IO	2017-05-25 13:32:24 +01:00
paboyle	a8c10b1933	Use a global-X x Local-Y chunksize for parallel binary I/O. Gives O(32 x 8 x 1888) chunk size on configuration I/O. At 150KB should be getting close to packet sizes and 4MB filesystem block sizes that are reasonably (!?) performant. We shall see once I move this off my laptop and over to BNL and time it.	2017-05-25 11:43:33 +01:00
Guido Cossu	74f451715f	Fix for Mac compilation on the size_t uint64_t types	2017-05-01 15:12:07 +01:00
Guido Cossu	8c540333d5	Merge branch 'develop' into feature/hmc_generalise	2017-04-05 14:41:04 +01:00
paboyle	7e5faa0f34	Multiple RNGs	2017-04-02 00:25:44 +09:00
paboyle	417ec56cca	Release candidate	2017-03-29 05:45:33 -04:00
paboyle	756bc25008	Verbose header print by default	2017-03-29 04:44:17 -04:00
paboyle	75112a632a	IO improvements to fail on IO error	2017-03-28 02:28:04 -04:00
Guido Cossu	a783282b8b	Merge branch 'develop' into feature/hmc_generalise	2016-11-10 18:13:07 +00:00
Guido Cossu	1d666771f9	Debugging the RNG, eliminate the barrier after broadcast	2016-10-26 16:08:23 +01:00
Guido Cossu	d50055cd96	Making the ILDG support optional	2016-10-26 09:48:01 +01:00
Guido Cossu	47c7159177	ILDG reader/writer works. Fill the xml header with the required information, todo.	2016-10-24 21:57:54 +01:00
Guido Cossu	f415db583a	Adding ILDG format	2016-10-24 15:48:22 +01:00
Guido Cossu	f55c16f984	Adding a barrier in the RNG save	2016-10-24 11:02:14 +01:00
Guido Cossu	df67e013ca	More debug output for the RNG	2016-10-22 13:34:17 +01:00
Guido Cossu	3e990c9d0a	Reverting the broadcast change	2016-10-22 13:26:43 +01:00
Guido Cossu	4b740fc8fd	Debugging the RNG state save	2016-10-22 13:06:00 +01:00
Guido Cossu	977d844394	Few modifications on stdout messages	2016-10-20 17:01:59 +01:00
Antonin Portelli	997fd882ff	Merge branch 'develop' into feature/feynman-rules # Conflicts: # lib/Threads.h # lib/qcd/action/fermion/WilsonFermion.cc # lib/qcd/action/fermion/WilsonFermion.h # lib/qcd/utils/SUn.h # lib/simd/Grid_avx.h # lib/simd/Intel512common.h	2016-10-19 18:35:18 +01:00
Guido Cossu	590675e2ca	Csum in hex format	2016-10-19 17:26:25 +01:00
Guido Cossu	8c65bdf6d3	Printing checksum for the RNG file	2016-10-19 16:56:11 +01:00
paboyle	a123dcd7e9	Static required for shmem. Reading same object twice requires csum reset	2016-10-12 00:29:57 +01:00
Guido Cossu	eda4dd622e	Some more edit	2016-10-11 15:45:20 +01:00
Guido Cossu	11b4c80b27	Added support for hmc and binary IO for a general field	2016-10-07 13:37:29 +01:00
Guido Cossu	c065e454c3	Adding Binary IO, untested	2016-10-06 10:12:11 +01:00
Guido Cossu	d9b5fbd374	In the middle of adding a general binary writer	2016-10-04 11:24:08 +01:00
Guido Cossu	f76f281e58	Cleaning files after fix	2016-09-09 11:34:25 +01:00
Antonin Portelli	64bf6fe54e	macro to dump NERSC header to a stream	2016-05-04 12:14:38 -07:00
paboyle	d4e57f4bc6	IO Bandwidth reporting	2016-03-16 02:30:16 -07:00
Peter Boyle	6aeaf6f568	Parallel IO worked on. I'm puzzled because I already thought I shook this out on MacOS + OpenMPI and then turned up problems on the BlueWaters Cray. Gets 75MB/s from home filesystem on parallel configuration read. Need to make the RNG IO parallel, and also to look at aggregating bigger writes for the parallel write. Not sure what the home filesystem is.	2016-02-21 08:03:21 -06:00
Peter Boyle	7f927a541c	Shmem related fixes for shmem compile	2016-02-11 07:37:39 -06:00
paboyle	aae8bf31a7	Global edit adding copyright and license info to every source file.	2016-01-02 14:51:32 +00:00
paboyle	5a80930dd2	Charge conjugation boundary conditions for gauge fields implemented as a policy class, changing the nature of covariant Cshifts used in plaquettes, rectangles and staples. As a result same code is used for the plaq and rect action independent of the BC type. Should probably isolate the BC in a separate class that Gimpl takes as a template param. Do the same with smearing policies. This would then allow composition of BC with smearing etc....	2016-01-02 13:37:25 +00:00
paboyle	31ca609d12	HMC checkpointing. Need a general HMC framework to work in restart.	2015-12-20 02:29:51 +00:00
paboyle	5710966324	Options to use mersenne twister OR ranlux48 via --enable-rng flag at configure time. Can save and restore RNG state via new (serial) I/O routines in a NERSC header style file. Store a Parallel (one per site) and a single serial RNG file.	2015-12-19 18:32:25 +00:00
Peter Boyle	96608c70d1	chrono causing some problems on Cray systems. Suspend use for now	2015-11-04 04:28:31 -06:00
Peter Boyle	d35d63b171	Algorithm in	2015-11-04 04:27:44 -06:00
Peter Boyle	aa52fdadcc	Global edit on HMC sector -- making GaugeField a template parameter and preparing to pass integrator, smearing, bc's as policy classes to hmc. Propose to unify "integrator" and integrator algorithm in a base/derived way to override step. Want to read through ForceGradient to ensure that abstraction covers the force gradient case.	2015-08-30 12:18:34 +01:00
Peter Boyle	76d752585b	Started a tidy up in the HMC sector. Now comfortable with the two level integrators; took a little to figure out what Guido had done & why -- but there is a neat saving of force evaluations across the nesting time boundary making use of linearity of the leapP in dt. I cleaned up the printing, reduced the volume of code, in the process sharing printing between all integrators. Placed an assert that the total integration time for all integrators must match at end of trajectory. Have now verified e-dH = 1 for nested integrators in Wilson/Wilson runs with both Omelyan and with Leapfrog so substantial confidence gained.	2015-08-29 17:18:43 +01:00

59 Commits