paboyle
ab1068044e
C++ emacs happy
2018-01-13 00:01:58 +00:00
paboyle
dda151250f
Emacs format
2018-01-12 23:59:58 +00:00
paboyle
18daf85069
Emacs format
2018-01-12 23:58:23 +00:00
paboyle
81cc28f6ca
Format
2018-01-12 23:57:22 +00:00
paboyle
c01a1e02fe
Namespace, format
2018-01-12 23:55:38 +00:00
paboyle
7e70f4ed9c
Format, NAMESPACE
2018-01-12 23:55:03 +00:00
paboyle
1056e36f11
Format, NAMESPACE
2018-01-12 23:49:46 +00:00
paboyle
0b8a88978b
Format, NAMESPACE
2018-01-12 23:47:24 +00:00
paboyle
59b31b6bb8
Format, NAMESPACE
2018-01-12 23:43:44 +00:00
paboyle
69496482fc
Format, NAMESPACE
2018-01-12 23:42:22 +00:00
paboyle
4be31ad1f6
C++ indentation
2018-01-12 23:39:49 +00:00
paboyle
176a021ce9
Formatting, NAMESPACE§
2018-01-12 23:38:15 +00:00
paboyle
b673174b71
FOrmat, NAMESPACE
2018-01-12 23:29:22 +00:00
paboyle
e6f7a5a818
Namespace
2018-01-12 23:28:01 +00:00
paboyle
68b69a2ac0
Namespace management
2018-01-12 23:26:14 +00:00
paboyle
bd15c38ae8
Formatting emacs compliant
2018-01-12 23:25:02 +00:00
paboyle
b815f5f764
Formatting
2018-01-12 23:23:21 +00:00
paboyle
4da437431e
Reformat
2018-01-12 23:22:46 +00:00
paboyle
3c7bf211a9
Reformat
2018-01-12 23:22:18 +00:00
paboyle
347d5404dd
format
2018-01-12 23:21:25 +00:00
paboyle
5e2cd0d07c
Format
2018-01-12 23:18:22 +00:00
paboyle
62fcee72c5
Format, NAMESPACE
2018-01-12 23:16:37 +00:00
paboyle
0a6168eef0
Format emacs style
2018-01-12 23:11:22 +00:00
paboyle
63865e4232
format
2018-01-12 23:10:48 +00:00
paboyle
c64deedf74
Format
2018-01-12 23:09:35 +00:00
paboyle
3281559ec3
Format
2018-01-12 23:09:01 +00:00
paboyle
6a2eca2ec2
NAMESAPCE
2018-01-12 23:00:03 +00:00
paboyle
d8ff895e74
NAMESPACE and format
2018-01-12 18:27:22 +00:00
paboyle
00c49d4c17
Format
2018-01-12 18:25:39 +00:00
paboyle
ec89714cce
NAMESPACE
2018-01-12 18:24:16 +00:00
paboyle
6ab744c720
NAMESPACE and formatting
2018-01-12 18:11:04 +00:00
paboyle
bbb657da5c
NAMESPACE and formatting
2018-01-12 18:10:11 +00:00
paboyle
fbc2380cb8
NAMESPACE & format
2018-01-12 18:05:36 +00:00
paboyle
08682c5461
NAMESPACE and format to my liking
2018-01-12 18:03:57 +00:00
paboyle
13bce2a6bf
NAMESPACE
2018-01-12 17:58:53 +00:00
paboyle
70e689900b
NAMESPACE
2018-01-12 17:58:13 +00:00
paboyle
e564d11687
Allow resize of the shared memory buffers
2018-01-08 15:20:26 +00:00
paboyle
0b2162f375
Clean up
2018-01-08 14:06:53 +00:00
paboyle
44f65526e0
Simplify communicators
2018-01-08 11:35:43 +00:00
paboyle
43e48542ab
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-01-08 11:34:45 +00:00
paboyle
0b85f1bfc8
Simplify the communicator proliferation: mpi and none.
2018-01-08 11:33:47 +00:00
paboyle
9947cfbf14
Simplify number of communicator cases
2018-01-08 11:33:01 +00:00
paboyle
357badce5e
Simplify communicator case proliferation
2018-01-08 11:32:16 +00:00
paboyle
0091eec23a
Simplify communicator cases
2018-01-08 11:31:32 +00:00
paboyle
9e9c2962df
Simplify comms layer proliferation
2018-01-08 11:30:22 +00:00
paboyle
b91282ad46
Simplify comms layer proliferation
2018-01-08 11:28:52 +00:00
paboyle
0a68470f9a
Simplify comms layers
2018-01-08 11:28:30 +00:00
paboyle
6ecf280723
Simplify comms layer proliferation
2018-01-08 11:28:04 +00:00
paboyle
7eeab7f995
Simplify comms layers
2018-01-08 11:27:43 +00:00
f3a77f4b7f
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-12-12 14:05:23 +00:00
2401360784
Merge pull request #138 from guelpers/feature/hadrons
...
bug fix in sequential insertion of conserved vector current
2017-12-11 18:53:41 +01:00
Vera Guelpers
2cfb50cbe5
bug fix in sequential insertion of conserved vector current
2017-12-08 11:13:39 +00:00
f9aa39e1c4
global memory debug through command line flag
2017-12-07 14:40:58 +01:00
e78794688a
memory profiler improvement
2017-12-06 16:50:25 +01:00
29e2eddea8
Merge branch 'develop' into feature/hadrons-new-memory-model
2017-12-06 16:49:21 +01:00
0a038ea15a
Merge branch 'develop' into feature/hadrons
2017-12-06 16:49:10 +01:00
62eb1f0e59
FermionOperator virtual destructor needed for polymorphism
2017-12-06 16:48:17 +01:00
paboyle
9579c9c327
Threading improvement
2017-12-05 14:12:22 +00:00
paboyle
c24d4c8d0e
Improved parallel RNG init
2017-12-05 13:01:10 +00:00
paboyle
a14038051f
Improved AllToAll asserts
2017-12-05 11:43:25 +00:00
paboyle
3e560b9462
Faster RNG init
2017-12-05 11:42:05 +00:00
paboyle
d93c6760ec
Faster code for split unsplit
2017-12-05 11:39:26 +00:00
paboyle
ae3b7713a9
Cold start doesnt need RNG
2017-12-05 11:36:31 +00:00
33cb509d4b
Merge branch 'develop' into feature/hadrons
2017-12-01 19:45:32 +00:00
456c78c233
Merge branch 'develop' into feature/hadrons-new-memory-model
2017-12-01 19:45:12 +00:00
2fd4989029
Merge branch 'develop' of github.com:paboyle/Grid into develop
2017-12-01 19:44:31 +00:00
2427a21428
minor serial IO fixes, XML now issues warning when trying to read absent nodes, these becomes
2017-12-01 19:44:07 +00:00
paboyle
28ceacec45
Split/Unsplit working
2017-11-27 15:13:29 +00:00
paboyle
4987edbd44
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-11-27 12:34:56 +00:00
paboyle
1f04e56038
Believe split/unsplit works, but need to make pretty
2017-11-27 12:33:08 +00:00
paboyle
4bfc8c85c3
Clean up verbose communicator create
2017-11-27 12:32:37 +00:00
azusayamaguchi
e55397bc13
Staggerd cg
2017-11-24 14:18:30 +00:00
paboyle
94b8fb5686
Debug in progress
2017-11-19 01:39:04 +00:00
fionnoh
1d7ccc6b2c
Declaring virtual functions as pure virtual functions.
2017-11-09 19:46:57 +00:00
Azusa Yamaguchi
1860b1698c
Fixed the bag on MPI_T at Cam
2017-11-08 09:03:01 +00:00
Azusa Yamaguchi
9b8d1cc3da
Staggered Schur decomposed matrix norm changed to not be the Schur anymore :(
...
Carleton wanted this for multimass / multishift
2017-11-07 14:48:45 +00:00
Guido Cossu
149c3f9e9c
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-11-07 14:01:13 +00:00
Guido Cossu
c519aab19d
Fixing the MPI memory leak in the communicators
2017-11-07 13:55:37 +00:00
paboyle
69929f20bb
Destructor fix. Split Grid and MPI3 will not yet work without more effort from me.
2017-11-06 23:45:00 +00:00
paboyle
360efd0088
Improved treatment of reverse asked for by chris.
...
Truncate the basis.
Power method renormalises
2017-11-02 22:05:31 +00:00
a4e5fd1000
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-11-01 19:24:51 +00:00
682e7d7839
Merge branch 'develop' into feature/hadrons
2017-11-01 19:24:38 +00:00
paboyle
27ea2afe86
No compile on comms == none fix
2017-10-30 01:14:11 +00:00
paboyle
a7f72eb994
SHaking out
2017-10-30 00:22:06 +00:00
paboyle
501fa1614a
Communicator updates for split grid
2017-10-30 00:16:12 +00:00
paboyle
fe4d9b003c
More digits
2017-10-30 00:04:47 +00:00
paboyle
4a699b4da3
New rank can be found out
2017-10-30 00:04:14 +00:00
paboyle
689323f4ee
Reverse dim ordering lexico support
2017-10-30 00:03:15 +00:00
paboyle
84b441800f
Merge branch 'develop' into feature/lanczos-reorg
2017-10-27 14:21:38 +01:00
paboyle
1ef424b139
Split grid Y2K bug fix attempt
2017-10-27 14:20:35 +01:00
paboyle
f96c800d25
Passes reload of coarse basis
2017-10-27 09:43:22 +01:00
paboyle
32a52d7583
Move the local coherence lanczos into algorithms.
...
Keep the I/O in the tester. Other people can copy this method to write other I/O formats.
2017-10-27 09:04:31 +01:00
paboyle
7fab183c0e
Better read test
2017-10-27 08:17:49 +01:00
paboyle
9ec9850bdb
64bit ftello update
2017-10-26 23:34:31 +01:00
paboyle
0c4ddaea0b
Cleaning up
2017-10-26 23:31:46 +01:00
paboyle
00ebc150ad
Mistake in string parse; interface is ambiguous and must fix. Is char * a file, or a XML buffer ?
2017-10-26 23:30:37 +01:00
paboyle
0f3e9ae57d
Gsites error. Only appeared (so far) in I/O code for even odd fields
2017-10-26 23:29:59 +01:00
Azusa Yamaguchi
034de160bf
Staggered updates : Schur fixed and added a unit test for Test_staggered_cg_schur.cc giving stronger check
2017-10-26 20:58:46 +01:00
paboyle
31f99574fa
Moving these out of algorithms
2017-10-26 07:47:42 +01:00
paboyle
a34c8a2961
Update to IRL; getting close to the structure I would like.
2017-10-26 07:45:56 +01:00
paboyle
e9be293444
Better messaging
2017-10-26 01:59:30 +01:00
paboyle
08583afaff
Red black friendly coarsening
2017-10-25 23:51:18 +01:00
paboyle
b395a312af
Better error messaging
2017-10-25 23:50:37 +01:00
paboyle
66295b99aa
Bit less verbose SciDAC IO
2017-10-25 23:50:05 +01:00
paboyle
b8654be0ef
64 bit safe offsets
2017-10-25 23:49:23 +01:00
paboyle
f6c3f6bf2d
XML serialisation of parms and initialise from parms object
2017-10-25 23:47:59 +01:00
paboyle
d83868fdbb
Identity linear op added -- useful in circumstances where a linear op may or may not be needed.
...
Supply a trivial one if not needed
2017-10-25 23:47:10 +01:00
paboyle
303e0b927d
Improvements for coarse grid compressed lanczos
2017-10-25 23:46:33 +01:00
paboyle
28ba8a0f48
Force spacing more nicely
2017-10-25 23:45:57 +01:00
Guido Cossu
8a3aae98f6
Solving minor bug in compilation
2017-10-25 10:34:49 +01:00
Guido Cossu
8309f2364b
Solving again the MPI comm bug with FFTs
2017-10-25 10:24:14 +01:00
Guido Cossu
27936900e6
Putting the FG verbosity in the Integrator level
2017-10-18 13:08:09 +01:00
paboyle
e325929851
ALl codes compile against the new Lanczos call signature
2017-10-13 14:02:43 +01:00
paboyle
47af3565f4
Logging improvement; reunified the Lanczos codes
2017-10-13 13:23:07 +01:00
paboyle
4b4d187935
Reunified the Lanczos implementations
2017-10-13 13:22:44 +01:00
paboyle
9aff354ab5
Final version prior to reunification
2017-10-13 13:22:26 +01:00
paboyle
cb9ff20249
Approx tests and lanczos improvement
2017-10-13 11:30:50 +01:00
paboyle
9fe6ac71ea
Starting reorg of Blocked lanczos
2017-10-11 10:12:07 +01:00
paboyle
bf58557fb1
Block compressed Lanczos
2017-10-10 14:15:11 +01:00
paboyle
a1d80282ec
cb factorise
2017-10-10 13:49:31 +01:00
paboyle
4eb8bbbebe
Christop mods
2017-10-10 13:48:51 +01:00
paboyle
d1c6288c5f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-10 13:38:40 +01:00
Azusa Yamaguchi
bb7378cfc3
Schur for staggered
2017-10-10 12:02:18 +01:00
Azusa Yamaguchi
f0e084a88c
Schur staggered
2017-10-10 10:00:43 +01:00
paboyle
08ca338875
Split grid communication
2017-10-09 23:19:45 +01:00
paboyle
f7cbf82c04
Better stdout/err debug
2017-10-09 23:18:48 +01:00
paboyle
07009c569a
Comms splitting improvements
2017-10-09 23:16:51 +01:00
63b2bc1936
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/fermion/FermionOperatorImpl.h
2017-10-05 14:16:23 +01:00
paboyle
d54807b8c0
MPIT works with split grid now
2017-10-02 23:14:56 +01:00
paboyle
5625b47c7d
Merge branch 'feature/dwf-multirhs' into develop
2017-10-02 12:42:32 +01:00
paboyle
1edcf902b7
Macos ANON
2017-10-02 12:41:02 +01:00
paboyle
4f8b6f26b4
Merge branch 'develop' into feature/dwf-multirhs
2017-10-02 11:41:49 +01:00
Guido Cossu
f7072d1ac2
Solving an annoying compilation error in json
2017-10-02 07:13:40 +01:00
df21668f2c
memory profiler update
2017-09-22 14:21:18 +01:00
paboyle
fddeb29d6b
Bug fix with spreadout FFT
2017-09-21 11:10:08 +01:00
paboyle
a9ec5cf564
Christoph bug report integrate
2017-09-21 10:32:41 +01:00
Peter Boyle
946a8671b9
Merge pull request #129 from djm2131/feature/eofa
...
Add support for DWF with the exact one flavor algorithm
2017-09-21 10:15:21 +01:00
Peter Boyle
771a1b8e79
Merge pull request #128 from paboyle/feature/CG-reliable-update
...
Feature/cg reliable update
2017-09-21 10:12:03 +01:00
Peter Boyle
bfb68e6f02
Merge pull request #130 from giltirn/gparity-handunroll
...
Gparity handunroll
2017-09-21 10:11:00 +01:00
paboyle
5918769f97
Subtle Naik term bug updated in Stencil; less on logical && with a function call on right
2017-09-16 12:51:26 +01:00
Guido Cossu
1950ac9294
Fixed the Intel compiler problem with the JSON classes
2017-09-08 15:18:59 +01:00
Guido Cossu
13fa70ac1a
Merge branch 'develop' into feature/json-fix
2017-09-08 13:42:20 +01:00
Guido Cossu
7cb2b11f26
Fixing Intel compiler error for the JSON parser
2017-09-08 13:41:53 +01:00
Guido Cossu
1184ed29ae
Merge pull request #124 from nmeyer-ur/feature/arm-neon
...
Added integer reduce functionality
2017-09-08 10:54:35 +02:00
paboyle
203c7bf6fa
Merge branch 'hotfix/dirac-ITT-fix' into develop
2017-09-05 15:08:51 +01:00
paboyle
aed5de4d50
Patching macos compile
2017-09-05 15:07:07 +01:00
paboyle
ba27cc6571
Mac os happiness
2017-09-05 15:00:16 +01:00
paboyle
d856327250
Merge branch 'release/dirac-ITT' into develop
2017-09-05 14:56:12 +01:00
Peter Boyle
bf973d0d56
SHM complete
2017-09-05 14:30:29 +01:00
Peter Boyle
837bf8a5be
Updating to control the SHM allocation scheme under configure time options
2017-09-05 12:51:02 +01:00
Peter Boyle
c05b2199f6
Improvements to huge memory
2017-09-04 10:41:21 -04:00
Azusa Yamaguchi
a5fe07c077
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-09-04 14:10:15 +01:00
Azusa Yamaguchi
b83b2b1415
Stability improvement to BCG. Force m_rr hermitian beyond rounding.
2017-09-04 14:09:47 +01:00
paboyle
7359df3501
Full reporting for benchmark; save robustness factor
2017-08-31 10:42:35 +01:00
Christopher Kelly
59bd1fe21b
Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel
2017-08-29 13:07:37 -07:00
a56e3b40c4
Merge branch 'develop' into feature/hadrons
2017-08-29 11:03:53 -06:00
Nils Meyer
4e907fef2c
Merge remote-tracking branch 'grid/develop' into feature/arm-neon
2017-08-29 17:47:36 +02:00
Christopher Kelly
74af885d4e
Removed some no-longer-needed associated with G-parity hand unrolled kernel
2017-08-29 09:50:37 -04:00
paboyle
4b4c2a715b
fcntl.h needed
2017-08-26 11:38:04 +01:00
paboyle
54a5e6c1d0
Check if we get huge pages on linux. Larry Meadows piece of magic.
2017-08-25 22:36:08 +01:00
paboyle
80c5bce5bb
Merge branch 'develop' into feature/multi-communicator
2017-08-25 20:21:26 +01:00
paboyle
f68b5de9c8
No compile fix on Clang
2017-08-25 19:35:21 +01:00
Christopher Kelly
f365a83fae
In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions
2017-08-25 14:24:11 -04:00
Peter Boyle
c289699d9a
updated from cambridge mpi3 shakeout
2017-08-25 11:41:01 +01:00
Peter Boyle
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
Christopher Kelly
34a9aeb331
Reduced number of if-statement evaluations in G-parity unrolled kernel
2017-08-24 13:53:50 -07:00
21b02760c3
Merge branch 'develop' into feature/hadrons
2017-08-24 17:05:45 +01:00
paboyle
5fa386ddc9
FFT test compile fixed
2017-08-24 10:17:52 +01:00
Christopher Kelly
ce5df177ee
Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl
2017-08-23 15:05:22 -04:00
Christopher Kelly
a0bb8e5b46
Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity
2017-08-23 14:44:40 -04:00
Christopher Kelly
46f88e6d72
G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary
2017-08-23 13:21:10 -04:00
David Murphy
dd8f1ea189
Vectorized Mobius EOFA Dperp + shift operation
2017-08-23 13:17:26 -04:00
Christopher Kelly
b61835c1a5
Added inplace version of intrinsic G-parity twist to hand-unrolled kernel
2017-08-23 12:33:48 -04:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
David Murphy
459f70e8d4
Check-in of working Mobius EOFA class and tests
2017-08-22 22:38:30 -04:00
Christopher Kelly
061e48fd73
Replaced slow unpack-repack in G-parity BC twist with intrinsics version
2017-08-22 18:12:12 -04:00
Christopher Kelly
ab50145001
Implemented first, unoptimized version of hand-unrolled G-parity kernels
...
Improved Test_gparity
2017-08-22 17:12:25 -04:00
paboyle
b49bec0cec
MAP_HUGETLB portability fix
2017-08-20 03:08:54 +01:00
paboyle
1cdf999668
Moving multicommunicator into mpi3 also for threading
2017-08-20 02:39:10 +01:00
paboyle
11062fb686
Comms none fail fix
2017-08-20 01:37:07 +01:00
paboyle
a446d95c33
Trying to pass TeamCity and Travis
2017-08-20 01:10:50 +01:00
paboyle
be66e7dd95
Merge branch 'develop' into feature/multi-communicator
2017-08-19 23:12:38 +01:00
Peter Boyle
0b0cf62193
Fix mpi 3 interface change
2017-08-19 13:18:50 -04:00
Peter Boyle
7d88198387
Merge branch 'develop' into feature/multi-communicator
2017-08-19 13:03:35 -04:00
Peter Boyle
2f619482b8
Enable blocking stencil send
2017-08-19 12:53:59 -04:00
Peter Boyle
d6472eda8d
Use mmap
2017-08-19 12:53:18 -04:00
Peter Boyle
bcefdd7c4e
Align both allocator calls to 2MB
2017-08-19 12:49:02 -04:00
David Murphy
9d45fca8bc
Implement MobiusEOFAFermioncache.cc
2017-08-17 23:45:36 -04:00
David Murphy
ac9e6b63c0
More re-import of Mobius EOFA
2017-08-17 19:28:53 -04:00
David Murphy
e140b3f802
Beginning to re-import Mobius EOFA
2017-08-16 23:36:23 -04:00
David Murphy
d9d3d30cc7
Minor clean-up
2017-08-16 20:57:51 -04:00
David Murphy
47a12ec7b5
Implement EOFA pseudofermion force and Shamir tests for G-parity and non G-parity cases
2017-08-16 19:50:08 -04:00
David Murphy
ec1e2f7a40
Add (mostly implemented) ExactOneFlavourRatio pseudofermion class and tests of Shamir heatbath and action
2017-08-16 12:38:59 -04:00
David Murphy
41f73ec083
Add ChronoForecast class for forecasting solutions across poles in the EOFA heatbath
2017-08-16 12:37:38 -04:00
Guido Cossu
fd367d8bfd
Debugging the PointerCache
2017-08-16 09:42:57 +01:00
David Murphy
6d0786ff9d
Typo fixes and check-in of G-parity action test for DWF
2017-08-15 22:47:00 -04:00
David Murphy
b7f93aeb4d
Change CayleyFermion5D::SetCoefficientsInternal to virtual to allow overriding in derived EOFA classes
2017-08-15 14:18:51 -04:00
David Murphy
202a7fe900
Re-import DWF and abstract base EOFA fermion classes and tests
2017-08-15 13:36:08 -04:00
Guido Cossu
8a3fe60a27
Added more asserts at grid creation time
2017-08-08 11:36:20 +01:00
Guido Cossu
44051aecd1
Checking for integer divisions in cartesian full
2017-08-08 10:31:12 +01:00