Guido Cossu
8a3fe60a27
Added more asserts at grid creation time
2017-08-08 11:36:20 +01:00
Guido Cossu
44051aecd1
Checking for integer divisions in cartesian full
2017-08-08 10:31:12 +01:00
Guido Cossu
06e6f8de00
Check that the reduced dim is an integer
2017-08-08 10:22:12 +01:00
Chulwoo Jung
caa6605b43
Still tweaking memory saving routines in Lanczos
2017-08-07 00:01:04 -04:00
Chulwoo Jung
522c9248ae
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-08-06 23:58:21 -04:00
Guido Cossu
4fe182e5a7
Added high level HMC support for overriding default SIMD lane decomposition
2017-08-06 10:46:19 +01:00
Guido Cossu
175f393f9d
Binary IO error checking
2017-08-04 12:14:10 +01:00
Christopher Kelly
7d867a8134
Merge branch 'develop' into feature/CG-reliable-update
2017-08-02 09:48:04 -04:00
Christopher Kelly
9939b267d2
Added switching to fallback linear operator in reliable update CG, and added recalculation of b parameter on update.
2017-07-31 13:39:44 -04:00
Peter Boyle
14d53e1c9e
Threaded MPI calls patches
2017-07-29 13:08:10 -04:00
Chulwoo Jung
191fbf85fc
Added ImplicitlyRestartedLanczosCJ to Algorithms.h
2017-07-28 15:33:59 -04:00
Guido Cossu
8bd869da37
Correcting a bug in the IO routines
2017-07-27 15:12:50 +01:00
Guido Cossu
c0485d799d
Explicit parameter declaration in the WilsonGauge test
2017-07-26 16:26:04 +01:00
Guido Cossu
7abc5613bd
Added smearing to the topological charge observable
2017-07-26 16:21:17 +01:00
Guido Cossu
a4b7dddb67
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-07-26 12:07:38 +01:00
Guido Cossu
5696781862
Debug error in Tensor mult
2017-07-26 12:07:34 +01:00
Christopher Kelly
9b6cde173f
Merge branch 'feature/CG-reliable-update' into ckelly_develop
2017-07-25 11:51:08 -04:00
Christopher Kelly
9f280b82c4
Added mixed-precision CG with reliable updates
2017-07-25 11:30:41 -04:00
Chulwoo Jung
93650f3a61
Adding back (temporarily) dense matrix routines until Lanczos is finalized
2017-07-24 21:49:25 -04:00
Chulwoo Jung
cab4b4d063
Deleting old include file references
2017-07-24 20:51:31 -04:00
Chulwoo Jung
cf4b30b2dd
re-adding ImplicitlyRestartedLanczos
2017-07-24 20:40:25 -04:00
Chulwoo Jung
c51d0b4078
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-07-24 20:35:29 -04:00
Nils Meyer
7a53dc3715
Added integer reduce functionality
2017-07-24 11:12:59 +02:00
Christopher Kelly
0f214ad427
Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header
2017-07-21 11:13:51 -04:00
Guido Cossu
9fa07eecde
Merge branch 'develop' into feature/json-fix
2017-07-12 15:47:22 +01:00
azusayamaguchi
659d7d1a40
For test/solver
Fixed
2017-07-12 15:01:48 +01:00
Guido Cossu
f64fb7bd77
Fix gcc error on JSON compilation
2017-07-12 14:55:42 +01:00
Guido Cossu
2a35449b91
Merge branch 'develop' into feature/json-fix
2017-07-12 14:47:00 +01:00
Guido Cossu
184af5bd05
Added support for std::pair in the JSON serialiser
2017-07-12 14:44:53 +01:00
Guido Cossu
097c9637ee
Fixed the JSON parsing error
2017-07-11 14:31:57 +01:00
azusayamaguchi
dc6f078246
fixed the header file for mpi3
2017-07-11 14:15:08 +01:00
Peter Boyle
40e119c61c
NUMA improvements worth preserving from AMD EPYC tests
2017-07-08 22:27:11 -04:00
Guido Cossu
d9593c4b81
Merge branch 'develop' into feature/json-fix
2017-07-07 14:17:50 +01:00
paboyle
75dc7794b9
Working on Cori
2017-07-02 16:47:42 -07:00
paboyle
dee68fc728
IO working on multiple nodes again. Strategy of all nodes writing metadata is unsafe.
Only one rank should do this, so that rank must be identified. This means passing the communicator to the
objects.
2017-07-02 23:33:48 +01:00
paboyle
57002924bc
NERSC shakeout of this
2017-07-02 14:58:30 -07:00
Peter Boyle
a0be3f7330
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-30 10:53:50 +01:00
Peter Boyle
b5a6e4f1fd
Best option for Xeon cache blocking set
2017-06-30 10:53:22 +01:00
Peter Boyle
7a788db3dc
Guard first touch
2017-06-30 10:49:08 +01:00
Peter Boyle
f20eceb6cd
First touch once per page in a threaded loop
2017-06-30 10:48:27 +01:00
Peter Boyle
38325ebbc6
Interleave code path; not enabled
2017-06-30 10:23:51 +01:00
Peter Boyle
ac1f1838bc
KNL only
2017-06-30 10:15:32 +01:00
Guido Cossu
8859a151cc
Small corrections to the NEON port
2017-06-29 11:30:29 +01:00
Guido Cossu
688a39cfd9
Merge pull request #114 from nmeyer-ur/feature/arm-neon
ARM neon intrinsics support
Guido: checked and approved
2017-06-29 09:57:17 +01:00
Nils Meyer
0933aeefd4
corrected Grid_neon.h
2017-06-28 20:22:22 +02:00
07de925127
minor scalar action fixes
2017-06-28 12:45:44 +01:00
Nils Meyer
a9c816a268
moved file to correct folder
2017-06-27 21:39:15 +02:00
Nils Meyer
bf729766dd
removed collision with QPX implementation
2017-06-27 20:32:24 +02:00
0b707b861c
Merge branch 'develop' into feature/scalar-hmc-update
2017-06-27 14:40:05 +01:00
15e87a4607
HDF5 IO fix
2017-06-27 14:39:27 +01:00
7d7220cbd7
scalar: lambda/4! convention
2017-06-27 14:38:45 +01:00
paboyle
54e94360ad
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
2017-06-24 23:10:24 +01:00
0af740dc15
minor scalar HMC code improvement
2017-06-24 23:04:05 +01:00
d2e8372df3
SU(N) algebra fix (was not working)
2017-06-24 23:03:39 +01:00
paboyle
869b99ec1e
Threaded calls to multiple communicators
2017-06-24 10:55:54 +01:00
paboyle
349d75e483
Precision fix
2017-06-23 02:57:59 -07:00
Lanny91
56abbdf4c2
AVX512 integer reduce fix (for non-intel compiler)
2017-06-23 11:09:14 +02:00
Lanny91
af71c63f4c
AVX2 fix
2017-06-23 11:03:12 +02:00
paboyle
1feddf4ba6
const fixes
2017-06-22 19:32:41 +01:00
paboyle
e504260f3d
Able to run a test job splitting into multiple MPI subdomains.
2017-06-22 18:53:11 +01:00
Lanny91
0440d4ce66
Merge branch 'develop' of https://github.com/paboyle/Grid into hotfix/bgq
2017-06-22 17:09:42 +02:00
paboyle
5e4bea8f20
Benchmark DWF works
2017-06-22 08:38:54 +01:00
paboyle
6ebf9f15b7
Splitting communicators first cut
2017-06-22 08:14:34 +01:00
paboyle
b9104f3072
Block CG
2017-06-21 21:08:03 +01:00
b22eab8c8b
Merge commit 'a7d56523abee6c9030fdd9303c79954897b1086f' into feature/hadrons
2017-06-21 18:32:48 +01:00
paboyle
e8b95bd35b
Clean up finished. Could shrink Lanczos to around 400 lines at a push
2017-06-21 02:50:09 +01:00
paboyle
7e35286860
Simplified Lanczos, added Eigen diagonalisation.
Curious if we can deprecate the dependency on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle
0486ff8e79
Improved the Lanczos
2017-06-20 18:46:01 +01:00
1e8a2e1621
various compatibility fixes after merge
2017-06-20 17:24:55 +01:00
7587df831a
Merge branch 'develop' into feature/hadrons
# Conflicts:
# lib/qcd/action/scalar/ScalarImpl.h
2017-06-20 15:50:39 +01:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
0a8faac271
Fix make tests compile
2017-06-19 22:54:18 +01:00
Azusa Yamaguchi
abc4de0fd2
No compile make tests fix
2017-06-19 22:03:03 +01:00
284ee194b1
JSON update
2017-06-19 14:38:15 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
Azusa Yamaguchi
3fa5e3109f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-19 14:01:44 +01:00
paboyle
8b7049f737
Improved detection of usqcdInfo for plaq/linktr
2017-06-19 08:46:07 +01:00
paboyle
c85024683e
Merge branch 'feature/parallelio' into develop
2017-06-19 01:39:48 +01:00
paboyle
1300b0b04b
Update to enable multiple records per file more consistent with SciDAC.
open, close, write records...
2017-06-19 01:01:48 +01:00
paboyle
1d18d95d4f
Class name return
2017-06-18 00:13:03 +01:00
paboyle
ae39ec85a3
ComplexField defined
2017-06-18 00:12:48 +01:00
paboyle
b96daf53a0
Query tensor structures
2017-06-18 00:12:15 +01:00
paboyle
46879e1658
Complex defined in Impl even for gauge.
2017-06-18 00:11:45 +01:00
paboyle
ae4de94798
SciDAC I/O support
2017-06-18 00:11:23 +01:00
paboyle
0ab555b4f5
SciDAC I/O and ILDG improvements
2017-06-18 00:11:02 +01:00
paboyle
8e9be9f84f
Updates for SciDAC IO
2017-06-18 00:10:42 +01:00
paboyle
d572170170
Update for SciDAC
2017-06-18 00:10:20 +01:00
81b18f843a
Merge branch 'feature/scalar_adjointFT' into feature/hadrons
# Conflicts:
# lib/qcd/action/scalar/ScalarImpl.h
2017-06-16 17:59:55 +01:00
Lanny91
a833f88c32
Added missing SIMD integer reduction implementation for AVX, AVX-512, SSE4, IMCI
2017-06-16 15:58:47 +01:00
Lanny91
07b2c1b253
Placeholder precision change functions to allow Grid to compile with QPX (warning: no actual functionality)
2017-06-16 15:04:26 +01:00
Lanny91
735cbdb983
QPX Integer reduction (+ integer reduction test)
2017-06-14 10:55:10 +01:00
Lanny91
2ad54c5a02
QPX exchange support
2017-06-14 10:53:39 +01:00
Nils Meyer
3d04dc33c6
ARM neon intrinsics support
2017-06-13 13:26:59 +02:00
paboyle
91199a8ea0
openmpi is not const safe
2017-06-13 12:21:29 +01:00
paboyle
0494feec98
Libz dependency
2017-06-13 12:00:23 +01:00
paboyle
a16b1e134e
gcc 4.9 fix
2017-06-13 10:48:43 +01:00
Chulwoo Jung
2f4cbeb4d5
Minor changes
2017-06-12 18:25:18 -04:00
paboyle
769ad578f5
Odd new error on G++ 49 on travis
2017-06-12 00:41:21 +01:00
paboyle
eaac0044b5
Compile fixes
2017-06-12 00:20:49 +01:00
paboyle
56042f002c
New files
2017-06-11 23:19:20 +01:00
paboyle
3bfd1f13e6
I/O improvements
2017-06-11 23:14:10 +01:00
Azusa Yamaguchi
70ab598c96
Move gfix into utils
2017-06-08 22:22:23 +01:00
Azusa Yamaguchi
1d0ca65e28
Move Gfix into utils
2017-06-08 22:21:50 +01:00
Chulwoo Jung
fb7c4fb815
Recovering lapack interface without array allocation
2017-06-07 00:00:59 -04:00
Chulwoo Jung
00bb71e5af
Checking in before reworking lapack interface
2017-06-06 16:26:41 -04:00
f6aa82b7f2
Merge branch 'develop' into feature/hadrons
2017-06-06 11:46:33 -05:00
Chulwoo Jung
cfed2c1ea0
Broken Lanczos. Going back to an older version temporarily.
2017-06-06 12:14:45 -04:00
Chulwoo Jung
b1b15f0b70
Further fixes from multidimensional array
2017-06-05 23:13:41 -04:00
0503c028be
Merge branch 'feature/qed-fvol' into feature/hadrons (non-trivial conflicts on scalar Impl)
# Conflicts:
# configure.ac
# lib/qcd/action/scalar/Scalar.h
2017-06-05 16:37:47 -05:00
paboyle
092dcd4e04
MPI I/O only if MPI compiled
2017-06-02 22:50:25 +01:00
Guido Cossu
7da4856e8e
Wilson flow with adaptive steps
2017-06-02 16:55:53 +01:00
Guido Cossu
aaf1e33a77
Adding adaptive integration in the WilsonFlow
2017-06-02 16:32:35 +01:00
paboyle
094c3d091a
Improved, and RNGs now survive checkpoint
2017-06-02 00:38:58 +01:00
Peter Boyle
1a1f6d55f9
Roll over to MPI IO for parallel IO
2017-06-01 17:37:26 -04:00
Peter Boyle
21421656ab
Big changes improving the code to use MPI IO
2017-06-01 17:36:53 -04:00
Peter Boyle
6f687a67cd
As local vols increase, use 64 bits for safety
2017-06-01 17:36:18 -04:00
paboyle
1e429a0d57
Added MPI version
2017-05-30 23:41:07 +01:00
paboyle
d38a4de36c
Beginning move to MPI IO
2017-05-30 23:40:39 +01:00
paboyle
53a9aeb965
Cosmetic only
2017-05-30 23:39:53 +01:00
paboyle
e30fa9f4b8
RankCount; need to clean up ambiguous ProcessCount
2017-05-30 23:39:16 +01:00
paboyle
58e8d0a10d
reverse direction lexico mapping
2017-05-30 23:38:30 +01:00
paboyle
62cf9cf638
Cleaner code
2017-05-30 23:38:02 +01:00
Guido Cossu
7c6cc85df6
Updating WilsonFlow test
2017-05-27 18:03:49 +01:00
Guido Cossu
0de314870d
Faster derivative for WilsonGauge
2017-05-26 14:31:49 +01:00
Guido Cossu
f4e8bf2858
Fixing the topological charge. Wilson Flow tested, ok
2017-05-26 12:45:59 +01:00
Chulwoo Jung
927c7ae3ed
changed allocation for LAPACK temporaries, to avoid crashing with some compilers (reported by Christoph)
2017-05-25 21:43:53 -04:00
Chulwoo Jung
05d04ceff8
Adding SimpleLanczos
2017-05-25 12:30:47 -04:00
Chulwoo Jung
8313367a50
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-05-25 12:30:06 -04:00
paboyle
b8b5934193
Attempts to speed up the parallel IO
2017-05-25 13:32:24 +01:00
Guido Cossu
75856f2945
Compilation fix in the Tensor_exp
2017-05-25 12:44:56 +01:00
Guido Cossu
3c112a7a25
Small correction to the general exp definition
2017-05-25 12:09:00 +01:00
Guido Cossu
ab3596d4d3
Using Cayley-Hamilton form for the exponential of SU(3) matrices
2017-05-25 12:07:47 +01:00
paboyle
a8c10b1933
Use a global-X x Local-Y chunksize for parallel binary I/O.
Gives O(32 x 8 x 18*8*8) chunk size on configuration I/O.
At 150KB should be getting close to packet sizes and 4MB filesystem
block sizes that are reasonably (!?) performant. We shall see once I move
this off my laptop and over to BNL and time it.
2017-05-25 11:43:33 +01:00
Chulwoo Jung
5c479ce663
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/Lanczos
2017-05-24 18:58:53 -04:00
Chulwoo Jung
4bf9d65bf8
Checking in memory saving version of Lanczos
2017-05-24 18:57:32 -04:00
Chulwoo Jung
3a056c4dff
Re-adding Bisection for SimpleLanczos
2017-05-22 18:23:03 -04:00
Guido Cossu
15e801af3f
Fixing a compilation error for generic SIMD
2017-05-19 16:39:36 +01:00
Chulwoo Jung
b0ba651654
Turning off the final sort for now
2017-05-19 10:49:09 -04:00
Chulwoo Jung
25d4c175c3
Cleaning up Lanczos
2017-05-18 18:33:47 -04:00
Guido Cossu
a8fb2835ca
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-18 14:45:00 +01:00
22f4feee7b
Merge branch 'develop' into feature/scalar_adjointFT
2017-05-17 13:27:13 +02:00
paboyle
3267683e22
Union workaround for g++
2017-05-17 11:26:18 +01:00
Azusa Yamaguchi
f46a67ffb3
Fixed failure to compile with clang on Mac.
Compiler version was clang++-3.9 under mpicxx
2017-05-17 10:51:01 +01:00
Guido Cossu
10f2872aae
Faster exponentiation for lattice fields
2017-05-15 15:51:16 +01:00
35fa3d1dfd
Merge branch 'master' into feature/scalar_adjointFT
2017-05-12 10:41:39 +01:00
paboyle
49a5d9bac7
Clang major, minor trailing underscore
2017-05-11 12:25:02 +01:00
paboyle
8a43e88b4f
Compiler check early in build
2017-05-11 11:43:06 +01:00
paboyle
238df20370
Still working on the compiler compat checks
2017-05-11 11:30:14 +01:00
paboyle
655492a443
Compiler detection
2017-05-11 11:21:11 +01:00
paboyle
1cab06f6bd
Compat checks for compilers
2017-05-11 10:20:24 +01:00