|
4c75095c61
|
HDF5: header fix
|
2017-01-20 12:14:01 -08:00 |
|
|
afa095d33d
|
HDF5: better complex number support
|
2017-01-20 12:10:41 -08:00 |
|
|
6b5259cc10
|
HDF5 detects if a name is a dataset or not without using exception catching
|
2017-01-20 11:03:19 -08:00 |
|
|
7423a352c5
|
HDF5: typos
|
2017-01-19 18:33:04 -08:00 |
|
|
81e66d6631
|
HDF5: revert back to native types
|
2017-01-19 18:24:53 -08:00 |
|
|
ade1058e5f
|
Hdf5Type does not need to be a pointer anymore
|
2017-01-19 18:23:55 -08:00 |
|
|
6eea9e4da7
|
HDF5 types static initialisation is mysteriously buggy on BG/Q, changing strategy
|
2017-01-19 18:02:53 -08:00 |
|
|
2c673666da
|
Standardisation of HDF5 types
|
2017-01-19 17:19:12 -08:00 |
|
|
5405526424
|
Code typo
|
2017-01-18 22:42:19 -08:00 |
|
|
654e0b0fd0
|
Serialisable objects are now comparable with ==
|
2017-01-18 17:40:32 -08:00 |
|
|
4be08ebccc
|
debug code cleaning
|
2017-01-18 17:39:59 -08:00 |
|
|
f599cb5b17
|
HDF5 serial IO implemented and tested
|
2017-01-18 16:50:21 -08:00 |
|
|
5803933aea
|
First implementation of HDF5 serial IO writer, reader is still empty
|
2017-01-17 16:21:18 -08:00 |
|
|
91a3534054
|
Lattice slice utilities now thread safe
|
2017-01-16 06:32:25 +00:00 |
|
Peter Boyle
|
c3b6d573b9
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2016-12-30 22:42:17 +00:00 |
|
Peter Boyle
|
1e179c903d
|
Worried about integer; suspect where statements are broken
|
2016-12-27 17:46:38 +00:00 |
|
Peter Boyle
|
669cfca9b7
|
No inline
|
2016-12-27 17:45:40 +00:00 |
|
Peter Boyle
|
ff2f559a57
|
Remove inline on gather optimised path
|
2016-12-27 17:45:19 +00:00 |
|
Peter Boyle
|
03c81bd902
|
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
|
2016-12-27 11:25:35 +00:00 |
|
Peter Boyle
|
a869addef1
|
Stats switch off
|
2016-12-27 11:25:22 +00:00 |
|
Peter Boyle
|
1caa3fbc2d
|
LOCK UNLOCK only
|
2016-12-27 11:24:45 +00:00 |
|
Peter Boyle
|
3d21297bbb
|
Call the fast path compressor for wilson kernels to avoid if else on projector
|
2016-12-27 11:23:13 +00:00 |
|
Peter Boyle
|
25efefc5b4
|
Back to original thread policy post test
|
2016-12-23 09:49:04 +00:00 |
|
Peter Boyle
|
eabf316ed9
|
BGQ performance ASM
|
2016-12-22 21:56:08 +00:00 |
|
Peter Boyle
|
04ae7929a3
|
BGQ or KNL assembler now
|
2016-12-22 17:53:22 +00:00 |
|
Peter Boyle
|
caba0d42a5
|
L1p controls
|
2016-12-22 17:52:55 +00:00 |
|
Peter Boyle
|
9ae81c06d2
|
L1p controls for BG/Q
|
2016-12-22 17:52:21 +00:00 |
|
Peter Boyle
|
7dc36628a1
|
QPX finishing
|
2016-12-22 17:50:48 +00:00 |
|
Peter Boyle
|
b8cdb3e90a
|
Debug hack; raises from 62GF/s to 72 GF/s per node on BG/Q
|
2016-12-22 17:50:14 +00:00 |
|
Peter Boyle
|
5241245534
|
Default to static scheduling
|
2016-12-22 17:49:21 +00:00 |
|
Dr Peter Boyle
|
960316e207
|
type conversion in printf
|
2016-12-22 17:27:01 +00:00 |
|
|
f8d11ff673
|
better serialisable enums (can be encapsulated into classes)
|
2016-12-20 12:31:49 +01:00 |
|
paboyle
|
3f2d53a994
|
BGQ assembler beginning
|
2016-12-20 10:21:26 +00:00 |
|
paboyle
|
a59f5374d7
|
Evade warning
|
2016-12-18 02:23:55 +00:00 |
|
paboyle
|
4b220972ac
|
Warning fix
|
2016-12-18 02:14:17 +00:00 |
|
paboyle
|
629f43e36c
|
Return statement needed
|
2016-12-18 02:09:37 +00:00 |
|
paboyle
|
a3172b3455
|
Precision error
|
2016-12-18 02:07:45 +00:00 |
|
paboyle
|
3e6945cd65
|
Fixing AVX Z-mobius
|
2016-12-18 02:05:11 +00:00 |
|
paboyle
|
87be03006a
|
AVX 512 code broke other compiles; fixing
|
2016-12-18 01:45:09 +00:00 |
|
paboyle
|
f17436fec2
|
Bad commit fixed
|
2016-12-18 01:27:34 +00:00 |
|
Peter Boyle
|
4d8b01b7ed
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2016-12-18 00:56:57 +00:00 |
|
Peter Boyle
|
fa6acccf55
|
Zmobius asm
|
2016-12-18 00:56:19 +00:00 |
|
azusayamaguchi
|
df9108154d
|
Debugged 2 versions of assembler; ls vectorised, xyzt vectorised
|
2016-12-17 23:47:51 +00:00 |
|
azusayamaguchi
|
b3e7f600da
|
Partial implementation of 4d vectorisation assembler
|
2016-12-16 23:50:30 +00:00 |
|
azusayamaguchi
|
d4071daf2a
|
Template specialise
|
2016-12-16 22:28:29 +00:00 |
|
azusayamaguchi
|
a2a6329094
|
AVX512 only for ASM compilation
|
2016-12-16 22:03:29 +00:00 |
|
azusayamaguchi
|
eabc577940
|
Assembler possibly working
|
2016-12-16 16:55:36 +00:00 |
|
|
91e98b1dd5
|
Merge branch 'feature/hadrons' into develop
|
2016-12-15 18:15:56 +00:00 |
|
|
b791c274b0
|
Revert "AVX: uninitialised variable fix"
This reverts commit c22c3db9ad.
|
2016-12-15 18:15:35 +00:00 |
|
|
c22c3db9ad
|
AVX: uninitialised variable fix
|
2016-12-13 19:05:58 +00:00 |
|
Azusa Yamaguchi
|
426197e446
|
Nc=3
|
2016-12-12 09:10:54 +00:00 |
|
Azusa Yamaguchi
|
99e2c1e666
|
Kernels options
|
2016-12-12 09:08:53 +00:00 |
|
Azusa Yamaguchi
|
1440565a10
|
Decrease verbosity
|
2016-12-12 09:08:04 +00:00 |
|
Azusa Yamaguchi
|
e9f0c0ea39
|
Staggered kernels options
|
2016-12-12 09:07:38 +00:00 |
|
Peter Boyle
|
fe187e9ed3
|
Compiles and passes under ZMobius with assembler
|
2016-12-10 00:47:48 +00:00 |
|
Peter Boyle
|
0091b50f49
|
Zmobius working -- not asm yet
|
2016-12-09 22:51:32 +00:00 |
|
Peter Boyle
|
fb8d4b2357
|
Lots of debug on performance Mobius
|
2016-12-08 17:28:28 +00:00 |
|
Peter Boyle
|
83fa038bdf
|
Streaming stores
|
2016-12-08 16:58:42 +00:00 |
|
Peter Boyle
|
7a61feb6d3
|
Allocator added with caching for Linux VM subsystem optimisation
|
2016-12-08 16:58:01 +00:00 |
|
Peter Boyle
|
69ae817d1c
|
Updates for supporting Mobius better
|
2016-12-08 16:43:28 +00:00 |
|
|
51322da6f8
|
Hadrons: genetic scheduler improvement
|
2016-12-07 09:00:45 +09:00 |
|
|
c56707e003
|
useless debug message removed
|
2016-12-07 08:59:20 +09:00 |
|
Peter Boyle
|
e27c6b217c
|
Updating
|
2016-12-01 12:42:53 +00:00 |
|
|
9ad3d3453e
|
Hadrons is now a library, the previous XML driven program is now a test
|
2016-12-01 21:36:29 +09:00 |
|
paboyle
|
6adf35da54
|
Faster Mobius
|
2016-12-01 11:39:04 +00:00 |
|
paboyle
|
bd0430b34f
|
Serialisation in malloc fixed
|
2016-11-29 22:27:55 +00:00 |
|
Azusa Yamaguchi
|
c097fd041a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-29 13:44:17 +00:00 |
|
Azusa Yamaguchi
|
77fb25fb29
|
Push 5d tests
|
2016-11-29 13:43:56 +00:00 |
|
Azusa Yamaguchi
|
389e0a77bd
|
Staggered Fermion 5D
|
2016-11-29 13:13:56 +00:00 |
|
paboyle
|
4704f2d009
|
Actions updated
|
2016-11-29 00:14:36 +00:00 |
|
Guido Cossu
|
ae9688e343
|
Reporting also the total mflops
|
2016-11-28 11:37:02 +00:00 |
|
|
43928846f2
|
first steps to make Hadrons a library
|
2016-11-28 16:02:15 +09:00 |
|
|
fabcd4179d
|
Hadrons: propagator type coming from the fermion implementation
|
2016-11-28 14:02:10 +09:00 |
|
|
a8843c9af6
|
Code cleaning, the fermion implementation can be specified using the macro FIMPL
|
2016-11-27 16:47:22 +09:00 |
|
|
7a1a7a685e
|
Merge branch 'feature/fft-opt' into feature/hadrons
|
2016-11-27 15:32:03 +09:00 |
|
Lanny91
|
b18950f776
|
Added simd real divide test with QPX divide fixes
|
2016-11-25 13:21:33 +00:00 |
|
Lanny91
|
0acbf77bc6
|
Add QPX Div structure
|
2016-11-24 13:24:12 +00:00 |
|
|
5833f247fa
|
more FFT optimisations
|
2016-11-24 09:09:48 +09:00 |
|
Azusa Yamaguchi
|
95f43d27ae
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-22 13:49:22 +00:00 |
|
Azusa Yamaguchi
|
668ca57702
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-22 13:49:11 +00:00 |
|
|
a2cffb0304
|
AVXFMA target fixed
|
2016-11-21 17:47:18 +01:00 |
|
|
97cddda49e
|
Merge branch 'feature/gen-simd' into feature/doxygen
# Conflicts:
# Makefile.am
# configure.ac
|
2016-11-19 13:11:13 +01:00 |
|
|
b873504b90
|
fully generic SIMD
|
2016-11-19 01:32:39 +01:00 |
|
|
042ae5b87c
|
generic 256bits SIMD
|
2016-11-15 12:16:15 +00:00 |
|
paboyle
|
604f0ea2f6
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-09 04:13:01 -08:00 |
|
paboyle
|
33dc1f51b5
|
Final sign off commits from Cori-1
|
2016-11-09 04:11:03 -08:00 |
|
|
13a8997789
|
Merge branch 'release/v0.6.0' into feature/hadrons
# Conflicts:
# Makefile.am
|
2016-11-08 20:43:39 +00:00 |
|
|
9576f0903d
|
namespace fix
|
2016-11-08 19:07:47 +00:00 |
|
|
8a5e3a917c
|
Merge branch 'develop' into release/v0.6.0
# Conflicts:
# tests/core/Test_fft_gfix.cc
|
2016-11-08 16:53:42 +00:00 |
|
|
3d2a22a14d
|
include fix for MKL
|
2016-11-08 15:31:47 +00:00 |
|
azusayamaguchi
|
f85b35314d
|
Fix a routine for single node processor coor from rank
|
2016-11-08 11:49:13 +00:00 |
|
azusayamaguchi
|
0cff8754d1
|
Usecs
|
2016-11-08 11:35:41 +00:00 |
|
azusayamaguchi
|
692b44dac1
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-04 22:48:11 +00:00 |
|
azusayamaguchi
|
96ba42a297
|
omm buf
|
2016-11-04 22:47:25 +00:00 |
|
azusayamaguchi
|
f7b60004f3
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-04 16:08:07 +00:00 |
|
|
ad971ca07b
|
fftw3.h is now expected to be an external header
|
2016-11-04 13:12:35 +00:00 |
|
|
f2f16eb972
|
fftw3.h removed, please don't commit this file back
|
2016-11-04 13:11:05 +00:00 |
|
azusayamaguchi
|
b7d55f7dfb
|
Fix a typo in reorg of the --dslash-asm
|
2016-11-04 11:35:08 +00:00 |
|
azusayamaguchi
|
6e548a8ad5
|
Linux compile needed
|
2016-11-04 11:34:16 +00:00 |
|
Azusa Yamaguchi
|
ee686a7d85
|
Compiles now
|
2016-11-03 16:58:23 +00:00 |
|
Azusa Yamaguchi
|
1c5b7a6be5
|
Staggered phases first cut, c1, c2, u0
|
2016-11-03 16:26:56 +00:00 |
|
|
a5dd4a9bab
|
Merge branch 'feature/fft-opt' into develop
|
2016-11-03 14:34:46 +00:00 |
|
|
ec232af851
|
Photon.h references removed
|
2016-11-03 14:34:16 +00:00 |
|
|
17e30281e9
|
Merge branch 'develop' into feature/fft-opt
# Conflicts:
# lib/FFT.h
|
2016-11-03 14:14:03 +00:00 |
|
|
aee44dc694
|
Photon.h removed from develop branch
|
2016-11-03 13:54:15 +00:00 |
|
|
75bbf6a0af
|
Merge branch 'develop' into feature/feynman-rules
|
2016-11-03 13:52:11 +00:00 |
|
paboyle
|
111bfbc6bc
|
notimestamp by default
|
2016-11-03 11:40:26 +00:00 |
|
paboyle
|
f41a230b32
|
Decrease mpi3l verbose
|
2016-11-02 19:54:03 +00:00 |
|
paboyle
|
c067051d5f
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-02 13:59:18 +00:00 |
|
paboyle
|
9e2ec2719b
|
Merge branch 'develop' into feature/mpi3-master-slave
|
2016-11-02 13:02:56 +00:00 |
|
paboyle
|
757a928f9a
|
Improvement to use own SHM_OPEN call to avoid openmpi bug.
|
2016-11-02 12:37:46 +00:00 |
|
Guido Cossu
|
bc248b6948
|
Merge branch 'release/v0.6.0' into feature/KNL_double_prec
Conflicts:
lib/simd/Grid_avx512.h
|
2016-11-02 10:40:49 +00:00 |
|
Guido Cossu
|
ae8561892e
|
Eliminating useless defines
|
2016-11-02 10:21:06 +00:00 |
|
paboyle
|
32375aca65
|
Semaphore sleep/wake up on remote processes.
|
2016-11-02 09:27:20 +00:00 |
|
paboyle
|
bb94ddd0eb
|
Tidy up of mpi3; also some cleaning of the dslash controls.
|
2016-11-02 08:07:09 +00:00 |
|
James Harrison
|
7f0fc0eff5
|
Remove explicit use of double-precision types in photon.h
|
2016-11-01 16:02:35 +00:00 |
|
Azusa Yamaguchi
|
164d3691db
|
Staggered
|
2016-11-01 14:24:22 +00:00 |
|
paboyle
|
791cb050c8
|
Comms improvements
|
2016-11-01 11:35:43 +00:00 |
|
|
d5e95bc350
|
Merge branch 'release/v0.6.0' into feature/feynman-rules
|
2016-10-31 18:36:21 +00:00 |
|
|
7a84906b5f
|
Merge branch 'release/v0.6.0' into feature/fft-opt
|
2016-10-31 18:31:49 +00:00 |
|
|
66d832c733
|
FFTW header fix
|
2016-10-31 16:39:29 +00:00 |
|
|
e74417ca12
|
big build system polish
|
2016-10-31 16:31:27 +00:00 |
|
Guido Cossu
|
e8c3174ae2
|
Small change in the defines
|
2016-10-30 12:23:11 +00:00 |
|
Guido Cossu
|
9b066e94d0
|
Compilation with both single and double precision
|
2016-10-30 12:04:06 +00:00 |
|
James Harrison
|
618abdf302
|
Add missing volume factor in stochastic QED field
|
2016-10-29 11:04:02 +01:00 |
|
Guido Cossu
|
e1042aef77
|
First version of the double prec for testing purposes
It does not compile single and double version at the same time
|
2016-10-28 17:20:04 +01:00 |
|
paboyle
|
aa6a839c60
|
avx512 build fix; detect clang/gcc intrinsics vs. ICPC
|
2016-10-28 09:13:09 +01:00 |
|
|
b4d2af8c89
|
threaded FFT
|
2016-10-26 19:46:36 +01:00 |
|
|
434af6aeaa
|
Merge branch 'develop' into feature/fft-opt
|
2016-10-26 18:50:38 +01:00 |
|
|
e90f8ac841
|
Merge branch 'develop' into feature/feynman-rules
|
2016-10-26 18:50:21 +01:00 |
|
|
a1705a8d53
|
debug message removed
|
2016-10-26 18:50:07 +01:00 |
|
|
ca21003f01
|
Merge branch 'feature/fft-opt' into feature/feynman-rules
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
|
2016-10-26 18:44:47 +01:00 |
|
|
14ddf2c234
|
more FFT optimisations
|
2016-10-26 17:36:26 +01:00 |
|
Azusa Yamaguchi
|
bca861e112
|
Note: FFT should be GridFFT (not changed yet).
Gauge fix with FFT is added (tests/core)
|
2016-10-25 14:21:48 +01:00 |
|
|
33d199a0ad
|
temporary thread safety in FFT
|
2016-10-25 12:56:40 +01:00 |
|
paboyle
|
b820076b91
|
Merge branch 'develop' into feature/mpi3
|
2016-10-25 06:02:33 +01:00 |
|
paboyle
|
09f66100d3
|
MPI 3 compile on non-linux
|
2016-10-25 06:01:12 +01:00 |
|
azusayamaguchi
|
d7d92af09d
|
Travis fail fix attempt
|
2016-10-25 01:45:53 +01:00 |
|
azusayamaguchi
|
460d0753a1
|
Merge branch 'develop' into feature/mpi3
Conflicts:
lib/simd/Grid_avx512.h
|
2016-10-25 01:08:51 +01:00 |
|
azusayamaguchi
|
8f8058f8a5
|
More random bits on parallel seeding
|
2016-10-25 01:05:52 +01:00 |
|
azusayamaguchi
|
d97a27f483
|
Verbose
|
2016-10-25 01:05:31 +01:00 |
|
azusayamaguchi
|
7c3363b91e
|
Compiles all comms targets
|
2016-10-25 00:04:17 +01:00 |
|
azusayamaguchi
|
b94478fa51
|
mpi, mpi3, shmem all compile.
mpi, mpi3 pass single node multi-rank
|
2016-10-24 23:45:31 +01:00 |
|
|
13bf0482e3
|
FFT optimisation
|
2016-10-24 19:25:40 +01:00 |
|
|
a795b5705e
|
memory optimisation
|
2016-10-24 19:25:15 +01:00 |
|
|
392e064513
|
fast local peek-poke
|
2016-10-24 19:24:21 +01:00 |
|
azusayamaguchi
|
b6a65059a2
|
Update to use shared memory to contain the stencil comms buffers
Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions
|
2016-10-24 17:30:43 +01:00 |
|
azusayamaguchi
|
ea25a4d9ac
|
Works
|
2016-10-23 06:10:05 +01:00 |
|
azusayamaguchi
|
c190221fd3
|
Internal SHM comms in non-simd directions working
Need to fix simd directions
|
2016-10-22 18:14:27 +01:00 |
|
azusayamaguchi
|
0fcd2e7188
|
Simplify the comms structure prior to implementing Shared memory direct bouncs
|
2016-10-21 22:44:10 +01:00 |
|