paboyle
3844bcf800
If no f16c instructions supported must use software half precision conversion.
...
This will also become useful on BG/Q, so will move out from SSE4 into a general area.
Lifted the Eigen half precision from web. Looks sensible, but not extensively regressed
against the intrinsics implementation yet.
2017-04-20 15:30:52 +01:00
paboyle
d3b9a7fa14
F16c apparently requires AVX, even if the 128 bit are used.
...
Seems odd.
2017-04-13 13:19:11 +01:00
paboyle
4226c633c4
Default to FP16 off again
2017-04-13 12:51:39 +01:00
paboyle
db5ea001a3
Update to use Xcode 8.3 since -mfp16 causes SIGILL
2017-04-13 12:22:40 +01:00
paboyle
1d502e4ed6
FP16 optional compile time
2017-04-13 11:55:24 +01:00
paboyle
73cdf0fffe
Drop f16c from SSE because of a macos compile error on travis
2017-04-13 11:23:41 +01:00
paboyle
9c3065b860
Debug flags off again
2017-04-13 10:01:32 +01:00
paboyle
68392ddb5b
Exchange in generic
...
Precision change in AVX, SSE, AVX512, Generic. QPX still to do.
2017-04-13 08:38:12 +01:00
paboyle
cb6b81ae82
Half precision conversion
2017-04-12 19:32:37 +01:00
paboyle
d1d63a4f2d
sitmo default
2017-04-02 00:26:05 +09:00
Francesco Sanfilippo
041884acf0
Prepending PACKAGE_ with GRID_ in Config.h
...
Avoid polluting linking progr
2017-02-21 22:51:36 +01:00
Guido Cossu
677757cfeb
Added and tested SITMO PRNG
2017-01-25 12:47:22 +00:00
5803933aea
First implementation of HDF5 serial IO writer, reader is still empty
2017-01-17 16:21:18 -08:00
91e98b1dd5
Merge branch 'feature/hadrons' into develop
2016-12-15 18:15:56 +00:00
7a1ac45679
Hadrons: configure.ac Linux typo
2016-12-05 14:00:10 +09:00
9ad3d3453e
Hadrons is now a library, the previous XML driven program is now a test
2016-12-01 21:36:29 +09:00
7a1a7a685e
Merge branch 'feature/fft-opt' into feature/hadrons
2016-11-27 15:32:03 +09:00
Guido Cossu
1e44fd3094
Added some details on the mpi flags for Cray machines
2016-11-26 18:30:53 +00:00
a2cffb0304
AVXFMA target fixed
2016-11-21 17:47:18 +01:00
bafbac6ac4
Merge branch 'feature/gen-simd' into develop
2016-11-19 13:45:30 +01:00
595f1ce371
GEN SIMD build fix
2016-11-19 13:45:12 +01:00
97cddda49e
Merge branch 'feature/gen-simd' into feature/doxygen
...
# Conflicts:
# Makefile.am
# configure.ac
2016-11-19 13:11:13 +01:00
b873504b90
fully generic SIMD
2016-11-19 01:32:39 +01:00
042ae5b87c
generic 256bits SIMD
2016-11-15 12:16:15 +00:00
13a8997789
Merge branch 'release/v0.6.0' into feature/hadrons
...
# Conflicts:
# Makefile.am
2016-11-08 20:43:39 +00:00
7df940dc3e
homemade test recusrive target for old autotools versions
2016-11-04 22:32:25 +00:00
8af8b047fd
tests is now a recusrsive target
2016-11-04 13:44:21 +00:00
92cd797636
MPI auto configure fix
2016-11-03 13:48:07 +00:00
paboyle
9e2ec2719b
Merge branch 'develop' into feature/mpi3-master-slave
2016-11-02 13:02:56 +00:00
paboyle
791cb050c8
Comms improvements
2016-11-01 11:35:43 +00:00
e74417ca12
big build system polish
2016-10-31 16:31:27 +00:00
paboyle
39f1c880b8
mpi3
2016-10-20 16:56:40 +01:00
azusayamaguchi
81f2aeaece
KNL streaming stores, and KNL performance coutners
2016-10-12 11:45:22 +01:00
Guido Cossu
b56c9ffa52
Fix for AVXFMA
2016-10-10 14:43:37 +01:00
cb02b7088f
Merge branch 'develop' into feature/doxygen
...
# Conflicts:
# configure.ac
2016-10-09 13:35:44 +01:00
77c8a94dae
AVXFMA4 flag fix for Intel Compiler
2016-10-09 12:55:12 +01:00
98439847cf
configure portability fix
2016-10-05 14:57:20 +01:00
7ea4b959a4
hopefully more portable configure output
2016-09-27 11:54:37 +01:00
Guido Cossu
15d8f5c88c
Small change to the configure.ac to include the canonical names
2016-09-23 11:05:36 +01:00
a034e9901b
Merge branch 'develop' into feature/hadrons
2016-09-20 13:49:33 +01:00
d2573189d8
build system: FFTW fix
2016-09-20 12:30:24 +01:00
2e74520821
removed libtool use (BG/Q compatibility)
2016-09-16 15:25:49 +01:00
Antonin Portelli
6dd75ad9e5
Merge branch 'develop' of github.com:paboyle/Grid into feature/bgq
2016-09-16 15:07:54 +01:00
paboyle
ff6da364e8
FFT double and single precision gives good performance now in multithreaded code.
2016-08-24 15:05:00 +01:00
4d11a6f5f2
first commit for QPX intrinsics
2016-08-23 14:41:44 +01:00
paboyle
88be3b39bb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-08-22 18:29:36 +01:00
paboyle
29c4ef41de
Adding a test for libfftw3
2016-08-22 16:21:01 +01:00
paboyle
90e70790f3
Feature for z-Mobius prep
2016-08-15 22:31:29 +01:00
573b8c6020
build system: -O3 is not overriden by env CXXFLAGS
2016-08-06 01:26:24 +01:00
7b56f63a5c
configure Doxygen output fix
2016-08-05 15:35:29 +01:00