Peter Boyle
5f780806c2
Merge pull request #279 from paboyle/bugfix/nvcc-config
...
configure fix for nvcc with extra arguments as CXX
2020-05-06 10:07:52 -04:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
u37294
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
u37294
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
Christopher Kelly
dee96cbf82
Added workaround in configure to still catch Cuda compiler when nvcc with extra arguments (eg -ccbin) is used as CXX
2020-04-29 10:37:11 -04:00
nils meyer
345721220e
resolved merge conflict
2020-04-24 10:14:21 +02:00
nils meyer
6db68d6ecb
added SVE configure for armclang and gcc
2020-04-24 10:10:47 +02:00
Michael Marshall
5daf176f4a
Updated to expose GRID_CXXLD in addition to CXXLD.
...
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
nmeyer-ur
09f0963d1f
changes in configure.ac ; to be verified
2020-04-23 11:27:03 +02:00
nils meyer
6f44e3c192
reverted changes in configure.ac ; included SVE configure readme
2020-04-23 11:18:50 +02:00
nils meyer
5893888f87
removed default no-strict-aliasing for gcc-10.0.1 exclusively
2020-04-22 19:29:55 +02:00
4e864e56c9
develop pull
2020-04-10 17:19:18 +01:00
6235c7ba98
IPP path fix in configure
2020-03-27 17:23:29 +00:00
7e13724882
removing Hadrons
2020-03-27 12:03:32 +00:00
Peter Boyle
03da4040e2
Make summit happy
2020-01-06 11:47:48 -05:00
Peter Boyle
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
Peter Boyle
e4399e3ee1
--enable-summit flag
2019-11-21 15:02:10 -05:00
Peter Boyle
ec8e060ec7
Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
2019-10-31 11:46:09 -04:00
gfilaci
0c1efa5235
pass OpenMP flag to host compiler
2019-09-03 12:12:25 +01:00
Peter Boyle
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
Peter Boyle
966a203dcb
Interactions with GPU compilation
2019-07-11 03:16:17 +01:00
Peter Boyle
3e41b1055c
Remove Gpu only kernels.
2019-06-09 11:20:01 +01:00
gfilaci
3f1c4d8789
fix comment hash
2019-05-02 10:24:36 +01:00
Peter Boyle
60330e05a3
NVCC wacky compiler options frozen. Possibly Cuda 9.2 specific
2019-04-28 07:39:33 +01:00
Peter Boyle
04255128ef
HMC directory
2019-02-24 15:22:17 +00:00
Peter Boyle
3eae9a9e3f
update NVCC flags
2019-01-01 13:49:15 +00:00
Peter Boyle
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
6364aa8acf
Merge branch 'feature/contractor' into develop
2018-11-22 15:44:46 +00:00
d77bc88170
Optional support for faster CRC32C checksum through Intel IPP
2018-11-19 17:21:53 +00:00
Peter Boyle
2205b1e63e
Add CXX to grid-config
2018-11-07 13:32:46 +00:00
aadd9f4468
Eigenpack converter, to be tested, HadronsXmlRun moved to Utilities directory
2018-10-02 00:02:34 +01:00
2db1a4628c
build system minor fix
2018-08-28 18:26:30 +01:00
fb7d021b9d
Hadrons: moving Hadrons to root directory, build system improvements
2018-08-28 15:00:40 +01:00
133d5c2e34
Merge branch 'develop' into feature/hadrons
2018-08-10 16:36:40 +01:00
2a94244890
configure: --with-openssl option and LIME is now mandatory
2018-08-10 16:36:11 +01:00
584c921ca0
Eigen support fix (use of Grid as a library was broken)
2018-08-03 21:07:58 +01:00
paboyle
e036800261
Eigen fix
2018-07-31 01:08:42 +01:00
paboyle
e3a309a73f
Eigen happiness
2018-07-31 00:35:17 +01:00
Peter Boyle
a0714de8ec
Define vector length for GPU
2018-07-23 06:09:05 -04:00
paboyle
7a4c142955
Add GPU specific simd targets
2018-06-13 19:55:30 +01:00
Peter Boyle
172f412102
shmget reintroduce
2018-05-02 14:07:41 +01:00
Peter Boyle
c45f24a1b5
Improvements for tesseract
2018-04-30 21:50:00 +01:00
Peter Boyle
a8a0bb85cc
Control scalar execution or vector under generic. Disable Eigen vectorisation on powerpc / SUmmit
2018-04-12 12:32:57 -04:00
Peter Boyle
d5ce66f6ab
Extra SHM option
2018-03-16 21:37:03 +00:00
paboyle
5fac7080bc
Adding -fno-strict-aliasing by default
2018-03-08 13:01:24 +00:00
paboyle
03f8da8fbc
enable-debug option for debug flags in compile
2018-03-04 15:51:47 +00:00
paboyle
78a9e31ff0
options more obvious
2018-02-24 22:26:32 +00:00
Guido Cossu
655a69259a
Added support for GCC compilation for Skylake AVX512
2018-01-28 17:02:46 +01:00
paboyle
be7b37b9c9
Mistake on openmp
2018-01-27 00:05:11 +00:00
paboyle
c4f82e072b
_grid becomes private ; use Grid()§
2018-01-27 00:04:12 +00:00
paboyle
ff7b587fad
Ugly... nvcc needs -x cu to compile .cc as cuda.
...
Since CXXFLAGS is Also passed to linker, and -x cu breaks link phase must replace
CXX and CXXLD with nvcc -x cu and nvcc -link respectively.
2018-01-24 13:18:19 +00:00
paboyle
d74c21a386
GLobal edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
paboyle
7b3ed160aa
Rationalise MPI options
2018-01-08 11:26:48 +00:00
paboyle
9fe6ac71ea
Starting reorg of Blocked lanczos
2017-10-11 10:12:07 +01:00
Peter Boyle
837bf8a5be
Updating to control the SHM allocation scheme under configure time options
2017-09-05 12:51:02 +01:00
Peter Boyle
7d88198387
Merge branch 'develop' into feature/multi-communicator
2017-08-19 13:03:35 -04:00
Guido Cossu
c7036f6717
Adding checks for libm and libstdc++
2017-07-27 11:15:09 +01:00
Guido Cossu
237cfd11ab
Solving the spurious O2 flags
2017-07-26 12:08:51 +01:00
azusayamaguchi
dc6f078246
fixed the header file for mpi3
2017-07-11 14:15:08 +01:00
Peter Boyle
a0be3f7330
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-30 10:53:50 +01:00
Peter Boyle
2d3737a133
O3, KNL
2017-06-30 10:15:59 +01:00
Guido Cossu
8859a151cc
Small corrections to the NEON port
2017-06-29 11:30:29 +01:00
Guido Cossu
688a39cfd9
Merge pull request #114 from nmeyer-ur/feature/arm-neon
...
ARM neon intrinsics support
Guido: checked and approved
2017-06-29 09:57:17 +01:00
paboyle
54e94360ad
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
2017-06-24 23:10:24 +01:00
Nils Meyer
3d04dc33c6
ARM neon intrinsics support
2017-06-13 13:26:59 +02:00
paboyle
0494feec98
Libz dependency
2017-06-13 12:00:23 +01:00
paboyle
3bfd1f13e6
I/O improvements
2017-06-11 23:14:10 +01:00
paboyle
7a8f6af5f8
Drop verbose compiler predefine check
2017-05-11 12:48:40 +01:00
paboyle
2b3fdd4a58
Print CXX predefines
2017-05-11 12:05:50 +01:00
paboyle
529e78d43f
Restart the v0.7.0 release
2017-05-08 18:20:04 +01:00
paboyle
c1c7566089
GCC bug work around in 5.0 through 6.2 inclusive.
2017-05-06 15:20:25 +01:00
paboyle
2439999ec8
Warning elimination; drop to -O2 on G++ bad versions
2017-05-06 14:44:49 +01:00
paboyle
751f2b9703
Better check and benchmark driving
2017-05-05 19:54:38 +01:00
Guido Cossu
de84aacdfd
Fixing a configure error for the smearing tests
2017-05-05 13:59:10 +01:00
Guido Cossu
20999c1370
Merge branch 'develop' into feature/hmc_generalise
2017-05-05 12:47:17 +01:00
124bf4d829
git ref in config summary
2017-05-02 19:41:01 +01:00
e8e56b3414
Config summary saved in git-config
2017-05-02 19:40:47 +01:00
89c430136d
grid-config program
2017-05-02 19:13:13 +01:00
Guido Cossu
3344788fa1
Merge branch 'develop' into feature/hmc_generalise
2017-05-01 12:13:56 +01:00
paboyle
3844bcf800
If no f16c instructions supported must use software half precision conversion.
...
This will also become useful on BG/Q, so will move out from SSE4 into a general area.
Lifted the Eigen half precision from web. Looks sensible, but not extensively regressed
against the intrinsics implementation yet.
2017-04-20 15:30:52 +01:00
paboyle
d3b9a7fa14
F16c apparently requires AVX, even if the 128 bit are used.
...
Seems odd.
2017-04-13 13:19:11 +01:00
paboyle
4226c633c4
Default to FP16 off again
2017-04-13 12:51:39 +01:00
paboyle
db5ea001a3
Update to use Xcode 8.3 since -mfp16 causes SIGILL
2017-04-13 12:22:40 +01:00
paboyle
1d502e4ed6
FP16 optional compile time
2017-04-13 11:55:24 +01:00
paboyle
73cdf0fffe
Drop f16c from SSE because of a macos compile error on travis
2017-04-13 11:23:41 +01:00
paboyle
9c3065b860
Debug flags off again
2017-04-13 10:01:32 +01:00
paboyle
68392ddb5b
Exchange in generic
...
Precision change in AVX, SSE, AVX512, Generic. QPX still to do.
2017-04-13 08:38:12 +01:00
paboyle
cb6b81ae82
Half precision conversion
2017-04-12 19:32:37 +01:00
Guido Cossu
8c540333d5
Merge branch 'develop' into feature/hmc_generalise
2017-04-05 14:41:04 +01:00
paboyle
d1d63a4f2d
sitmo default
2017-04-02 00:26:05 +09:00
Guido Cossu
120fb59978
Adding tests for WilsonFlow classes
2017-03-21 16:11:35 +09:00
Guido Cossu
b3dede4dd3
Merge branch 'develop' into feature/hmc_generalise
2017-03-10 23:57:37 +09:00
Francesco Sanfilippo
29b60f7e1a
adding --with switch to pass lime path
2017-02-21 23:09:39 +01:00
Francesco Sanfilippo
041884acf0
Prepending PACKAGE_ with GRID_ in Config.h
...
Avoid polluting linking progr
2017-02-21 22:51:36 +01:00
Guido Cossu
e0571c872b
Merge branch 'develop' into feature/hmc_generalise
2017-02-09 16:12:00 +00:00
Guido Cossu
677757cfeb
Added and tested SITMO PRNG
2017-01-25 12:47:22 +00:00
Guido Cossu
17629b8d9e
Merge branch 'develop' into feature/hmc_generalise
2017-01-25 11:33:53 +00:00
5803933aea
First implementation of HDF5 serial IO writer, reader is still empty
2017-01-17 16:21:18 -08:00
91e98b1dd5
Merge branch 'feature/hadrons' into develop
2016-12-15 18:15:56 +00:00