1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-11-15 02:05:37 +00:00
Commit Graph

6459 Commits

Author SHA1 Message Date
u37294
04927d2e40 SYCL prep - no sycl just make it compile through DPC++ 2020-05-04 10:28:29 -07:00
u37294
7caed4edd9 dpc++ didn't like rdtsc() 2020-05-04 10:27:05 -07:00
u37294
59c51d2c35 Make compile if HAVE_LIME=0 2020-05-04 10:26:20 -07:00
u37294
ff53b231c8 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2020-05-04 10:25:10 -07:00
u37294
fc19cf905b Lime optional 2020-05-04 10:24:48 -07:00
Christoph Lehner
2a1387e992 rankInnerProduct 2020-05-03 17:27:11 -04:00
Christoph Lehner
9bfa51bffb cleanup comment 2020-05-03 09:12:52 -04:00
Christoph Lehner
38532753f4 interface cleanup 2020-05-03 08:58:32 -04:00
Christoph Lehner
949be9605c fix pragmas 2020-05-02 16:20:03 -04:00
Christoph Lehner
63cf201ee7 Add AdviseInfrequentUse 2020-05-02 11:38:42 -04:00
Christoph Lehner
c8af498a2a BinaryIO fix for alternative little-endian format name (used in 96I ensemble) 2020-05-01 03:45:50 -04:00
Christoph Lehner
ddb192bac7 re-work double precision promotion for summit 2020-04-30 16:09:57 -04:00
Michael Marshall
7666300a6f Merge branch 'develop' into bugfix/ET_go_home
* develop:
  Basis rotate stack passig to GPU reduction
  Clean up warning
2020-04-30 20:10:32 +01:00
Michael Marshall
4a4b9e305d Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0". 2020-04-30 19:40:04 +01:00
Peter Boyle
9b2d2d0fc3 Basis rotate stack passig to GPU reduction 2020-04-30 12:31:07 -04:00
Peter Boyle
5011753f4f Clean up warning 2020-04-30 10:23:48 -04:00
Michael Marshall
dbaeefaeef All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized) 2020-04-30 15:02:51 +01:00
Christopher Kelly
dee96cbf82 Added workaround in configure to still catch Cuda compiler when nvcc with extra arguments (eg -ccbin) is used as CXX 2020-04-29 10:37:11 -04:00
Peter Boyle
dd3ebc2ce4 Slow compile on NVCC switch off conserved current 2020-04-29 08:43:12 -04:00
Peter Boyle
103e7ae2f0 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2020-04-29 03:05:36 -04:00
Peter Boyle
29ae5615c0 Seqeuential fix 2020-04-29 03:05:15 -04:00
ferben
6240e02619 added assertion to avoid potential infinite loop 2020-04-27 18:50:53 +01:00
ferben
f4033ad8cb baryon speedup by a factor 2 2020-04-27 17:46:14 +01:00
nmeyer-ur
5abec5b8a9 SVE_readme update, update Grid_vector_types.h 2020-04-25 13:48:26 +02:00
nmeyer-ur
499edc0636 updated SVE_README.txt; defined ARMCLANGCOMPAT macro 2020-04-25 13:41:24 +02:00
nmeyer-ur
d990e61be3 armclang 20.1 settings in SVE readme 2020-04-25 12:11:43 +02:00
nmeyer-ur
3edb2dc2da removed -static from gcc CXXFLAGS 2020-04-24 13:04:34 +02:00
Christoph Lehner
f1fe444d4f blocked precision promotion infrastructure upgrade 2020-04-24 06:27:20 -04:00
nils meyer
345721220e resolved merge conflict 2020-04-24 10:14:21 +02:00
nils meyer
6db68d6ecb added SVE configure for armclang and gcc 2020-04-24 10:10:47 +02:00
Peter Boyle
dae820aa96
Merge pull request #277 from mmphys/bugfix/grid-config
Bugfix/grid config
2020-04-23 10:26:54 -04:00
Michael Marshall
5daf176f4a Updated to expose GRID_CXXLD in addition to CXXLD.
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
Michael Marshall
e96c86ec14 Make grid-config message more specific for --cxx and --cxxld 2020-04-23 13:10:45 +01:00
nmeyer-ur
09f0963d1f changes in configure.ac ; to be verified 2020-04-23 11:27:03 +02:00
nils meyer
6f44e3c192 reverted changes in configure.ac ; included SVE configure readme 2020-04-23 11:18:50 +02:00
Peter Boyle
c2c3cad20d Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2020-04-23 04:35:42 -04:00
Peter Boyle
edec9ee2e2 Conserved current rewrite done. Zmobius working 2020-04-23 04:34:01 -04:00
Peter Boyle
ed70cce542 Test for 5D DWF obserevables 2020-04-23 04:29:45 -04:00
Michael Marshall
4701201b5f grid-config: Expose CXXLD (for GPU build) and update help 2020-04-22 18:42:30 +01:00
nils meyer
5893888f87 removed default no-strict-aliasing for gcc-10.0.1 exclusively 2020-04-22 19:29:55 +02:00
nmeyer-ur
39b448affb Merge remote-tracking branch 'origin/develop' into feature/a64fx-2 2020-04-22 17:34:12 +02:00
nils meyer
e54a8f05a9 Exchange1 with generic version for now, should use svtbl2 in final version 2020-04-20 22:45:27 +02:00
Peter Boyle
0782b76ed4
Merge pull request #274 from paboyle/feature/zmobius_paramcompute
ZMobius parameter computation
2020-04-20 14:39:29 -04:00
Christopher Kelly
0896f2cead Added missing include guards in bigfloat_double.h 2020-04-20 10:30:38 -04:00
Christopher Kelly
181709bba4 Merge branch 'develop' into feature/zmobius_paramcompute 2020-04-20 09:12:34 -04:00
nils meyer
64b72fc17f testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only 2020-04-19 01:25:40 +02:00
Christoph Lehner
091d5c605e towards more precise blocking 2020-04-17 04:25:28 -04:00
nils meyer
6fdce60492 revised BodyA64FX; 990 GiB/s Wilson, 687 GiB/s DW using intrinsics (armclang 20.0) 2020-04-16 22:43:32 +02:00
Peter Boyle
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
feature/CGinfo
2020-04-16 11:46:08 -04:00
Peter Boyle
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00