1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-10 19:36:56 +01:00
Commit Graph

1145 Commits

Author SHA1 Message Date
dc5f32e5f0 Merge branch 'master' into hadrons 2016-04-30 00:18:31 -07:00
f6c53e5039 Merge commit '1e554350acae0e67fa7177ed0db9d4f684a54af2' 2016-04-30 00:17:52 -07:00
405b175665 Merge branch 'master' into hadrons 2016-04-30 00:16:06 -07:00
ba09cbae3e function to read std::vector from a string (blank separated values) 2016-04-30 00:15:44 -07:00
6aa000176f Fermion <-> Propagator functions 2016-04-30 00:14:33 -07:00
23b6172c31 Bernoulli RNG 2016-04-30 00:14:13 -07:00
3f128443ab OS X icpc fix 2016-04-30 00:13:33 -07:00
1e554350ac The threaded coms didn't agree with GCC. Suprised, and looks like GCC bug. 2016-04-29 16:49:18 -07:00
c79ea0dcef Fixingn IMCI 2016-04-22 21:52:54 -07:00
e3f141f82f Fixed SSE compile with typecasts 2016-04-22 10:30:30 -07:00
a6dfa2386b GCC choked on intrinsics calls that ICPC did not 2016-04-22 06:33:41 -07:00
d9b5e66877 Update Make.inc 2016-04-20 18:25:48 +01:00
8fd8bc25e9 simd 5th dim with rotation 2016-04-19 15:39:00 -07:00
ba427abde9 simd 5d 2016-04-19 15:38:39 -07:00
9b6ab6db16 simd in 5th dimension support 2016-04-19 15:38:01 -07:00
806a83d38b simd in fifth dim support for dwf 2016-04-19 15:36:19 -07:00
7223753355 Rotate in a direction > 2 for simd_layout 2016-04-19 15:35:15 -07:00
b27bac4669 Updates for simd in one dir 2016-04-19 15:34:10 -07:00
c8a93d6a93 Cartesian changes to allow all simd in one direction 2016-04-19 15:18:12 -07:00
04072a5e1f Rotate is a temporary hack. Would like to merge ALL
permutes as rotates of length 2, and make any rotate active
over any subset of lane bits. This is hard, and requires general
permute; current intrinsics mean this is only really possible for specific
case by case encodings as presently performed. Intel could produce a general
permute.. would help. IBM did it in VMX.
2016-04-19 15:15:34 -07:00
574ea4f843 const safety 2016-04-19 15:15:11 -07:00
587f80cd93 Updated to compile and pass under intel SDE 2016-04-19 15:13:54 -07:00
528eb773ad Merged.
Merge branch 'master' of https://github.com/paboyle/Grid
2016-04-19 22:24:34 +01:00
e5657510b0 Rotate support for Ls simd-ized 2016-04-19 22:24:18 +01:00
f473919526 Rotate support 2016-04-19 22:23:51 +01:00
e33b0f6ff7 cleaner output 2016-04-16 08:41:53 +01:00
9ee54e0db7 debug output removed 2016-04-16 08:41:28 +01:00
ab56ccdd25 -Complete and working implementation of Grid_empty 2016-04-15 13:17:42 -04:00
a646260e82 Merge remote-tracking branch 'origin/master' into ckelly-dec12-2015 2016-04-06 13:57:28 -04:00
af9c8d1372 -Checkerboard fixes for Lanczos 2016-04-06 13:50:56 -04:00
b1192a8908 Benchmark_zmm added 2016-04-06 03:00:07 -07:00
e8dddb1596 Adding extra benchmark 2016-04-06 10:32:54 +01:00
c7ba47bdc7 Merge branch 'master' of https://github.com/paboyle/Grid 2016-04-06 02:56:28 +01:00
e67fc2be18 Adding a trial for openmp overhead minimisation 2016-03-31 16:00:37 +01:00
f473ef7591 Fixing the compile 2016-03-31 07:47:42 -07:00
8052556275 Cleaning up the single/double kernel implementation switch 2016-03-31 14:51:32 +01:00
60d965f79e AVX512 improvements; sigfpe trapping too 2016-03-30 08:42:34 +01:00
83b15bfcdd Better Avx512 assembly sequence for SU3 using fmaddsub to get the imag imag sign 2016-03-30 08:39:39 +01:00
1ecbf9794d Merge branch 'master' of https://github.com/paboyle/Grid 2016-03-30 08:37:55 +01:00
2ded354403 configure 2016-03-30 00:17:43 -07:00
340428a1fe Eigen fixes and HDCR work 2016-03-30 00:16:02 -07:00
c77b7ee897 AddSub based alternate SU3 routine 2016-03-28 17:55:22 -06:00
b6c3bc574b Moving to a more coherent organisation of the inline assembly and arch dependencies. 2016-03-28 16:24:37 +01:00
1e355a51e1 Interface change 2016-03-27 23:46:55 -07:00
ad80f61fba AVX512 shaken out 2016-03-28 00:38:05 -06:00
21abaf7e91 Gamma sign change 2016-03-28 00:35:45 -06:00
165bffc2e7 Avx512 changes for assembler kernels 2016-03-26 22:25:45 -06:00
644fd6d32e Build avx512 clean 2016-03-25 09:35:33 -07:00
f54e0ec9bd Try lanczos to set up hdcr subspace 2016-03-17 10:36:16 +00:00
60d4564151 ICC no compile fix 2016-03-16 02:30:40 -07:00