mirror of https://github.com/paboyle/Grid.git synced 2024-11-15 02:05:37 +00:00
Commit Graph

479 Commits

Author SHA1 Message Date
Peter Boyle
a75b6f6e78 Large scale change to support 5d fermion formulations.
Have 5d replicated wilson with 4d gauge working and matrix regressing
to Ls copies of wilson.
2015-05-31 15:09:02 +01:00
Peter Boyle
9cfc180334 Integer wrap problem fixed. 2015-05-29 14:11:34 +01:00
neo
575e6001f3 Merge remote-tracking branch 'upstream/master' 2015-05-29 11:41:39 +09:00
neo
4403e117a7 Some modifications to the configure to check SIMD support 2015-05-29 11:41:02 +09:00
Peter Boyle
445e38acf6 Weak scale the benchmarks automatically. 2015-05-28 13:47:01 +01:00
Peter Boyle
a5c3424cfb Works now with Clang-avx, Clang-sse and ICPC-avx, ICPC-sse 2015-05-28 11:35:43 +01:00
Peter Boyle
e8be96bfe7 Improving the reduction to go through our own permute.
Must also do this for avx512
2015-05-27 16:07:17 +01:00
neo
be66fdcfab Check at configure time if CPU supports the requested SIMD optimization 2015-05-27 18:30:11 +09:00
neo
28ac219d81 Included Gpermute in the new Grid_simd.h file style.
Now tested for SSE4. OK
2015-05-27 12:11:44 +09:00
neo
4e3f4104ab Added a .gitignore file to eliminate autoconf files from commits 2015-05-27 11:10:51 +09:00
Guido Cossu
2ccbff6c6c Corrected AVX regression error. Tested. 2015-05-27 10:49:33 +09:00
neo
9344d41ac5 Merge remote-tracking branch 'upstream/master'
Conflicts:
	Makefile.in
2015-05-27 10:41:33 +09:00
neo
538bc41bbb Adding support for doxygen generation 2015-05-27 10:34:56 +09:00
Peter Boyle
b6a28f1de7 Auto gen files should never have been committed, but making everyone run
aclocal, automake, autoconf is a pain in the ass.
2015-05-26 22:20:40 +01:00
Peter Boyle
6d2e056187 Simd revert to Guido's commit. I edited concurrently and things went bad. 2015-05-26 22:20:09 +01:00
Peter Boyle
fb37b57c2d Merge branch 'master' of https://github.com/paboyle/Grid
Conflicts:
	lib/Grid_simd.h
2015-05-26 20:04:08 +01:00
Peter Boyle
5e72e4c0d9 Strip out the dslash kernel implementation 2015-05-26 19:55:18 +01:00
Peter Boyle
a32ac287bb Hand unrolled version of dslash in a separate class.
Useful to compare; raises Intel compiler from 9 GFlop/s to 17.5 GFlop/s
on Ivy Bridge core. Raises Clang from 14.5 to 17.5
2015-05-26 19:54:03 +01:00
neo
fb5d72973e More cleanup of Grid_simd.h 2015-05-26 13:54:34 +09:00
neo
3f576830f9 Cleaning up simd files 2015-05-26 13:31:10 +09:00
neo
257aa92421 Merge remote-tracking branch 'upstream/master'
Conflicts:
	lib/math/Grid_math_tensors.h
	lib/simd/Grid_vector_types.h
2015-05-26 13:14:06 +09:00
neo
ece86f717b checked performance of new vector libraries.
Added check for c++11 support on the configure.ac
2015-05-26 12:02:54 +09:00
Peter Boyle
3a6ff2d7b8 Makefile update 2015-05-25 14:43:08 +01:00
Peter Boyle
2ae6214104 Schur complement based red-black inversion working 2015-05-25 13:47:12 +01:00
Peter Boyle
1a9841a0f1 Better EO support letting Schur solver work 2015-05-25 13:46:28 +01:00
Peter Boyle
55685b7cf5 Mostly cosmetic 2015-05-25 13:45:32 +01:00
Peter Boyle
3358a77c7a Better checkerboard tracking. 2015-05-25 13:45:08 +01:00
Peter Boyle
bc947477f3 move constants into red black 2015-05-25 13:44:35 +01:00
Peter Boyle
29f72292ba Updates now schur red black solver working 2015-05-25 13:43:58 +01:00
Peter Boyle
9b5633ff4f Herm op 2015-05-25 13:42:36 +01:00
Peter Boyle
17a06af1ff red black fix 2015-05-25 13:42:12 +01:00
Peter Boyle
c25016030c Merge branch 'master' of https://github.com/paboyle/Grid 2015-05-23 09:36:08 +01:00
Peter Boyle
2806273340 Added 2015-05-23 09:36:01 +01:00
Peter Boyle
73ee36c48d Extra targets 2015-05-23 09:35:37 +01:00
Peter Boyle
b8fdb65fbf More targets 2015-05-23 09:34:50 +01:00
Peter Boyle
65f2e6b269 Improving even odd sector; lot of work and through required cleaning this 2015-05-23 09:34:16 +01:00
Peter Boyle
d07a5c084d Rely on default constructors 2015-05-23 09:33:42 +01:00
Peter Boyle
a2928321b6 Better pragma use 2015-05-23 09:32:37 +01:00
Peter Boyle
764732944f Cosmetic 2015-05-23 09:31:15 +01:00
Peter Boyle
ae58a9ada2 Iterator required 2015-05-23 09:30:28 +01:00
neo
1c862dc15b Completed implementation of new Grid_simd classes
Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested
2015-05-22 17:33:15 +09:00
Peter Boyle
96e5c5c6ca Merge pull request #7 from coppolachan/master
Added full support for SSE4
2015-05-22 05:58:59 +01:00
Peter Boyle
d8061afe24 Streaming store option ifdef 2015-05-21 06:47:05 +01:00
Peter Boyle
874b2eb32d Compile time select if we do the streaming store copy. Relies on Clang++ eliminating object copies,
and other compilers do not necessarily cope.
2015-05-21 06:39:00 +01:00
Peter Boyle
f1fb92fd01 adding two routines containing only a single operation so I can easily see the assembly dump 2015-05-21 06:37:46 +01:00
Peter Boyle
3e1d1aff18 Minor change 2015-05-21 06:37:20 +01:00
Peter Boyle
c96af471ee useful to dump assembler 2015-05-21 06:36:47 +01:00
Peter Boyle
57a01e6bbb Didn't like a print statement 2015-05-21 06:36:15 +01:00
Peter Boyle
d806581666 better comms benchmarking 2015-05-21 06:35:46 +01:00
Peter Boyle
35055ed5c1 Unroll pragma abstraction 2015-05-21 06:34:33 +01:00