mirror of https://github.com/paboyle/Grid.git synced 2025-04-05 11:45:56 +01:00

429 Commits

Author SHA1 Message Date
Peter Boyle
66d997e031 Large scale change to support 5d fermion formulations.
Have 5d replicated wilson with 4d gauge working and matrix regressing
to Ls copies of wilson.
2015-05-31 15:09:02 +01:00
Peter Boyle
8c357dca8b Integer wrap problem fixed. 2015-05-29 14:11:34 +01:00
neo
661c7e3e37 Merge remote-tracking branch 'upstream/master' 2015-05-29 11:41:39 +09:00
neo
96ad352741 Some modifications to the configure to check SIMD support 2015-05-29 11:41:02 +09:00
Peter Boyle
62dccb3247 Weak scale the benchmarks automatically. 2015-05-28 13:47:01 +01:00
Peter Boyle
c0c1ebe757 Works now with Clang-avx, Clang-sse and ICPC-avx, ICPC-sse 2015-05-28 11:35:43 +01:00
Peter Boyle
bd81ac0f17 Improving the reduction to go through our own permute.
Must also do this for avx512
2015-05-27 16:07:17 +01:00
neo
d8b05e001d Check at configure time if CPU supports the requested SIMD optimization 2015-05-27 18:30:11 +09:00
neo
9fd6506d1f Included Gpermute in the new Grid_simd.h file style.
Now tested for SSE4. OK
2015-05-27 12:11:44 +09:00
neo
75442e48ce Added a .gitignore file to eliminate autoconf files from commits 2015-05-27 11:10:51 +09:00
Guido Cossu
26ec41288d Corrected AVX regression error. Tested. 2015-05-27 10:49:33 +09:00
neo
13707e0808 Merge remote-tracking branch 'upstream/master'
Conflicts:
	Makefile.in
2015-05-27 10:41:33 +09:00
neo
12ae11ef62 Adding support for doxygen generation 2015-05-27 10:34:56 +09:00
Peter Boyle
e468d75286 Auto gen files should never have been committed, but making everyone run
aclocal, automake, autoconf is a pain in the ass.
2015-05-26 22:20:40 +01:00
Peter Boyle
22d073eb2b Simd revert to Guido's commit. I edited concurrently and things went bad. 2015-05-26 22:20:09 +01:00
Peter Boyle
ccf10a973a Merge branch 'master' of https://github.com/paboyle/Grid
Conflicts:
	lib/Grid_simd.h
2015-05-26 20:04:08 +01:00
Peter Boyle
6ef0096dc9 Strip out the dslash kernel implementation 2015-05-26 19:55:18 +01:00
Peter Boyle
20100d0a40 Hand unrolled version of dslash in a separate class.
Useful to compare; raises Intel compiler from 9 GFlop/s to 17.5 GFlop/s
on Ivy Bridge core. Raises Clang from 14.5 to 17.5
2015-05-26 19:54:03 +01:00
neo
c04cad92ac More cleanup of Grid_simd.h 2015-05-26 13:54:34 +09:00
neo
aff978f60a Cleaning up simd files 2015-05-26 13:31:10 +09:00
neo
9ad6d0c65f Merge remote-tracking branch 'upstream/master'
Conflicts:
	lib/math/Grid_math_tensors.h
	lib/simd/Grid_vector_types.h
2015-05-26 13:14:06 +09:00
neo
377083e6ae Checked performance of new vector libraries.
Added check for C++11 support in configure.ac
2015-05-26 12:02:54 +09:00
Peter Boyle
c2ffb1a098 Makefile update 2015-05-25 14:43:08 +01:00
Peter Boyle
d7f5172860 Schur complement based red-black inversion working 2015-05-25 13:47:12 +01:00
Peter Boyle
201a110c51 Better EO support letting Schur solver work 2015-05-25 13:46:28 +01:00
Peter Boyle
1d4b1c48cc Mostly cosmetic 2015-05-25 13:45:32 +01:00
Peter Boyle
f6cade41b4 Better checkerboard tracking. 2015-05-25 13:45:08 +01:00
Peter Boyle
6e76f0c6cd move constants into red black 2015-05-25 13:44:35 +01:00
Peter Boyle
55ad54e0ff Updates now schur red black solver working 2015-05-25 13:43:58 +01:00
Peter Boyle
00ee531005 Herm op 2015-05-25 13:42:36 +01:00
Peter Boyle
ca30116144 red black fix 2015-05-25 13:42:12 +01:00
Peter Boyle
41ba13f951 Merge branch 'master' of https://github.com/paboyle/Grid 2015-05-23 09:36:08 +01:00
Peter Boyle
31a40fa37f Added 2015-05-23 09:36:01 +01:00
Peter Boyle
602248d5fe Extra targets 2015-05-23 09:35:37 +01:00
Peter Boyle
2ba641b25e More targets 2015-05-23 09:34:50 +01:00
Peter Boyle
2d30e82dcb Improving even odd sector; lot of work and thorough cleaning required for this 2015-05-23 09:34:16 +01:00
Peter Boyle
0b165afd9e Rely on default constructors 2015-05-23 09:33:42 +01:00
Peter Boyle
3954792f37 Better pragma use 2015-05-23 09:32:37 +01:00
Peter Boyle
8c7b5f5d3b Cosmetic 2015-05-23 09:31:15 +01:00
Peter Boyle
be8b4f89d6 Iterator required 2015-05-23 09:30:28 +01:00
neo
57feda4328 Completed implementation of new Grid_simd classes
Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested
2015-05-22 17:33:15 +09:00
Peter Boyle
a11850d2fb Merge pull request #7 from coppolachan/master
Added full support for SSE4
2015-05-22 05:58:59 +01:00
Peter Boyle
e0cc5ba920 Streaming store option ifdef 2015-05-21 06:47:05 +01:00
Peter Boyle
1b9ecbac3b Compile time select if we do the streaming store copy. Relies on Clang++ eliminating object copies,
and other compilers do not necessarily cope.
2015-05-21 06:39:00 +01:00
Peter Boyle
ac0941be9a adding two routines containing only a single operation so I can easily see the assembly dump 2015-05-21 06:37:46 +01:00
Peter Boyle
fb159e1cff Minor change 2015-05-21 06:37:20 +01:00
Peter Boyle
8bc0033326 useful to dump assembler 2015-05-21 06:36:47 +01:00
Peter Boyle
db786fac13 Didn't like a print statement 2015-05-21 06:36:15 +01:00
Peter Boyle
046485a7bb better comms benchmarking 2015-05-21 06:35:46 +01:00
Peter Boyle
9058135da0 Unroll pragma abstraction 2015-05-21 06:34:33 +01:00