neo
57feda4328
Completed implementation of new Grid_simd classes
...
Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested
2015-05-22 17:33:15 +09:00
Peter Boyle
a11850d2fb
Merge pull request #7 from coppolachan/master
...
Added full support for SSE4
2015-05-22 05:58:59 +01:00
Peter Boyle
e0cc5ba920
Streaming store option ifdef
2015-05-21 06:47:05 +01:00
Peter Boyle
1b9ecbac3b
Compile time select if we do the streaming store copy. Relies on Clang++ eliminating object copies,
...
and other compliers do not necessarily cope.
2015-05-21 06:39:00 +01:00
Peter Boyle
ac0941be9a
adding two routines containing only a single operation so I can easily see the assembly dump
2015-05-21 06:37:46 +01:00
Peter Boyle
fb159e1cff
Minor change
2015-05-21 06:37:20 +01:00
Peter Boyle
8bc0033326
useful to dump assembler
2015-05-21 06:36:47 +01:00
Peter Boyle
db786fac13
Didn't like a print statement
2015-05-21 06:36:15 +01:00
Peter Boyle
046485a7bb
better comms benchmarking
2015-05-21 06:35:46 +01:00
Peter Boyle
9058135da0
Unroll pragma abstraction
2015-05-21 06:34:33 +01:00
neo
f8d8958884
Merge remote-tracking branch 'upstream/master'
...
Conflicts:
lib/simd/Grid_vector_types.h
tests/Makefile.am
2015-05-20 17:32:46 +09:00
neo
e529210f43
Implemented all SSE4 functions.
...
A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK.
2015-05-20 17:22:40 +09:00
Peter Boyle
8fdff33b3a
Merging in
...
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 21:30:13 +01:00
Peter Boyle
91ed085ca4
Build a simple kernel to compare intel compiler and clang in simple environment
2015-05-19 21:29:40 +01:00
Peter Boyle
efc0d1e0b9
Reworking to keep intel compiler happy
2015-05-19 21:29:07 +01:00
Peter Boyle
2d8b5a8191
Optimisation...
2015-05-19 15:50:47 +01:00
Peter Boyle
b520694b00
Merge branch 'coppolachan-master'
2015-05-19 15:05:32 +01:00
Peter Boyle
3fe7275332
Merged
...
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
Conflicts:
lib/simd/Grid_vector_types.h
2015-05-19 15:05:07 +01:00
azusayamaguchi
ee8cf77071
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 14:55:26 +01:00
azusayamaguchi
c8c74e591f
Add messages to get the number of threads for openmp
2015-05-19 14:54:42 +01:00
Peter Boyle
a6e1ea216d
Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,
...
not even SU(3) for now) gauge field. Convergence history is correctly indepdendent of decomposition
on 1,2,4,8,16 mpi tasks.
Found a couple of simd bugs which required fixed and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe Mooee schur type stuff in the wilson dop.
2015-05-19 13:57:35 +01:00
neo
7fb3221d47
Partial implementation of the vector types SIMD
...
Implementing SSE4 now
A systematic series of tests must be written.
2015-05-19 17:21:17 +09:00
neo
639fd05239
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
neo
d6887beead
Merging with upstream
2015-05-19 13:36:03 +09:00
Peter Boyle
6f387b4916
Merge branch 'coppolachan-master'
2015-05-18 16:36:58 +01:00
Peter Boyle
9bfe0e63f4
lib/algorithms/approx/bigfloat.h
2015-05-18 16:35:48 +01:00
Peter Boyle
30494bd96d
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
...
Conflicts:
lib/algorithms/approx/bigfloat.h
2015-05-18 16:34:21 +01:00
Peter Boyle
6f038a7f6d
Convience function
2015-05-18 16:28:29 +01:00
Peter Boyle
193fd5532f
Remez tested
2015-05-18 12:09:25 +01:00
neo
fa1dc5e448
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
neo
6d2accba7b
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
Peter Boyle
1887c77498
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
Peter Boyle
6eb8dfd902
Working towards solvers
2015-05-17 00:19:03 +01:00
Peter Boyle
e841395dfd
Updating preparing for solvers etc..
2015-05-16 23:35:08 +01:00
Peter Boyle
cf99a1f37d
Better build automation
2015-05-16 07:16:45 +01:00
Peter Boyle
deac65a92d
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-16 06:42:03 +01:00
Peter Boyle
f6e54a7bd4
Moved things around
2015-05-16 06:40:10 +01:00
Peter Boyle
53260e7a39
Typoo xifed
2015-05-16 05:49:32 +01:00
Peter Boyle
a900790b44
Update Grid_lattice_trace.h
2015-05-16 04:40:28 +01:00
Peter Boyle
b731bf6976
Pretty syntax
2015-05-16 04:37:26 +01:00
Peter Boyle
5f8b82b90c
Optimisation and syntax pretty
2015-05-16 04:36:22 +01:00
Peter Boyle
25bfa7e830
more digits
2015-05-16 04:33:40 +01:00
Peter Boyle
afda459886
strong inline
2015-05-16 04:33:10 +01:00
Peter Boyle
c43869a83a
Extra compile targs
2015-05-15 14:41:59 +01:00
Peter Boyle
87bc17831d
Added su3 matrix benchmark.
2015-05-15 14:41:19 +01:00
Peter Boyle
76cbcff2f1
Log the bug report code into the git repo.
2015-05-15 12:39:53 +01:00
Peter Boyle
f43589369a
Compile options tweak
2015-05-15 12:33:18 +01:00
Peter Boyle
c99922b591
Out of source compile now working
2015-05-15 12:21:40 +01:00
Peter Boyle
de0c199604
Convenience multi-compiler build with out of source compile
2015-05-15 12:21:10 +01:00
Peter Boyle
3ed30169ea
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00