6ef0096dc9
Strip out the dslash kernel implementation
2015-05-26 19:55:18 +01:00
20100d0a40
Hand unrolled version of dslash in a separate class.
...
Useful to compare; raises Intel compiler from 9 GFlop/s to 17.5 GFlop/s
on an Ivy Bridge core. Raises Clang from 14.5 to 17.5 GFlop/s.
2015-05-26 19:54:03 +01:00
d7f5172860
Schur complement based red-black inversion working
2015-05-25 13:47:12 +01:00
201a110c51
Better EO support letting Schur solver work
2015-05-25 13:46:28 +01:00
1d4b1c48cc
Mostly cosmetic
2015-05-25 13:45:32 +01:00
f6cade41b4
Better checkerboard tracking.
2015-05-25 13:45:08 +01:00
6e76f0c6cd
move constants into red black
2015-05-25 13:44:35 +01:00
55ad54e0ff
Updates now that the Schur red-black solver is working
2015-05-25 13:43:58 +01:00
00ee531005
Herm op
2015-05-25 13:42:36 +01:00
ca30116144
red black fix
2015-05-25 13:42:12 +01:00
41ba13f951
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-23 09:36:08 +01:00
2d30e82dcb
Improving even-odd sector; a lot of work and thorough cleaning required for this
2015-05-23 09:34:16 +01:00
0b165afd9e
Rely on default constructors
2015-05-23 09:33:42 +01:00
3954792f37
Better pragma use
2015-05-23 09:32:37 +01:00
8c7b5f5d3b
Cosmetic
2015-05-23 09:31:15 +01:00
be8b4f89d6
Iterator required
2015-05-23 09:30:28 +01:00
e0cc5ba920
Streaming store option ifdef
2015-05-21 06:47:05 +01:00
1b9ecbac3b
Compile time select if we do the streaming store copy. Relies on Clang++ eliminating object copies,
and other compilers do not necessarily cope.
2015-05-21 06:39:00 +01:00
db786fac13
Didn't like a print statement
2015-05-21 06:36:15 +01:00
9058135da0
Unroll pragma abstraction
2015-05-21 06:34:33 +01:00
f8d8958884
Merge remote-tracking branch 'upstream/master'
...
Conflicts:
lib/simd/Grid_vector_types.h
tests/Makefile.am
2015-05-20 17:32:46 +09:00
e529210f43
Implemented all SSE4 functions.
...
A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK.
2015-05-20 17:22:40 +09:00
8fdff33b3a
Merging in
...
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 21:30:13 +01:00
efc0d1e0b9
Reworking to keep intel compiler happy
2015-05-19 21:29:07 +01:00
2d8b5a8191
Optimisation...
2015-05-19 15:50:47 +01:00
3fe7275332
Merged
...
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
Conflicts:
lib/simd/Grid_vector_types.h
2015-05-19 15:05:07 +01:00
ee8cf77071
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 14:55:26 +01:00
c8c74e591f
Add messages to get the number of threads for openmp
2015-05-19 14:54:42 +01:00
a6e1ea216d
Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,
not even SU(3) for now) gauge field. Convergence history is correctly independent of decomposition
on 1,2,4,8,16 MPI tasks.
Found a couple of SIMD bugs which required fixing, and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe, Mooee Schur-type stuff in the Wilson dop.
2015-05-19 13:57:35 +01:00
7fb3221d47
Partial implementation of the vector types SIMD
...
Implementing SSE4 now
A systematic series of tests must be written.
2015-05-19 17:21:17 +09:00
639fd05239
Added check of mpfr and gmp at configure time
...
It automatically generates the linker flags, or complains if the libraries are not found.
2015-05-19 13:54:55 +09:00
d6887beead
Merging with upstream
2015-05-19 13:36:03 +09:00
30494bd96d
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
...
Conflicts:
lib/algorithms/approx/bigfloat.h
2015-05-18 16:34:21 +01:00
193fd5532f
Remez tested
2015-05-18 12:09:25 +01:00
fa1dc5e448
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
6d2accba7b
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
1887c77498
Getting closer to having a wilson solver... introducing a first and untested
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
6eb8dfd902
Working towards solvers
2015-05-17 00:19:03 +01:00
e841395dfd
Updating in preparation for solvers etc.
2015-05-16 23:35:08 +01:00
53260e7a39
Typo fixed
2015-05-16 05:49:32 +01:00
a900790b44
Update Grid_lattice_trace.h
2015-05-16 04:40:28 +01:00
b731bf6976
Pretty syntax
2015-05-16 04:37:26 +01:00
5f8b82b90c
Optimisation and syntax pretty
2015-05-16 04:36:22 +01:00
afda459886
strong inline
2015-05-16 04:33:10 +01:00
f43589369a
Compile options tweak
2015-05-15 12:33:18 +01:00
c99922b591
Out of source compile now working
2015-05-15 12:21:40 +01:00
3ed30169ea
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AVX512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00
180b06d7e3
GCC and ICPC complained on more careful typing
2015-05-15 11:50:44 +01:00
3bd376853c
Move platform dependent out to Grid_simd.h
2015-05-15 11:50:00 +01:00
6bba16ccf7
ngo store
2015-05-15 11:49:39 +01:00