fb37b57c2d
Merge branch 'master' of https://github.com/paboyle/Grid
...
Conflicts:
lib/Grid_simd.h
2015-05-26 20:04:08 +01:00
5e72e4c0d9
Strip out the dslash kernel implementation
2015-05-26 19:55:18 +01:00
a32ac287bb
Hand unrolled version of dslash in a separate class.
...
Useful to compare; raises Intel compiler from 9 GFlop/s to 17.5 GFlop/s
on an Ivy Bridge core. Raises Clang from 14.5 to 17.5 GFlop/s.
2015-05-26 19:54:03 +01:00
fb5d72973e
More cleanup of Grid_simd.h
2015-05-26 13:54:34 +09:00
3f576830f9
Cleaning up simd files
2015-05-26 13:31:10 +09:00
257aa92421
Merge remote-tracking branch 'upstream/master'
...
Conflicts:
lib/math/Grid_math_tensors.h
lib/simd/Grid_vector_types.h
2015-05-26 13:14:06 +09:00
ece86f717b
Checked performance of new vector libraries.
...
Added check for C++11 support in configure.ac
2015-05-26 12:02:54 +09:00
2ae6214104
Schur complement based red-black inversion working
2015-05-25 13:47:12 +01:00
1a9841a0f1
Better EO support letting Schur solver work
2015-05-25 13:46:28 +01:00
55685b7cf5
Mostly cosmetic
2015-05-25 13:45:32 +01:00
3358a77c7a
Better checkerboard tracking.
2015-05-25 13:45:08 +01:00
bc947477f3
move constants into red black
2015-05-25 13:44:35 +01:00
29f72292ba
Updates now schur red black solver working
2015-05-25 13:43:58 +01:00
9b5633ff4f
Herm op
2015-05-25 13:42:36 +01:00
17a06af1ff
red black fix
2015-05-25 13:42:12 +01:00
c25016030c
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-23 09:36:08 +01:00
65f2e6b269
Improving even odd sector; a lot of work, and thorough cleaning was required for this
2015-05-23 09:34:16 +01:00
d07a5c084d
Rely on default constructors
2015-05-23 09:33:42 +01:00
a2928321b6
Better pragma use
2015-05-23 09:32:37 +01:00
764732944f
Cosmetic
2015-05-23 09:31:15 +01:00
ae58a9ada2
Iterator required
2015-05-23 09:30:28 +01:00
1c862dc15b
Completed implementation of new Grid_simd classes
...
Tested performance for SSE4, Ok.
AVX1/2, AVX512 yet untested
2015-05-22 17:33:15 +09:00
d8061afe24
Streaming store option ifdef
2015-05-21 06:47:05 +01:00
874b2eb32d
Compile time select if we do the streaming store copy. Relies on Clang++ eliminating object copies,
...
and other compilers do not necessarily cope.
2015-05-21 06:39:00 +01:00
57a01e6bbb
Didn't like a print statement
2015-05-21 06:36:15 +01:00
35055ed5c1
Unroll pragma abstraction
2015-05-21 06:34:33 +01:00
9098d7d0a3
Merge remote-tracking branch 'upstream/master'
...
Conflicts:
lib/simd/Grid_vector_types.h
tests/Makefile.am
2015-05-20 17:32:46 +09:00
3a3f54932a
Implemented all SSE4 functions.
...
A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK.
2015-05-20 17:22:40 +09:00
dc4014668d
Merging in
...
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 21:30:13 +01:00
b562b50196
Reworking to keep intel compiler happy
2015-05-19 21:29:07 +01:00
46ab8edf30
Optimisation...
2015-05-19 15:50:47 +01:00
3d66d00313
Merged
...
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
Conflicts:
lib/simd/Grid_vector_types.h
2015-05-19 15:05:07 +01:00
a4b3bc7714
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 14:55:26 +01:00
592cec72e2
Add messages to get the number of threads for openmp
2015-05-19 14:54:42 +01:00
ffc00caea3
Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,
...
not even SU(3) for now) gauge field. Convergence history is correctly independent of decomposition
on 1,2,4,8,16 mpi tasks.
Found a couple of simd bugs which required fixing and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe Mooee schur type stuff in the wilson dop.
2015-05-19 13:57:35 +01:00
b29caead32
Partial implementation of the vector types SIMD
...
Implementing SSE4 now
A systematic series of tests must be written.
2015-05-19 17:21:17 +09:00
4cadf11d1d
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
b5af3fbe45
Merging with upstream
2015-05-19 13:36:03 +09:00
cf9bbee256
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
...
Conflicts:
lib/algorithms/approx/bigfloat.h
2015-05-18 16:34:21 +01:00
2843264bd8
Remez tested
2015-05-18 12:09:25 +01:00
17e4e478cd
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
cee363e28c
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
d0e4673a3f
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
8e99e4671f
Working towards solvers
2015-05-17 00:19:03 +01:00
dc6b6bdc96
Updating preparing for solvers etc..
2015-05-16 23:35:08 +01:00
39e7ef1243
Typoo xifed
2015-05-16 05:49:32 +01:00
9c38a52bad
Update Grid_lattice_trace.h
2015-05-16 04:40:28 +01:00
1247d7aea8
Pretty syntax
2015-05-16 04:37:26 +01:00
9f0e990b40
Optimisation and syntax pretty
2015-05-16 04:36:22 +01:00
49f56a25d1
strong inline
2015-05-16 04:33:10 +01:00