neo
3a3f54932a
Implemented all SSE4 functions.
...
A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK.
2015-05-20 17:22:40 +09:00
Peter Boyle
dc4014668d
Merging in
...
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 21:30:13 +01:00
Peter Boyle
3f57662cd0
Build a simple kernel to compare intel compiler and clang in simple environment
2015-05-19 21:29:40 +01:00
Peter Boyle
b562b50196
Reworking to keep intel compiler happy
2015-05-19 21:29:07 +01:00
Peter Boyle
46ab8edf30
Optimisation...
2015-05-19 15:50:47 +01:00
Peter Boyle
5f0530b68a
Merge branch 'coppolachan-master'
2015-05-19 15:05:32 +01:00
Peter Boyle
3d66d00313
Merged
...
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
Conflicts:
lib/simd/Grid_vector_types.h
2015-05-19 15:05:07 +01:00
azusayamaguchi
a4b3bc7714
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-19 14:55:26 +01:00
azusayamaguchi
592cec72e2
Add messages to get the number of threads for openmp
2015-05-19 14:54:42 +01:00
Peter Boyle
ffc00caea3
Got unpreconditioned conjugate gradient to run and converge on a random (uniform random,
...
not even SU(3) for now) gauge field. Convergence history is correctly indepdendent of decomposition
on 1,2,4,8,16 mpi tasks.
Found a couple of simd bugs which required fixed and enhanced the Grid_simd.cc test suite.
Implemented the Mdag, M, MdagM, Meooe Mooee schur type stuff in the wilson dop.
2015-05-19 13:57:35 +01:00
neo
b29caead32
Partial implementation of the vector types SIMD
...
Implementing SSE4 now
A systematic series of tests must be written.
2015-05-19 17:21:17 +09:00
neo
4cadf11d1d
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
neo
b5af3fbe45
Merging with upstream
2015-05-19 13:36:03 +09:00
Peter Boyle
c7314e526e
Merge branch 'coppolachan-master'
2015-05-18 16:36:58 +01:00
Peter Boyle
f9a8377fe6
lib/algorithms/approx/bigfloat.h
2015-05-18 16:35:48 +01:00
Peter Boyle
cf9bbee256
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
...
Conflicts:
lib/algorithms/approx/bigfloat.h
2015-05-18 16:34:21 +01:00
Peter Boyle
a19deba26c
Convience function
2015-05-18 16:28:29 +01:00
Peter Boyle
2843264bd8
Remez tested
2015-05-18 12:09:25 +01:00
neo
17e4e478cd
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
neo
cee363e28c
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
Peter Boyle
d0e4673a3f
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
Peter Boyle
8e99e4671f
Working towards solvers
2015-05-17 00:19:03 +01:00
Peter Boyle
dc6b6bdc96
Updating preparing for solvers etc..
2015-05-16 23:35:08 +01:00
Peter Boyle
9b0aae665f
Better build automation
2015-05-16 07:16:45 +01:00
Peter Boyle
4a0da933f0
Merge branch 'master' of https://github.com/paboyle/Grid
2015-05-16 06:42:03 +01:00
Peter Boyle
e2f6745a0e
Moved things around
2015-05-16 06:40:10 +01:00
Peter Boyle
39e7ef1243
Typoo xifed
2015-05-16 05:49:32 +01:00
Peter Boyle
9c38a52bad
Update Grid_lattice_trace.h
2015-05-16 04:40:28 +01:00
Peter Boyle
1247d7aea8
Pretty syntax
2015-05-16 04:37:26 +01:00
Peter Boyle
9f0e990b40
Optimisation and syntax pretty
2015-05-16 04:36:22 +01:00
Peter Boyle
56667e9d32
more digits
2015-05-16 04:33:40 +01:00
Peter Boyle
49f56a25d1
strong inline
2015-05-16 04:33:10 +01:00
Peter Boyle
c2ca396353
Extra compile targs
2015-05-15 14:41:59 +01:00
Peter Boyle
7a63bdbd72
Added su3 matrix benchmark.
2015-05-15 14:41:19 +01:00
Peter Boyle
516aac6518
Log the bug report code into the git repo.
2015-05-15 12:39:53 +01:00
Peter Boyle
675fd1a065
Compile options tweak
2015-05-15 12:33:18 +01:00
Peter Boyle
a98f3e0f5e
Out of source compile now working
2015-05-15 12:21:40 +01:00
Peter Boyle
f92fda0cfd
Convenience multi-compiler build with out of source compile
2015-05-15 12:21:10 +01:00
Peter Boyle
100323ab4d
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00
Peter Boyle
6965a136a0
Remove debug masking
2015-05-15 11:51:15 +01:00
Peter Boyle
254dee6ac7
GCC and ICPC complained on more careful typeing
2015-05-15 11:50:44 +01:00
Peter Boyle
264850bc16
Move platform dependent out to Grid_simd.h
2015-05-15 11:50:00 +01:00
Peter Boyle
9a120cf5ec
ngo store
2015-05-15 11:49:39 +01:00
Peter Boyle
8d77d758c3
Parallel for replace
2015-05-15 11:48:04 +01:00
Peter Boyle
0e7945fe54
Forces inlining upon icpc
2015-05-15 11:43:49 +01:00
Peter Boyle
bd721ce1c8
Force inlining upon icpc
2015-05-15 11:43:20 +01:00
Peter Boyle
a852d13f03
More elegant enable_if
2015-05-15 11:42:51 +01:00
Peter Boyle
a26fdab719
More elegant to do boolean logic inside the enable_if construct
...
Should have done that from the beginning and should move this into
a global edit
2015-05-15 11:42:03 +01:00
Peter Boyle
af6e8f7829
Force inlining on ICPC because inline apparently is not enoguh
2015-05-15 11:41:31 +01:00
Peter Boyle
cbfa4097b4
strong_inline forces ICPC to do it.
2015-05-15 11:40:59 +01:00