3a3f54932a
Implemented all SSE4 functions.
...
A test code Grid_simd_new.cc has been created to test the new class.
Tests are all OK.
2015-05-20 17:22:40 +09:00
b29caead32
Partial implementation of the vector types SIMD
...
Implementing SSE4 now
A systematic series of tests must be written.
2015-05-19 17:21:17 +09:00
4cadf11d1d
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
b5af3fbe45
Merging with upstream
2015-05-19 13:36:03 +09:00
cf9bbee256
Merge branch 'master' of https://github.com/coppolachan/Grid into coppolachan-master
...
Conflicts:
lib/algorithms/approx/bigfloat.h
2015-05-18 16:34:21 +01:00
2843264bd8
Remez tested
2015-05-18 12:09:25 +01:00
17e4e478cd
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
cee363e28c
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
d0e4673a3f
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
8e99e4671f
Working towards solvers
2015-05-17 00:19:03 +01:00
dc6b6bdc96
Updating preparing for solvers etc..
2015-05-16 23:35:08 +01:00
39e7ef1243
Typoo xifed
2015-05-16 05:49:32 +01:00
9c38a52bad
Update Grid_lattice_trace.h
2015-05-16 04:40:28 +01:00
1247d7aea8
Pretty syntax
2015-05-16 04:37:26 +01:00
9f0e990b40
Optimisation and syntax pretty
2015-05-16 04:36:22 +01:00
49f56a25d1
strong inline
2015-05-16 04:33:10 +01:00
675fd1a065
Compile options tweak
2015-05-15 12:33:18 +01:00
a98f3e0f5e
Out of source compile now working
2015-05-15 12:21:40 +01:00
100323ab4d
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00
254dee6ac7
GCC and ICPC complained on more careful typeing
2015-05-15 11:50:44 +01:00
264850bc16
Move platform dependent out to Grid_simd.h
2015-05-15 11:50:00 +01:00
9a120cf5ec
ngo store
2015-05-15 11:49:39 +01:00
8d77d758c3
Parallel for replace
2015-05-15 11:48:04 +01:00
0e7945fe54
Forces inlining upon icpc
2015-05-15 11:43:49 +01:00
bd721ce1c8
Force inlining upon icpc
2015-05-15 11:43:20 +01:00
a852d13f03
More elegant enable_if
2015-05-15 11:42:51 +01:00
a26fdab719
More elegant to do boolean logic inside the enable_if construct
...
Should have done that from the beginning and should move this into
a global edit
2015-05-15 11:42:03 +01:00
af6e8f7829
Force inlining on ICPC because inline apparently is not enoguh
2015-05-15 11:41:31 +01:00
cbfa4097b4
strong_inline forces ICPC to do it.
2015-05-15 11:40:59 +01:00
8c40dd9c4f
Force strong_inline to force ipcc's hand
2015-05-15 11:40:31 +01:00
b38bf82d48
Switch to strong_inline macro to force icpc's hand
2015-05-15 11:40:00 +01:00
adc4f86020
Promote to strong inline to force ICPC's hand. Annoying.
2015-05-15 11:39:25 +01:00
5b46992a15
Formatting change
2015-05-15 11:38:54 +01:00
e7d25647e6
Filed bug report Bug 66153 on GCC-5.
...
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:38:04 +01:00
c28551f40f
Silly formatting change
2015-05-15 11:37:07 +01:00
6c7eb60d6f
gcc doesn't like collapse(2) for some reason I can't figure
2015-05-15 11:36:22 +01:00
051b23fe10
ICPC and GCC5 fixes
2015-05-15 11:35:02 +01:00
4e462209c7
Using boolean logic inside enable_if is more elegant
2015-05-15 11:32:45 +01:00
8d1b26dd4b
Key of mm_malloc.h
2015-05-15 11:32:11 +01:00
cc6218a692
strong inline required to force icpc
2015-05-15 11:31:41 +01:00
5166888c0a
Linear op added
2015-05-13 11:25:34 +01:00
0097b81778
OMP dslash working
2015-05-13 10:59:22 +01:00
add4495a4a
cout IO for all types
2015-05-13 09:24:10 +01:00
541d52ab97
I have made the Cshift work successfully with open mp threading in
...
every routine. Collapse(2) is now working under clang-omp++.
2015-05-13 00:31:00 +01:00
556befaaaa
Enhanced SIMD interfacing
2015-05-12 20:41:44 +01:00
c6baa3e657
Threading support rework.
...
Placed parallel pragmas as macros; implemented deterministic thread reduction in style of
BFM.
2015-05-12 07:51:41 +01:00
6e6843ac69
Moving some things around for pretty
2015-05-11 19:09:49 +01:00
c8dc8ff891
Adding a better controlled threading class, preparing to
...
force in deterministic reduction.
2015-05-11 18:59:03 +01:00
b613ed0bb8
Got command line args working
2015-05-11 14:36:48 +01:00
b42453d1fd
Command line args and a general clean up
2015-05-11 12:43:10 +01:00