bd81ac0f17
Improving the reduction to go through our on permute.
...
Must also do this for avx512
2015-05-27 16:07:17 +01:00
75442e48ce
Added a .gitignore file to eliminate autoconf files from commits
2015-05-27 11:10:51 +09:00
26ec41288d
Corrected AVX regression error. Tested.
2015-05-27 10:49:33 +09:00
12ae11ef62
Adding support for doxygen generation
2015-05-27 10:34:56 +09:00
377083e6ae
checked performance of new vector libaries.
...
Added check for c++11 support on the configure.ac
2015-05-26 12:02:54 +09:00
639fd05239
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
fa1dc5e448
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
6d2accba7b
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
1887c77498
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
3ed30169ea
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00
b4a570477c
I have made the Cshift work successfully with open mp threading in
...
every routine. Collapse(2) is now working under clang-omp++.
2015-05-13 00:31:00 +01:00
ebcb87abe1
Got command line args working
2015-05-11 14:36:48 +01:00
242e447bc5
Lots of changes required to compile for MIC under ICPC
2015-05-10 23:29:21 +01:00
5415180676
Wilson perf improvements with Gauge prefetching
2015-05-06 06:37:21 +01:00
7b0dd6c5d6
Cleaned up for Linux
2015-05-05 22:09:22 +01:00
6a39089a43
Starting a benchmarking sub dir
2015-05-02 17:52:36 +01:00
bdf18941a2
Improving the byte swap support for portability
2015-05-01 10:57:33 +01:00
b7416d79e3
Begginings of slice summation and subblocking
2015-04-23 11:04:59 +01:00
1851327d19
Got the NERSC IO working and fixed a bug in cshift.
2015-04-22 22:46:48 +01:00
388b735fd0
Build reorg
2015-04-18 14:56:05 +01:00
1972eea128
spin trace type work
2015-04-16 14:48:21 +01:00
982274e5a0
Major rework of extract/merge/permute processing debugged and working.
2015-04-06 11:26:24 +01:00
7b97e50b7b
MPI is now working and passing basic tests. Will start to construct a more sensible test suite shortly
...
since testing requirements now go beyond what a single Grid_main.cc can do.
Will need a more organised src tree for this and will require substantial reorg of build system.
2015-04-03 04:52:53 +01:00
e0af0e658d
Commit
2015-03-29 22:04:49 +01:00
4e8b9c6928
Changes for MIC
2015-03-04 13:25:23 +00:00
523abad40f
Place them in to avoid forced autoreconf on user
2015-03-04 11:53:59 +00:00