neo
6b8fe04054
Experimental support for ARM
2015-06-09 15:46:21 +09:00
Peter Boyle
b9e9777912
PartialFraction Hw with Zolo and Tanh approx converged under CG and passed EO breakdown
...
and hermiticity tests.
2015-06-04 13:28:37 +01:00
neo
b9edadc53e
Addedd Ta functionality to the tensor types
...
Merge remote-tracking branch 'upstream/master'
Conflicts:
configure
2015-06-04 18:11:32 +09:00
neo
ff9340d4d5
Small modification to the configure files
2015-06-04 14:17:58 +09:00
Peter Boyle
4bcc319e11
Reorganise of file naming
2015-06-03 12:47:05 +01:00
neo
575e6001f3
Merge remote-tracking branch 'upstream/master'
2015-05-29 11:41:39 +09:00
neo
4403e117a7
Some modifications to the configure to check SIMD support
2015-05-29 11:41:02 +09:00
Peter Boyle
e8be96bfe7
Improving the reduction to go through our on permute.
...
Must also do this for avx512
2015-05-27 16:07:17 +01:00
neo
be66fdcfab
Check at configure time if CPU supports the requested SIMD optimization
2015-05-27 18:30:11 +09:00
neo
4e3f4104ab
Added a .gitignore file to eliminate autoconf files from commits
2015-05-27 11:10:51 +09:00
Guido Cossu
2ccbff6c6c
Corrected AVX regression error. Tested.
2015-05-27 10:49:33 +09:00
neo
538bc41bbb
Adding support for doxygen generation
2015-05-27 10:34:56 +09:00
neo
ece86f717b
checked performance of new vector libaries.
...
Added check for c++11 support on the configure.ac
2015-05-26 12:02:54 +09:00
neo
4cadf11d1d
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
neo
17e4e478cd
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
neo
cee363e28c
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00
Peter Boyle
d0e4673a3f
Getting closer to having a wilson solver... introducing a first and untested
...
cut at Conjugate gradient. Also copied in Remez, Zolotarev, Chebyshev from
Mike Clark, Tony Kennedy and my BFM package respectively since we know we will
need these. I wanted the structure of
algorithms/approx
algorithms/iterative
etc.. to start taking shape.
2015-05-18 07:47:05 +01:00
Peter Boyle
100323ab4d
clang++ 3.4/5/7 compile happy for AVX and SSE
...
icpc compiles happy on MacOSX both with -xCOMMON-AV512 and native AVX
gcc-5 does not compile happy; can work around by renaming lattice peek/poke/transpose/trace templates
relative to tensor ones, but gcc goes into a recursive template instantiation due to
matching error. I think this is a gcc bug and have filed a report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66153
2015-05-15 11:52:11 +01:00
Peter Boyle
541d52ab97
I have made the Cshift work successfully with open mp threading in
...
every routine. Collapse(2) is now working under clang-omp++.
2015-05-13 00:31:00 +01:00
Peter Boyle
b613ed0bb8
Got command line args working
2015-05-11 14:36:48 +01:00
Peter Boyle
2203c6e597
Lots of changes required to compile for MIC under ICPC
2015-05-10 23:29:21 +01:00
Peter Boyle
52403d587c
Wilson perf improvements with Gauge prefetching
2015-05-06 06:37:21 +01:00
Peter Boyle
cdd5cdeda2
Cleaned up for Linux
2015-05-05 22:09:22 +01:00
Peter Boyle
6a39089a43
Starting a benchmarking sub dir
2015-05-02 17:52:36 +01:00
Peter Boyle
bdf18941a2
Improving the byte swap support for portability
2015-05-01 10:57:33 +01:00
Peter Boyle
b7416d79e3
Begginings of slice summation and subblocking
2015-04-23 11:04:59 +01:00
Peter Boyle
1851327d19
Got the NERSC IO working and fixed a bug in cshift.
2015-04-22 22:46:48 +01:00
Peter Boyle
388b735fd0
Build reorg
2015-04-18 14:56:05 +01:00
Peter Boyle
1972eea128
spin trace type work
2015-04-16 14:48:21 +01:00
Peter Boyle
982274e5a0
Major rework of extract/merge/permute processing debugged and working.
2015-04-06 11:26:24 +01:00
Peter Boyle
7b97e50b7b
MPI is now working and passing basic tests. Will start to construct a more sensible test suite shortly
...
since testing requirements now go beyond what a single Grid_main.cc can do.
Will need a more organised src tree for this and will require substantial reorg of build system.
2015-04-03 04:52:53 +01:00
Peter Boyle
e0af0e658d
Commit
2015-03-29 22:04:49 +01:00
Azusa Yamaguchi
4e8b9c6928
Changes for MIC
2015-03-04 13:25:23 +00:00
Peter Boyle
523abad40f
Place them in to avoid forced autoreconf on user
2015-03-04 11:53:59 +00:00