neo
|
cee363e28c
|
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
|
2015-05-18 16:48:14 +09:00 |
|
Peter Boyle
|
675fd1a065
|
Compile options tweak
|
2015-05-15 12:33:18 +01:00 |
|
Peter Boyle
|
9a120cf5ec
|
ngo store
|
2015-05-15 11:49:39 +01:00 |
|
Peter Boyle
|
556befaaaa
|
Enhanced SIMD interfacing
|
2015-05-12 20:41:44 +01:00 |
|
Peter Boyle
|
c6baa3e657
|
Threading support rework.
Placed parallel pragmas as macros; implemented deterministic thread reduction in style of
BFM.
|
2015-05-12 07:51:41 +01:00 |
|
Peter Boyle
|
2203c6e597
|
Lots of changes required to compile for MIC under ICPC
|
2015-05-10 23:29:21 +01:00 |
|
Peter Boyle
|
52403d587c
|
Wilson perf improvements with Gauge prefetching
|
2015-05-06 06:37:21 +01:00 |
|
Peter Boyle
|
249165d1b2
|
Added streaming stores
|
2015-05-05 18:09:28 +01:00 |
|
Peter Boyle
|
9d93d1e6d4
|
Comms and memory benchmarks added
|
2015-05-03 09:44:47 +01:00 |
|
Peter Boyle
|
b0485894b3
|
Shaken out stencil to the point where I think wilson dslash is correct.
Need to audit code carefully, consolidate between stencil and cshift,
and then benchmark and optimise.
|
2015-04-28 08:11:59 +01:00 |
|
Peter Boyle
|
35cfef2129
|
Big updates with progress towards wilson matrix
|
2015-04-26 15:51:09 +01:00 |
|
Peter Boyle
|
b8eef54fa7
|
First implementation of Dirac matrices as a Gamma class.
|
2015-04-24 18:20:03 +01:00 |
|
Peter Boyle
|
a9e574dd27
|
Snippets from Guido to optimise Reduce
|
2015-04-23 08:31:40 +01:00 |
|
Peter Boyle
|
aee6669d0b
|
Build reorg with which I am a bit happier
|
2015-04-18 21:22:50 +01:00 |
|