paboyle
|
957a706d0b
|
Useful script
|
2017-04-20 13:17:44 +01:00 |
|
paboyle
|
d2312e9874
|
Drop compressor entirely from Cshift to only Stencil.
|
2017-04-20 13:16:55 +01:00 |
|
paboyle
|
fc4ab9ccd5
|
Working half precision comms
|
2017-04-20 11:20:26 +01:00 |
|
paboyle
|
4a340aa5ca
|
Massive compressor rework to support reduced precision comms
|
2017-04-20 09:28:27 +01:00 |
|
paboyle
|
3b7de792d5
|
Type comparison in the traits work
|
2017-04-18 13:28:04 +01:00 |
|
paboyle
|
557c3fa109
|
Pretty change
|
2017-04-18 13:27:38 +01:00 |
|
paboyle
|
ec18e9f7f6
|
Merge branch 'develop' into feature/half-prec-comms
|
2017-04-18 11:39:39 +01:00 |
|
paboyle
|
a839d5bc55
|
Updated todo list
|
2017-04-18 11:22:17 +01:00 |
|
paboyle
|
de41b84c5c
|
Merge branch 'feature/normHP' into develop
|
2017-04-18 10:57:21 +01:00 |
|
paboyle
|
8e161152e4
|
MultiRHS solver improvements with slice operations moved into lattice and sped up.
Block solver requires a lot of performance work.
|
2017-04-18 10:51:55 +01:00 |
|
paboyle
|
3141ebac10
|
MultiRHS working, starting to optimise. Block doesn't and I thought it already was; puzzled.
|
2017-04-17 10:50:19 +01:00 |
|
paboyle
|
7ede696126
|
Non compile of tests fixed
|
2017-04-16 23:40:00 +01:00 |
|
paboyle
|
bf516c3b81
|
higher precision reduction variables in norm and inner product
|
2017-04-15 12:27:28 +01:00 |
|
paboyle
|
441a52ee5d
|
First cut at higher precision reduction
|
2017-04-15 10:57:21 +01:00 |
|
paboyle
|
a8db024c92
|
Cleaning up the dense matrix and lanczos sector
|
2017-04-15 08:54:11 +01:00 |
|
paboyle
|
a9c22d5f43
|
Verbose removal
|
2017-04-14 14:38:49 +01:00 |
|
paboyle
|
3ca41458a3
|
Fix to no USE_FP16 case
|
2017-04-14 14:20:54 +01:00 |
|
paboyle
|
9e2d29c644
|
USE_FP16 macro
|
2017-04-14 14:17:14 +01:00 |
|
Peter Boyle
|
951be75292
|
Half precision conversion working on AVX512 now too
|
2017-04-13 17:35:11 +01:00 |
|
Peter Boyle
|
b9113ed310
|
Patches for knl
|
2017-04-13 12:02:12 -04:00 |
|
paboyle
|
42fb49d3fd
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-04-13 14:12:47 +01:00 |
|
paboyle
|
2a54c9aaab
|
Merge branch 'feature/block-cg' into develop
|
2017-04-13 14:12:24 +01:00 |
|
paboyle
|
0957378679
|
Fixing conditional ugly way
|
2017-04-13 13:47:56 +01:00 |
|
paboyle
|
2ed6c76fc5
|
Getting multiline if then fi working
|
2017-04-13 13:43:13 +01:00 |
|
paboyle
|
d3b9a7fa14
|
F16c apparently requires AVX, even if the 128 bit are used.
Seems odd.
|
2017-04-13 13:19:11 +01:00 |
|
paboyle
|
75ea306ce9
|
Another try at travis
|
2017-04-13 13:05:32 +01:00 |
|
paboyle
|
4226c633c4
|
Default to FP16 off again
|
2017-04-13 12:51:39 +01:00 |
|
paboyle
|
5a4eafbf7e
|
.travis
|
2017-04-13 12:50:43 +01:00 |
|
paboyle
|
eb8e26018b
|
Travis update for macos
|
2017-04-13 12:35:11 +01:00 |
|
paboyle
|
db5ea001a3
|
Update to use Xcode 8.3 since -mfp16 causes SIGILL
|
2017-04-13 12:22:40 +01:00 |
|
paboyle
|
2846f079e5
|
Predicate tests on fp16 being enabled
|
2017-04-13 12:08:05 +01:00 |
|
paboyle
|
1d502e4ed6
|
FP16 optional compile time
|
2017-04-13 11:55:24 +01:00 |
|
paboyle
|
73cdf0fffe
|
Drop f16c from SSE because of a macos compile error on travis
|
2017-04-13 11:23:41 +01:00 |
|
paboyle
|
1c25773319
|
Trap illegal instructions
|
2017-04-13 10:51:40 +01:00 |
|
paboyle
|
c38400b26f
|
Trap signals
|
2017-04-13 10:35:20 +01:00 |
|
paboyle
|
9c3065b860
|
Debug flags off again
|
2017-04-13 10:01:32 +01:00 |
|
paboyle
|
94eb829d08
|
Align cast fixed for __mm128i gcc complained
|
2017-04-13 08:40:44 +01:00 |
|
paboyle
|
68392ddb5b
|
Exchange in generic
Precision change in AVX, SSE, AVX512, Generic. QPX still to do.
|
2017-04-13 08:38:12 +01:00 |
|
paboyle
|
cb6b81ae82
|
Half precision conversion
|
2017-04-12 19:32:37 +01:00 |
|
|
8ef4300412
|
spurious .dirstamp files removed
|
2017-04-10 17:00:22 +01:00 |
|
|
98a24ebf31
|
The macro “magics” is very intensive for the preprocessor in the measurement code which has numerous serialisable classes. Reducing the number of serialisable fields to 64 (instead of 1024) helps a lot, this is enough for now and can be extended trivially if needed in the future.
|
2017-04-10 16:58:54 +01:00 |
|
paboyle
|
b12dc89d26
|
Commenting and clean up
|
2017-04-10 20:38:20 +09:00 |
|
paboyle
|
d80d802f9d
|
MultiRHS solver test
|
2017-04-10 00:12:12 +09:00 |
|
paboyle
|
3d99b09dba
|
Start of blockCG
|
2017-04-09 23:42:10 +09:00 |
|
paboyle
|
db5f6d3ae3
|
Verbose fix
|
2017-04-09 23:41:30 +09:00 |
|
paboyle
|
683550f116
|
Const args improvement
|
2017-04-09 23:41:04 +09:00 |
|
paboyle
|
55d0329624
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2017-04-07 11:08:14 +09:00 |
|
paboyle
|
86aaa35294
|
Christoph needs SchurDiagTwoKappa which is mobius specific.
|
2017-04-07 11:07:40 +09:00 |
|
Guido Cossu
|
172d3dc93a
|
Correcting names in tests
|
2017-04-05 16:24:04 +01:00 |
|
paboyle
|
5592f7b8c1
|
Creation mode better implementation
|
2017-04-05 02:35:34 +09:00 |
|