b65e72e521
Merge pull request #43 from rprollins/bench/output-format
...
Benchmark_dwf_sweep and Benchmark_zmm output formats
2016-08-04 16:47:01 +01:00
d1aaff65e8
README update
2016-08-04 16:27:02 +01:00
93d29bb699
build system improvements after discussion with Peter
2016-08-04 16:19:59 +01:00
3b376ed54e
build system: error if MPI not found
2016-08-03 15:23:38 +01:00
d5c1f614ba
gitignore update
2016-08-03 15:14:33 +01:00
2edc24225d
untracking ltmain.sh
2016-08-03 15:12:44 +01:00
629283726b
build system: local Grid link flag moved to configure.ac
2016-08-03 15:07:42 +01:00
6adb66dd08
build system: finer management of GMP/MPFR dependence
2016-08-03 15:06:45 +01:00
5be92bb708
link fix in README
2016-08-03 12:40:56 +01:00
f4c049ea6d
README update
2016-08-03 12:38:54 +01:00
bc092ad30f
build system fix
2016-08-03 11:47:38 +01:00
dad642ed1b
various build system fixes and improvements
2016-08-03 11:39:20 +01:00
63ae39abc7
proper propagation of OpenMP flags
2016-08-02 17:41:32 +01:00
9e5b934d21
improved LAPACK configuration
2016-08-02 17:26:54 +01:00
a7b483d67a
Tests in subdirectories are not built by default
2016-08-02 12:14:28 +01:00
bb99ce0680
bootstrap script fix
2016-08-01 09:51:06 +01:00
83307df1af
travis update for new build system
2016-08-01 09:38:40 +01:00
e9f30cab2c
first working version for the new build system
2016-07-30 17:53:18 +01:00
Richard Rollins
df6c9f55d1
Use common benchmark output format for dwf_sweep and zmm
2016-07-20 17:38:56 +01:00
paboyle
27f3ecc833
Merge branch 'feature/bugfix-ck-cj' into develop
2016-07-16 01:59:52 +01:00
paboyle
f9e90eeb1f
Sign error on the force for 4d fields fixed
2016-07-16 01:52:44 +01:00
paboyle
fad5c675eb
sign error on the 4d gparity force
2016-07-16 01:51:56 +01:00
paboyle
4908b77d46
Fixed conflicts. PLEASE avoid making wholesale cosmetic only changes, this created
...
a HUGE amount of difficult to resolve and understand conflicts .
Wholesale formatting, reordering functions etc... in a central file like Tensor_class
or Grid_vector_types while others are also editing without making substantial functionality
changes creates pain.
2016-07-15 20:59:07 +01:00
paboyle
f4dd5062d7
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-07-15 19:26:06 +01:00
paboyle
da34d75841
Merge branch 'feature/Ls-vectorised-actions' into develop
2016-07-15 19:09:47 +01:00
paboyle
980ff18956
Solving the instantiation no compile issue
2016-07-15 17:19:44 +01:00
paboyle
1a6c7204ac
Disable instantiation; Use cache version instead
2016-07-15 00:34:39 +01:00
paboyle
49310fbab3
Done with red black change over
2016-07-15 00:08:43 +01:00
paboyle
6049d5ac47
Update
2016-07-15 00:08:32 +01:00
paboyle
35d0d35238
Updated file list
2016-07-15 00:02:53 +01:00
paboyle
c0e878705e
Updated file list
2016-07-15 00:02:39 +01:00
paboyle
5c0c8efb9e
Updated file list
2016-07-15 00:02:11 +01:00
paboyle
dfd714e1ef
Multiple implementations for the 5d hopping terms, depending on cache friendly
...
ops and/or the 5th direction being vectorised
All use 4d redblack.
2016-07-15 00:00:09 +01:00
paboyle
79a8ca1a62
Rewrite for performance. Impl dependent instantiations give
...
4d linalg impls of the 5d hopping terms (and inverse)
Cache friendly loop orderings of the above
Dense matrix stored and apply to the above
-- Switch to Ls vectorised, and use dense matrix approach for the MooeeInv
and rotate/shift of the Mooee M5D routines.
2016-07-14 23:58:15 +01:00
paboyle
fb45eb2eb2
5d ls vec rename of impl class
2016-07-14 23:57:26 +01:00
paboyle
a307274c96
Fermion impl rename for ls vectorised 5d approaches
2016-07-14 23:56:13 +01:00
paboyle
3f2c44a5fe
Updating the class to 5d selection based on impl type
2016-07-14 23:55:26 +01:00
paboyle
48fb1cdc11
Update domain 5d vectorised impl type, move the type over to 4d redblack with
...
the dense OO inverse
2016-07-14 23:54:35 +01:00
paboyle
8a79e93cc2
Rename the 5d domain wall fermion vectorised Ls impl class
2016-07-14 23:53:00 +01:00
paboyle
3493b51879
Modest updates
2016-07-14 23:52:13 +01:00
paboyle
de3e79d300
red black for Ls vectorised is 4d red black. Update accordingly now I've made this choice
2016-07-14 23:49:42 +01:00
paboyle
dd62a61c5c
Added broadcast and rotation of simd vectors
2016-07-14 23:49:00 +01:00
paboyle
8f47d0b5ab
Rotation needed for hopping term in fifth dim with Ls vectorised fields
2016-07-14 23:45:36 +01:00
paboyle
42af132dab
Fix for chris kellys request to peek poke on checkerboarded fields
2016-07-14 23:44:48 +01:00
paboyle
9db2c6525d
updating benchmarks for red black 4d for Ls vectorised code
2016-07-14 23:44:02 +01:00
paboyle
adbc7c1188
Adding files for multiple implementations (cache opt) and Ls vectorisation
...
of the 5D cayley form chiral fermions for the 5d matrix. With Ls entirely
in the vector direction, s-hopping terms involve rotations.
The serial dependence of the LDU inversion for Mobius and 4d even odd
checkerboarding is removed by simply applying Ls^2 operations (vectorised
many ways) as a dense matrix operation.
This should give similar throughput but high flops (non-compulsory flops)
but enable use of the KNL cache friendly kernels throughout the code.
Ls is still constrained to be a multiple of Nsimd, which is as much as 8 for AVX512
with single precision.
2016-07-14 22:59:21 +01:00
Christopher Kelly
8b9301a74c
Merge branch 'feature/bugfixes' into develop
2016-07-13 12:31:34 -04:00
Christopher Kelly
6f47fbb1e2
Disabled parallel for loops in ExtractSlice and InsertSlice due to race conditions. Likely will need to do so for localConvert too.
2016-07-13 10:49:18 -04:00
Christopher Kelly
a3c0fb79b6
Fix to iVector and iMatrix pokeIndex and checkerboard local site indexing.
2016-07-11 17:15:22 -04:00
paboyle
62601bb649
Bug fix
2016-07-08 20:46:29 +01:00