azusayamaguchi
|
b3e7f600da
|
Partial implementation of 4d vectorisation assembler
|
2016-12-16 23:50:30 +00:00 |
|
azusayamaguchi
|
d4071daf2a
|
Template specialise
|
2016-12-16 22:28:29 +00:00 |
|
azusayamaguchi
|
a2a6329094
|
AVX512 only for ASM compilation
|
2016-12-16 22:03:29 +00:00 |
|
azusayamaguchi
|
eabc577940
|
Assembler possibly working
|
2016-12-16 16:55:36 +00:00 |
|
Azusa Yamaguchi
|
426197e446
|
Nc=3
|
2016-12-12 09:10:54 +00:00 |
|
Azusa Yamaguchi
|
99e2c1e666
|
Kernels options
|
2016-12-12 09:08:53 +00:00 |
|
Azusa Yamaguchi
|
1440565a10
|
Decrease verbosity
|
2016-12-12 09:08:04 +00:00 |
|
Azusa Yamaguchi
|
c097fd041a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-29 13:44:17 +00:00 |
|
Azusa Yamaguchi
|
77fb25fb29
|
Push 5d tests
|
2016-11-29 13:43:56 +00:00 |
|
Azusa Yamaguchi
|
389e0a77bd
|
Staggerd Fermion 5D
|
2016-11-29 13:13:56 +00:00 |
|
Guido Cossu
|
ae9688e343
|
Reporting also the total mflops
|
2016-11-28 11:37:02 +00:00 |
|
Azusa Yamaguchi
|
668ca57702
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
|
2016-11-22 13:49:11 +00:00 |
|
azusayamaguchi
|
f7b60004f3
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-04 16:08:07 +00:00 |
|
azusayamaguchi
|
b7d55f7dfb
|
Fix a typo in reorg of the --dslash-asm
|
2016-11-04 11:35:08 +00:00 |
|
Azusa Yamaguchi
|
ee686a7d85
|
Compiles now
|
2016-11-03 16:58:23 +00:00 |
|
Azusa Yamaguchi
|
1c5b7a6be5
|
Staggered phases first cut, c1, c2, u0
|
2016-11-03 16:26:56 +00:00 |
|
|
75bbf6a0af
|
Merge branch 'develop' into feature/feynman-rules
|
2016-11-03 13:52:11 +00:00 |
|
paboyle
|
c067051d5f
|
Merge branch 'develop' into release/v0.6.0
|
2016-11-02 13:59:18 +00:00 |
|
Guido Cossu
|
ae8561892e
|
Eliminating useless defines
|
2016-11-02 10:21:06 +00:00 |
|
paboyle
|
bb94ddd0eb
|
Tidy up of mpi3; also some cleaning of the dslash controls.
|
2016-11-02 08:07:09 +00:00 |
|
Azusa Yamaguchi
|
164d3691db
|
Staggered
|
2016-11-01 14:24:22 +00:00 |
|
Guido Cossu
|
e8c3174ae2
|
Small change in the defines
|
2016-10-30 12:23:11 +00:00 |
|
Guido Cossu
|
9b066e94d0
|
Compilation with both single and double precision
|
2016-10-30 12:04:06 +00:00 |
|
Guido Cossu
|
e1042aef77
|
First version of the doube prec for testing purposes
It does not compile single and double version at the same time
|
2016-10-28 17:20:04 +01:00 |
|
|
ca21003f01
|
Merge branch 'feature/fft-opt' into feature/feynman-rules
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
|
2016-10-26 18:44:47 +01:00 |
|
azusayamaguchi
|
c190221fd3
|
Internal SHM comms in non-simd directions working
Need to fix simd directions
|
2016-10-22 18:14:27 +01:00 |
|
azusayamaguchi
|
6a9eae6b6b
|
Reporting improvements
|
2016-10-21 13:36:18 +01:00 |
|
|
bd6a228af6
|
Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules
|
2016-10-21 13:10:30 +01:00 |
|
paboyle
|
b58adc6a4b
|
commVector
|
2016-10-20 17:00:15 +01:00 |
|
|
997fd882ff
|
Merge branch 'develop' into feature/feynman-rules
# Conflicts:
# lib/Threads.h
# lib/qcd/action/fermion/WilsonFermion.cc
# lib/qcd/action/fermion/WilsonFermion.h
# lib/qcd/utils/SUn.h
# lib/simd/Grid_avx.h
# lib/simd/Intel512common.h
|
2016-10-19 18:35:18 +01:00 |
|
azusayamaguchi
|
81f2aeaece
|
KNL streaming stores, and KNL performance coutners
|
2016-10-12 11:45:22 +01:00 |
|
paboyle
|
3619167d62
|
Mass parameter
|
2016-10-10 23:47:33 +01:00 |
|
paboyle
|
96f1d1b828
|
Debugged Domain wall and Overlap feynman rules (infinite Ls, finite mass).
|
2016-10-10 23:46:45 +01:00 |
|
paboyle
|
657e0a8f4d
|
Mass parameter
|
2016-10-10 23:46:10 +01:00 |
|
paboyle
|
616e7cd83e
|
Mass parameter
|
2016-10-10 23:45:48 +01:00 |
|
paboyle
|
6f26d2e8d4
|
Overlap tree level feynman rule
|
2016-10-10 23:45:18 +01:00 |
|
paboyle
|
c014574504
|
A "please implement me" feynman rule. If this were abstract virtual it would
require/force implementation
|
2016-10-10 23:44:00 +01:00 |
|
paboyle
|
d7ce164e6e
|
Feynman rule for DWF
|
2016-10-10 23:43:36 +01:00 |
|
paboyle
|
c0d5b99016
|
Dminus
|
2016-10-10 23:43:19 +01:00 |
|
paboyle
|
09ca32d678
|
Dminus added for Cayley
|
2016-10-10 23:42:55 +01:00 |
|
Guido Cossu
|
b56c9ffa52
|
Fix for AVXFMA
|
2016-10-10 14:43:37 +01:00 |
|
Guido Cossu
|
2e453dfbf5
|
Added some instrumentation to benchmark the force computation
|
2016-10-06 17:52:45 +01:00 |
|
paboyle
|
4089984431
|
Timing hooks
|
2016-10-06 09:25:12 +01:00 |
|
Guido Cossu
|
c78bbd0f8c
|
Fix ASM compilation
|
2016-10-04 15:37:32 +01:00 |
|
paboyle
|
b6713ecb60
|
Momentum space rules for Overlap, DWF untested to date
|
2016-09-26 09:39:09 +01:00 |
|
Guido Cossu
|
b6597b74e7
|
Added support for the Two index Symmetric and Antisymmetric representations
Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations
|
2016-09-22 14:17:37 +01:00 |
|
Guido Cossu
|
b9c80318a2
|
Merge branch 'develop' into feature/hirep
|
2016-09-13 10:01:51 +01:00 |
|
Guido Cossu
|
f76f281e58
|
Cleaning files after fix
|
2016-09-09 11:34:25 +01:00 |
|
Guido Cossu
|
aa20cc8b52
|
Fixing compilation error with AVX512 flag
|
2016-09-09 02:58:52 -07:00 |
|
Guido Cossu
|
0fd179fb33
|
Merge branch 'develop' into feature/hirep
|
2016-09-01 12:59:53 +01:00 |
|