86c9c4da8b
changes
2020-04-09 16:40:06 +02:00
bd310932f7
changes
2020-04-09 16:32:31 +02:00
77fa586f6c
introduced A64FX Wilson kernels
2020-04-09 13:30:06 +02:00
a2188ea875
remove debugging printf from WilsonKernelsImplementation
2020-03-26 09:12:36 -04:00
c9b737a4e7
make trace,adj,transpose unary operators
2020-03-16 17:58:30 -04:00
7c061e20c9
All directions of dirac operator for fastt coarsening
2020-01-27 12:40:13 -05:00
e5d1c09665
Faster DhopDirAll for little dirac operator coarsening
2020-01-27 12:38:54 -05:00
8016a465ae
Remove extraneous variable
2020-01-27 12:35:37 -05:00
d8b9742092
DhopDirAll for faster matrix elements of little Dirac operator
2020-01-27 12:34:54 -05:00
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
e583035614
Change to interface to minise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
3c3d6a94f3
OPtimising the force term a bit
2020-01-04 03:16:23 -05:00
039eb7b2eb
Make the force term and coarsening multigrid more optimised
2020-01-04 03:12:17 -05:00
f7373e97a4
Missing conjugate in MooeeInvDag
2019-12-16 10:05:50 +01:00
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
9b6b0caa55
Junk commit fix
2019-12-09 03:01:58 -05:00
2a48617ac5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:00:00 -05:00
3d2fe80780
Temporary size depends on checkerboard/uncheckerboard. The Mdir cares
2019-12-09 02:58:24 -05:00
803329af99
Merge branch 'develop' into feature/distil
...
* develop:
Fix after GPU merge: Phase in Free Propagator
z2-momentum phase module
# Conflicts:
# Hadrons/Modules/MSource/MomentumPhase.hpp
2019-10-07 13:09:52 +01:00
5f22810f55
Fix after GPU merge: Phase in Free Propagator
2019-10-02 14:49:35 +01:00
2e963d1a78
Fix location of Grid.h and remove reference to QCD namespace
2019-09-16 15:34:47 +01:00
a7fa86dc29
MooeeInv improvement for DW EOFA + comments
2019-09-05 12:05:21 +01:00
fdd9b14e82
speed up MooeeInvDag for DWF EOFA
2019-09-02 14:49:51 +01:00
e66669d300
fast MooeeInv for EOFA
2019-09-02 14:26:13 +01:00
0efaf3c4fa
access M5D coeffs through pointers
2019-09-02 11:33:00 +01:00
3ef519aaa4
fast MooeeInv
2019-09-02 11:18:14 +01:00
e279b2be29
Merge develop
2019-08-14 23:01:59 +01:00
48e6efc7c9
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
2019-08-14 18:56:54 +01:00
53e3ab4131
Fix force term
2019-08-11 11:06:13 +01:00
1282e1067f
Do the force term on the accelerator too. Needed particularly because comms buffers
...
are device memory.
2019-07-29 22:58:35 +01:00
fe700a183a
Getting HMC to run
2019-07-26 12:18:29 +01:00
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
6179acfda0
Put back a call that was required
2019-07-14 13:59:54 +01:00
07601ac1f5
Replace instantiation of Gparity
2019-07-12 17:18:12 +01:00
705a8098b2
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
...
Conflicts:
Grid/stencil/Stencil.h
2019-07-12 17:14:11 +01:00
78ebd93281
Cuda 9.1 happy
2019-07-12 17:11:00 +01:00
bd155ca5c0
Overlap comms with comput now supported
2019-07-12 09:09:40 +01:00
532e226b22
cuda 9.1 fixes
2019-07-01 07:29:22 +01:00
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
0184719216
Change to predicate type
2019-06-15 12:52:26 +01:00
24202dbc51
Thread loop construct change
2019-06-15 12:52:07 +01:00
d763c303c5
Clean acceleerator barrier
2019-06-15 12:51:45 +01:00
8e394d3bf9
New loop construct
2019-06-15 12:51:15 +01:00
b881d5489b
Move SchurDiagTwoKappa to Algorithms
2019-06-15 12:50:45 +01:00
49f90cc7eb
use pragma once
2019-06-15 12:45:22 +01:00
b77af0210b
Thread loop. Probably deprecate this impl
2019-06-15 12:44:56 +01:00
5254ede2d8
New loops. Revisit as accelerator loop in future audit
2019-06-15 12:44:29 +01:00
16e5d7945e
Hard to make 5D vec work with GPU code
2019-06-15 12:43:43 +01:00
decc99ca76
Accelerator version
2019-06-15 12:43:00 +01:00