Peter Boyle
6cdb09c884
Faster copy region
2020-04-10 11:10:52 -04:00
Peter Boyle
a65bc64f10
Accelerator peek poke
2020-04-10 11:09:59 -04:00
Peter Boyle
11dec4883c
Don't throw assert
2020-04-10 11:09:11 -04:00
Peter Boyle
afa458c812
Extra solvers
2020-04-10 11:08:19 -04:00
Peter Boyle
dc50190b8f
Faster GPU basis rotation
...
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
Christoph Lehner
96e8e44fd4
Merge pull request #2 from DanielRichtmann/feature/fused-innerproduct-norm2
...
Fused innerProduct + norm2 on first argument operation
2020-04-06 13:16:58 +02:00
Daniel Richtmann
5fc8a273e7
Fused innerProduct + norm2 on first argument operation
2020-04-06 11:52:29 +02:00
d671a63e78
Update README.md
2020-04-03 19:52:15 +01:00
Christoph Lehner
856d168e41
global sum over vectors of uint64_t
2020-03-29 07:56:05 -04:00
6235c7ba98
IPP path fix in configure
2020-03-27 17:23:29 +00:00
7e13724882
removing Hadrons
2020-03-27 12:03:32 +00:00
Christoph Lehner
b6cbdd2aa3
Merge pull request #1 from DanielRichtmann/feature/read-openqcd
...
Feature/read openqcd
2020-03-26 17:39:04 +01:00
Christoph Lehner
a2188ea875
remove debugging printf from WilsonKernelsImplementation
2020-03-26 09:12:36 -04:00
Daniel Richtmann
989af65807
Check in parallel reader for openqcd configs
2020-03-24 11:20:54 +01:00
Christoph Lehner
60db3133d3
make trace,adj,transpose unary operators
2020-03-16 17:59:56 -04:00
Christoph Lehner
c9b737a4e7
make trace,adj,transpose unary operators
2020-03-16 17:58:30 -04:00
Daniel Richtmann
037bb6ea73
Check in reader for openqcd configs
...
This reader is suboptimal in the sense that it opens the entire config on every MPI rank.
2020-03-16 14:28:02 +01:00
05ebc458e2
Merge pull request #260 from mmphys/feature/distil
...
Distillation: save eigenvalues of the Laplacian for all timeslices
2020-03-13 14:00:21 +00:00
Michael Marshall
3753508957
Making change 1) as simple as possible 2) as much like MSink/Point.hpp as possible
2020-03-12 13:47:51 +00:00
Michael Marshall
c1677fccf6
Merge branch 'develop' into feature/distil
...
* develop:
bugfix ZPerambulator
registered module supporting ZMobius action
changed to push_back according to request
Added Hadrons_Error in case blockSize is set too large
bugfix in perambulator module
# Conflicts:
# Hadrons/Modules/MDistil/Perambulator.hpp
2020-03-12 12:45:18 +00:00
35e8e31749
Merge pull request #272 from mmphys/feature/ZPeramb
...
bugfix ZPerambulator
2020-03-12 12:28:04 +00:00
34813e9b04
Merge branch 'develop' into feature/ZPeramb
2020-03-12 12:27:56 +00:00
Felix Erben
373cf61abb
bugfix ZPerambulator
2020-03-12 11:44:43 +00:00
4e8fbc4b49
Merge pull request #271 from mmphys/feature/ZDistil
...
registered module supporting ZMobius action
2020-03-12 10:54:07 +00:00
ferben
516ac1d4d5
registered module supporting ZMobius action
2020-03-12 10:52:27 +00:00
318f63eb34
Merge pull request #268 from mmphys/a2a-error-log
...
Added Hadrons_Error in case blockSize is set too large
2020-03-11 11:09:00 +00:00
16503d7532
Merge pull request #267 from mmphys/feature/distil-bugfix
...
bugfix in perambulator module
2020-03-11 11:08:23 +00:00
ferben
0fa93383b7
changed to push_back according to request
2020-03-11 09:05:01 +00:00
ferben
0a827aa7bf
Added Hadrons_Error in case blockSize is set too large
2020-03-11 08:52:52 +00:00
Carleton DeTar
165c68e28e
Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift
2020-02-29 17:51:51 -06:00
ferben
b32b1ca642
bugfix in perambulator module
2020-02-26 12:06:45 +00:00
Carleton DeTar
9479bc8486
Make IterationsToComplete and TrueResidual externally accessible
2020-02-19 17:43:57 -06:00
Peter Boyle
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
Peter Boyle
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
Peter Boyle
68b45f6444
Lower left/upper right region cut paste
2020-02-06 15:50:26 -05:00
Peter Boyle
ef9b3e658a
extra typedef
2020-02-06 15:47:14 -05:00
Peter Boyle
b9ca40cc44
More precise power method at start
2020-02-06 10:09:14 -05:00
Peter Boyle
2f421a5db1
Commeent fix
2020-02-06 10:08:27 -05:00
Michael Marshall
10192dfc71
Wall source momenta must be specified for spatial components only.
...
So we don't break existing scripts, allow momentum in time direction as well, but only if zero.
Fail early, so do the check in setup()
2020-01-31 15:02:03 +00:00
Michael Marshall
c69a3b6ef6
When saving eigenvectors, LapEvec now saves eigenvalues for every timeslice as well.
...
I.e. nT x nVec eigenvalues are saved in FileName.evals.conf.h5.
A new named tensor, "TimesliceEvals" can be used to simplify restoring these from disk.
NB: The changes in BaseIO add support so that Eigen tensors can be easily used in MPI operations, e.g. GlobalSum.
See LapEvec.hpp for an example of how this is done.
2020-01-29 21:20:20 +00:00
Peter Boyle
852fc1b001
True Hierachical multigrid for DWF
2020-01-27 13:45:10 -05:00
Peter Boyle
2b5de5bba5
MdagM operator without norm option
2020-01-27 13:44:30 -05:00
Peter Boyle
2e85cae74e
Add Jacobi polynomials
2020-01-27 13:43:49 -05:00
Peter Boyle
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
Peter Boyle
114db3b99d
Optional MdagM without norms
2020-01-27 13:42:51 -05:00
Peter Boyle
49e123dbda
Use explicit linalg calls to get coalesce optimisations on GPU
2020-01-27 12:44:51 -05:00
Peter Boyle
8cec294ec9
Make CG a bit less verbose as gettign annoying in nested algorithms.
...
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
Peter Boyle
eb5b720e94
Normal Equations can be used in HDCR now
2020-01-27 12:43:29 -05:00
Peter Boyle
b2736ec80b
Make PrecGCR recursive - it can precondition itself
2020-01-27 12:42:48 -05:00
Peter Boyle
086256a032
Less sloppy convergence test on PowerMethod
2020-01-27 12:41:59 -05:00