nmeyer-ur
304762e7ac
changes
2020-04-09 16:26:01 +02:00
nmeyer-ur
d79ab03a6c
changes
2020-04-09 16:19:25 +02:00
nmeyer-ur
d5708e0eb2
more changes
2020-04-09 15:43:34 +02:00
nmeyer-ur
123f6b7a61
more changes
2020-04-09 15:17:19 +02:00
nmeyer-ur
2b6457dd9a
added xp/xm recon accum
2020-04-09 15:13:19 +02:00
nmeyer-ur
b367cbd422
defined ADD_RESULT
2020-04-09 15:08:45 +02:00
nmeyer-ur
e252c1aca3
addressing
2020-04-09 15:03:12 +02:00
nmeyer-ur
b140c6a4f9
addressing
2020-04-09 15:01:15 +02:00
nmeyer-ur
326de36467
revised sU addressing scheme
2020-04-09 14:44:25 +02:00
nmeyer-ur
9f224a1647
fixed typo in single
2020-04-09 14:30:21 +02:00
nmeyer-ur
bb46ba9b5f
fixed array size in single
2020-04-09 14:28:45 +02:00
nmeyer-ur
dd5a22b36b
revised declarations
2020-04-09 14:21:27 +02:00
nmeyer-ur
1ea85b9972
Disabled build message
2020-04-09 13:47:21 +02:00
nmeyer-ur
8fb63f1c25
added A64FX Wilson kernels single precision
2020-04-09 13:41:04 +02:00
nmeyer-ur
77fa586f6c
introduced A64FX Wilson kernels
2020-04-09 13:30:06 +02:00
Daniel Richtmann
5fc8a273e7
Fused innerProduct + norm2 on first argument operation
2020-04-06 11:52:29 +02:00
nmeyer-ur
15238e8d5e
reduce acle works, clean up
2020-04-03 20:40:44 +02:00
nmeyer-ur
b27e31957a
reduce acle revised
2020-04-03 19:46:15 +02:00
nmeyer-ur
46927771e3
reduce acle still needs overhaul
2020-04-03 19:30:48 +02:00
nmeyer-ur
d8cea77707
define simd width in header
2020-04-03 19:22:25 +02:00
nmeyer-ur
5f8a76d490
clean up, reduction in acle
2020-04-03 19:18:24 +02:00
nmeyer-ur
28d49a3b60
build problem resolved
2020-04-03 16:52:48 +02:00
nmeyer-ur
b4c624ece6
added A64FX support
2020-04-03 15:43:23 +02:00
2c22db841a
Added momentum scaling to scalar HMC theories in order to follow UKQCD/CPS conventions
2020-04-02 17:38:47 +01:00
Christoph Lehner
856d168e41
global sum over vectors of uint64_t
2020-03-29 07:56:05 -04:00
Christoph Lehner
b6cbdd2aa3
Merge pull request #1 from DanielRichtmann/feature/read-openqcd
Feature/read openqcd
2020-03-26 17:39:04 +01:00
Christoph Lehner
a2188ea875
remove debugging printf from WilsonKernelsImplementation
2020-03-26 09:12:36 -04:00
Daniel Richtmann
989af65807
Check in parallel reader for openqcd configs
2020-03-24 11:20:54 +01:00
Christoph Lehner
c9b737a4e7
make trace,adj,transpose unary operators
2020-03-16 17:58:30 -04:00
Daniel Richtmann
037bb6ea73
Check in reader for openqcd configs
This reader is suboptimal in the sense that it opens the entire config on every MPI rank.
2020-03-16 14:28:02 +01:00
Carleton DeTar
165c68e28e
Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift
2020-02-29 17:51:51 -06:00
Carleton DeTar
9479bc8486
Make IterationsToComplete and TrueResidual externally accessible
2020-02-19 17:43:57 -06:00
Peter Boyle
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
Peter Boyle
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
Peter Boyle
68b45f6444
Lower left/upper right region cut paste
2020-02-06 15:50:26 -05:00
Peter Boyle
ef9b3e658a
extra typedef
2020-02-06 15:47:14 -05:00
Peter Boyle
b9ca40cc44
More precise power method at start
2020-02-06 10:09:14 -05:00
Peter Boyle
2f421a5db1
Comment fix
2020-02-06 10:08:27 -05:00
Michael Marshall
c69a3b6ef6
When saving eigenvectors, LapEvec now saves eigenvalues for every timeslice as well.
I.e. nT x nVec eigenvalues are saved in FileName.evals.conf.h5.
A new named tensor, "TimesliceEvals" can be used to simplify restoring these from disk.
NB: The changes in BaseIO add support so that Eigen tensors can be easily used in MPI operations, e.g. GlobalSum.
See LapEvec.hpp for an example of how this is done.
2020-01-29 21:20:20 +00:00
Peter Boyle
2b5de5bba5
MdagM operator without norm option
2020-01-27 13:44:30 -05:00
Peter Boyle
2e85cae74e
Add Jacobi polynomials
2020-01-27 13:43:49 -05:00
Peter Boyle
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
Peter Boyle
114db3b99d
Optional MdagM without norms
2020-01-27 13:42:51 -05:00
Peter Boyle
49e123dbda
Use explicit linalg calls to get coalesce optimisations on GPU
2020-01-27 12:44:51 -05:00
Peter Boyle
8cec294ec9
Make CG a bit less verbose, as it was getting annoying in nested algorithms.
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
Peter Boyle
eb5b720e94
Normal Equations can be used in HDCR now
2020-01-27 12:43:29 -05:00
Peter Boyle
b2736ec80b
Make PrecGCR recursive - it can precondition itself
2020-01-27 12:42:48 -05:00
Peter Boyle
086256a032
Less sloppy convergence test on PowerMethod
2020-01-27 12:41:59 -05:00
Peter Boyle
afc7426f39
Much bigger pointer cache in case of Nvidia due to cost of setting up UVM allocations
2020-01-27 12:41:16 -05:00
Peter Boyle
7c061e20c9
All directions of Dirac operator for fast coarsening
2020-01-27 12:40:13 -05:00