901ee77b84
Mixed precision multishift test can now be performed with/without G-parity using cmdline check and can load a pregenerated configuration
2021-01-20 11:45:44 -05:00
1b84f59273
Added a mixed precision multishift algorithm for which the matrix multiplies are performed in single precision but the search directions are accumulated in double precision.
...
A reliable update step is performed at a tunable frequency to correct the residual. A final mixed-prec single-shift solve is performed on each pole to perform cleanup if necessary.
A test is provided to demonstrate the algorithm.
2021-01-06 12:24:44 -05:00
1fb41a4300
Added copyLane function to Tensor_extract_merge.h which copies one lane of data from an input tensor object to a different lane of an output tensor object of potentially different precision
...
precisionChange lattice function now uses copyLane to remove need for temporary scalar objects, reducing register footprint and significantly improving performance
2021-01-06 11:50:56 -05:00
287bac946f
ConjugateGradientMixedPrec now stores final true residual and uses the precisionChange workspaces for improved efficiency
2021-01-06 09:50:41 -05:00
80c14be65e
Added core test to check precision change
2021-01-06 09:34:44 -05:00
d7a2a4852d
Reimplemented precisionChange to run on GPUs. A workspace containing the mapping table can be optionally precomputed and reused for improved performance.
2021-01-06 09:30:49 -05:00
d185f2eaa7
OneFlavourEvenOddRatioRationalPseudoFermionAction now derives from GeneralEvenOddRatioRationalPseudoFermionAction, simply performs transcription of parameters
2020-12-23 16:26:10 -05:00
813d4cd900
Added test program that ensures the generic checkerboarded RHMC (with parameters set appropriately) gives the same answer as the existing 1f code
2020-12-23 16:01:42 -05:00
75c6c6b173
General RHMC pseudofermion action now allows for different rational approximations to be used in the MD and action evaluation
2020-12-23 11:19:26 -05:00
220ad5e3ee
Added more verbose log output to GeneralEvenOddRatioRationalPseudoFermionAction
...
In GeneralEvenOddRatioRationalPseudoFermionAction, setting the bounds check frequency to 0 now disables the check
2020-12-22 11:08:22 -05:00
ba5dc670a5
Reimplemented GparityWilsonImpl::InsertForce5D to run efficiently on GPUs
...
Swapped order of templated tensor code and c-number specializations in Tensor_outer.h to fix compile issue with type deduction on Summit
2020-12-22 10:10:07 -05:00
a0ca362690
Added an RHMC pseudofermion action, GeneralEvenOddRatioRationalPseudoFermionAction, that works for an arbitrary fractional power, not just a square root
...
Added a test evolution for the above, Test_rhmc_EOWilsonRatioPowQuarter, demonstrating conservation of Hamiltonian
Fixed HMC ignoring the MetropolisTest parameter of HMCparameters
2020-12-17 16:21:58 -05:00
249b6e61ec
For G-parity BCs the Nd-1 direction is now assumed to be the time direction and setting a twist in this direction will apply antiperiodic BCs
...
Added option to run Test_gparity with antiperiodic time BCs
2020-12-17 14:09:00 -05:00
9e7bacb5a4
Merge branch 'develop' into feature/gparity_twist_GPU
2020-12-07 11:55:39 -05:00
2ef1fa66a8
Improved performance of G-parity kernel for GPUs by simplifying multLink implementation
2020-12-07 11:53:35 -05:00
cf76741ec6
Intel DPCPP Gold happy now (compiles all, runs Benchmark_dwf_fp32 )
2020-12-03 03:47:11 -08:00
497e7c1c40
Duplicate code
2020-12-02 17:55:30 -08:00
888eacd3b8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-24 21:46:33 -05:00
321f0f51b5
Project to SU(N)
2020-11-24 21:46:10 -05:00
30ad9578a2
Merge branch 'lehner-feature/gpt' into develop
2020-11-24 06:10:24 -05:00
9dce101586
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into lehner-feature/gpt
2020-11-24 06:10:16 -05:00
97e264d0ff
Christoph's changes
2020-11-23 15:46:11 +00:00
683a5e5bf5
Stencil use host vector for integera table on enable-shared=no and mirror it on device
2020-11-23 15:39:51 +00:00
d4861a362c
Stencil use non-UVM memory for look up table on enable-shared=no
2020-11-23 15:38:49 +00:00
5ff3eae027
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-20 13:14:44 -05:00
147dc15d26
Update
2020-11-20 13:13:59 -05:00
c61ea72949
Merge pull request #19 from paboyle/develop
...
Sync
2020-11-20 17:31:13 +01:00
86e8b9fe38
ALLOC_ALIGN removed
2020-11-20 17:07:16 +01:00
612e468889
Configurable ALLOC_ALIGN and ALLOC_CACHE
2020-11-20 16:48:28 +01:00
4ea8d128c2
Merge pull request #18 from paboyle/develop
...
Sync
2020-11-20 15:36:50 +01:00
e49b7f2f88
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-19 19:24:41 +01:00
aace3d47b9
partial work in progress
2020-11-19 19:24:14 +01:00
d5049949a4
Starting to fix reunitarise
2020-11-19 19:23:41 +01:00
f1c7480e3c
Warning remove
2020-11-19 19:23:03 +01:00
5adae5d6ff
Unused variable remove
2020-11-19 19:22:12 +01:00
a8412ace05
Merge pull request #317 from i-kanamori/develop
...
adding an error check for input: Parameters.StartingType
2020-11-18 23:09:40 -05:00
9fd1c2ad4b
Merge pull request #325 from DanielRichtmann/feature/threaded-clover-inversion
...
Threaded clover term inversion
2020-11-18 23:08:37 -05:00
4cf3575353
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-18 03:07:36 +00:00
804a810d68
Wildcard mismatch
2020-11-18 03:06:53 +00:00
8fcb392e24
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-17 04:51:31 -08:00
dd8d70eeff
Build without LIME
2020-11-17 04:41:15 -08:00
aa8aba6543
--shm-force-mpi
2020-11-16 20:15:50 -05:00
13df14f96e
Switch off SHM paths with --disable-shm
2020-11-16 18:07:15 -05:00
3aab983760
Flop count set as in DiRAC-ITT-2020 (mistaken 20% low, but must maintain consistency)
DiRAC-ITT-2020-UCX-WORKAROUND
2020-11-16 17:13:58 +01:00
9c4dcc5ea3
Merge branch 'master' into develop
2020-11-16 16:34:57 +01:00
a1063ddbb9
Update options and simplify
2020-11-13 04:11:03 +01:00
18ef8056ec
Hide Shared Memory
2020-11-13 04:10:40 +01:00
1c673977fa
Must ask for COMMMS_THREADS
2020-11-13 03:59:36 +01:00
e9bc748828
Useful GPU machine benchmark for GDR used to shakeout Booster at Juelich - see slack earlyaccess channel
2020-11-13 03:58:34 +01:00
f48156529b
Work on 2,2,2,8 ranks
2020-11-13 03:57:58 +01:00