Peter Boyle
546be724e7
Merge pull request #421 from UniOfLeicester/feature/accel_Copy_plane
...
Populate the Cshift_table in the GPU
2023-03-24 12:04:06 -04:00
Peter Boyle
481bbaf1fc
Interface to query memory use
2023-03-23 12:55:31 -04:00
Peter Boyle
281488611a
WriteDiscard on construct
2023-03-23 10:28:50 -04:00
Peter Boyle
bae0f8ea99
Merge pull request #425 from rrhodgson/feature/CacheLogging
...
Huge Cache
2023-03-21 08:59:08 -04:00
Peter Boyle
bbbcd36ae5
Merge pull request #426 from rrhodgson/feature/LCDeflation
...
Batched Local Coherence Tools
2023-03-21 08:58:40 -04:00
Peter Boyle
39c0815d9e
WriteDiscard
2023-03-21 08:57:29 -04:00
a3e935c902
Batched block project/promote size checks
2023-02-27 11:38:16 +00:00
7731c7db8e
Add huge cache type and allow Ncache==0
2023-02-26 14:15:28 +00:00
ff97340324
Expose cached bytes
2023-02-26 12:22:45 +00:00
920a51438d
Added batched Mixed precision CG
2023-02-14 17:04:13 +00:00
be528b6d27
Add batched block project/promote functions
2023-02-14 14:37:10 +00:00
Peter Boyle
796abfad80
Merge pull request #422 from fjosw/fix/NVCC_DIAG_PRAGMA_SUPPORT
...
Disable diagnostic pragma warnings for CUDA 12+
2023-01-17 09:34:49 -05:00
ad0270ac8c
fix: diagnostic pragma warnings fixed for CUDA 12+
2023-01-12 12:36:30 +00:00
Makis Kappas
7d62f1d6d2
Populate the Cshift_table in the GPU
...
Cshift is allocated in Unified memory and used
in the LambdaApply kernels but also populated
from the host. This creates a lot of Unified HtoD
and DtoH mem operations and has a negative effect
in performance. With this commit we populate the
Cshift table in the device with the
populate_Cshift_table() kernel.
2023-01-11 21:26:25 +00:00
Peter Boyle
4ca1bf7cca
Added gauge invariance test
2022-12-21 07:23:16 -05:00
Peter Boyle
2ff868f7a5
CPU open doesn't need to free space
2022-12-20 05:10:23 -05:00
Peter Boyle
ede02b6883
Memory manager debug Felix case
2022-12-20 05:10:23 -05:00
Peter Boyle
1822ced302
Bug fix
2022-12-20 05:10:23 -05:00
Peter Boyle
37ba32776f
More logging
2022-12-20 05:10:23 -05:00
Peter Boyle
99b3697b03
More loggin
2022-12-20 05:10:23 -05:00
Peter Boyle
43a45ec97b
SSC_START
2022-12-20 05:10:23 -05:00
Peter Boyle
b00a4142e5
A=A fix
2022-12-20 05:10:23 -05:00
Peter Boyle
3791bc527b
Logging pulled in from dirichlet branch
2022-12-20 05:10:23 -05:00
Peter Boyle
d8c29f5fcf
Updated FFT test for PETSc
2022-12-18 12:05:00 -05:00
Peter Boyle
281f8101fe
Matt FFT test
2022-12-17 20:35:33 -05:00
Peter Boyle
07acfe89f2
Merge pull request #417 from rrhodgson/feature/fermtoprop
...
Feature/fermtoprop
2022-12-06 12:45:03 -05:00
40234f531f
FermToProp accelerator_for -> thread_for
2022-12-06 17:34:51 +00:00
d49694f38f
PropToFerm fix
2022-12-06 15:48:54 +00:00
Peter Boyle
97a098636d
FermToProp
2022-11-30 15:36:35 -05:00
Peter Boyle
e13930c8b2
Faster fermtoprop case
2022-11-30 15:11:29 -05:00
Peter Boyle
0655dab466
Open MP on host enabled
2022-11-08 13:38:54 -08:00
Peter Boyle
7f097bcc28
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2022-11-08 13:23:40 -08:00
Peter Boyle
5c75aa5008
Device mem
2022-11-08 13:22:57 -08:00
Peter Boyle
1873101362
PVC
2022-11-08 13:22:45 -08:00
Peter Boyle
63fd1dfa62
Config on PVC
2022-11-08 13:22:09 -08:00
Peter Boyle
bd68861b28
SYCL sum
2022-11-08 12:49:26 -08:00
Peter Boyle
82e959f66c
SYCL reduction
2022-11-08 12:45:25 -08:00
Peter Boyle
62e52de06d
Merge pull request #414 from fjosw/feat/eCloverGPU
...
Compact Exponential Cloverterm on GPU
2022-11-01 09:15:44 -04:00
184adeedb8
feat: renamed open_boundaries to fixedBoundaries
2022-10-26 12:53:46 +01:00
5fa6a8b96d
docs: CompactClover debug info generalized.
2022-10-26 12:41:14 +01:00
a2a879b668
docs: CompactClover Debug Info improved.
2022-10-25 17:20:42 +01:00
9317d893b2
docs: details about inversion of CompactClover term added.
2022-10-25 17:10:06 +01:00
86075fdd45
feat: MassTerm and ExponentiateClover merged into InstantiateClover
2022-10-25 17:05:34 +01:00
b36442e263
feat: CloverHelpers::InvertClover implemented which handles the
...
inversion of the Clover term depending on clover type and the boundary
conditions.
2022-10-25 16:57:01 +01:00
513d797ea6
fix: signature of CompactWilsonCloverHelpers::Exponentiate fixed.
2022-10-25 16:17:22 +01:00
9e4835a3e3
feat: changed CompactWilsonExpClover exponentiation to Taylor expansion
...
with Horner scheme.
2022-10-25 15:19:43 +01:00
Peter Boyle
477ebf24f4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2022-10-04 11:19:43 -07:00
Peter Boyle
0d5639f707
Run script update
2022-10-04 11:13:41 -07:00
Peter Boyle
413312f9a9
Benchmark the halo construction.
...
THe bye counts are out and should be doubled for SIMD directions
2022-10-04 11:12:59 -07:00
Peter Boyle
03508448f8
Remove verbose
2022-10-04 11:12:15 -07:00