Michael Marshall
244b4aa07f
Serialise std::vector of numeric types as multidimensional object if size is regular ... or individually if ragged
2021-05-21 20:08:56 +01:00
u61464
8cfc7342cd
staggered hand unroll read coalesce
2021-05-05 14:17:18 -07:00
u61464
15ae317858
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-05-04 08:40:38 -07:00
u61464
834f536b5f
Fastest option on SyCL is now std::complex
2021-05-04 08:40:18 -07:00
Peter Boyle
c332d9f08b
Merge pull request #356 from felixerben/bugfix/stoutSmearing
...
Jamie's fix
2021-04-27 14:10:49 -04:00
cf2923d5dd
Jamie's fix
2021-04-27 16:53:37 +01:00
Peter Boyle
0e4413ddde
Merge pull request #355 from felixerben/bugfix/stoutSmearing
...
bugfix 3D stout smearing
2021-04-27 08:01:55 -04:00
009ccd581e
bugfix 3D stout smearing
2021-04-26 10:36:33 +01:00
Peter Boyle
8cd4263974
Tests compile
2021-04-25 22:20:37 -04:00
Peter Boyle
d45c868656
Change interface
2021-04-25 10:53:34 -04:00
Peter Boyle
955a8113de
Expose label only to reduce number of parameters
2021-04-25 10:36:38 -04:00
Peter Boyle
dbe210dd53
Open the ens_id
2021-04-25 10:25:59 -04:00
Peter Boyle
86e11743ca
set twists
2021-04-20 10:19:11 -04:00
Peter Boyle
980e721f6e
Update MetaData.h
2021-04-13 09:33:01 -04:00
Peter Boyle
e2a0142d87
Merge pull request #348 from AndrewYongZhenNing/develop
...
Conserved Tadpole Implementation for Shamir Action Only
2021-04-06 10:49:00 -04:00
895244ecc3
Merge with upstream; implemented conserved tadpole for Shamir action.
2021-04-06 13:46:33 +01:00
addeb621a7
Implemented tadpole operator for Shamir action.
2021-04-06 13:45:37 +01:00
Peter Boyle
a7fb25adf6
Make Cshift fields static to avoid repeated reallocaate overhead
2021-03-29 21:44:14 +02:00
Peter Boyle
e947992957
Improved force terms
2021-03-29 20:04:06 +02:00
Peter Boyle
bb89a82a07
Staggered coalseced read
2021-03-29 20:01:15 +02:00
Peter Boyle
8bdadbadac
Cold start
2021-03-18 15:41:14 -04:00
Peter Boyle
15c50a7442
Explicit instantiate the template function
2021-03-18 15:40:42 -04:00
Peter Boyle
49b0af2c95
Update of tests to compile with the sRNG addition.
...
Audited the code conventions (again) with the CPS momentum denominator
and added anti periodic in time to the Test_mobius_force.cc and
tested the Test_dwf_gpforce.
Promoted thesee to test full HMC hamiltonian, tr P^2/2 + phidag MdagM phi
with the same pdot and Udot as audited in the Integrator.h etc...
With full comments and sources for factors.
2021-03-18 09:10:02 -04:00
Peter Boyle
9c2b37218a
sRNG parameter added
2021-03-18 06:24:11 -04:00
Peter Boyle
3c67d626ba
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-12 15:36:55 +01:00
Peter Boyle
51f506553c
Read out the local ID once, and store
2021-03-12 15:33:04 +01:00
Peter Boyle
226be84937
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-12 09:31:50 -05:00
Peter Boyle
001814b442
updated to do list. Start adding DDHMC work items
2021-03-12 09:31:17 -05:00
Peter Boyle
db3ac67506
Update thread issue
2021-03-12 14:55:07 +01:00
Peter Boyle
da91a884ef
NVCC versions found buggy added as guard
2021-03-11 23:54:53 +01:00
Peter Boyle
a71e6755e3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-11 22:43:06 +01:00
Peter Boyle
cd5891eecd
Test that fails on Cuda 11.0
2021-03-11 22:34:28 +01:00
Peter Boyle
5bb7336f27
Merge pull request #347 from pjgeorg/fix-autotools-avx512
...
Fix inconsistent SIMD option AVX512
Thanks
2021-03-11 16:29:07 -05:00
Peter Boyle
ce1fc1f48a
Possible fallback plan for Fionn's compiler bbug in nvcc
2021-03-11 22:20:53 +01:00
Peter Georg
82402c6a7c
Add simd option SKL for ICC
2021-03-11 13:08:40 +01:00
Peter Georg
d9c4afe5b7
Fix inconsistent configure option AVX512
...
Before this change AVX512 enabled different instruction sets depending
on the compiler:
For Intel C++ Compiler Classic (ICC):
AVX512F, AVX512CD, AVX512DQ, AVX512BW, AVX512VL
i.e. Intel Xeon Skylake and newer
For Intel ICX, gcc, clang:
AVX512F, AVX512CD, AVX512ER, AVX512PF
i.e. Intel Xeon Phi x200/x205 (KNL/KNM)
With this commit AVX512 now only enables the common instruction sets
supported by all CPUs supporting any AVX-512 instructions set:
AVX512F and AVX512CD (called COMMON-AVX512 by icc)
2021-03-11 12:58:49 +01:00
Peter Boyle
f786ff8d69
Extend test from Fionn, fails on A100 apparently
2021-03-10 14:32:06 -05:00
u61464
a651caed5f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-10 06:23:51 -08:00
u61464
0e21adb3f6
Gives 200GF/s on SyCL/DG1 8^4, doesn't uglify develop for other platforms too badly.
...
Easy to revert to clean more C++ stylistic code. Theres a SYCL_HACK macro I will clean up later once dpcpp
evolves a central nervous systems.
2021-03-10 05:40:51 -08:00
Peter Boyle
58bf9b9e6d
Clean up test
2021-03-10 02:45:22 +01:00
Peter Boyle
2146eebb65
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-09 04:31:46 +01:00
Peter Boyle
6a429ee6d3
2d loop hits Nvidia 16bit limit on large local vols
2021-03-09 04:31:10 +01:00
Peter Boyle
4d1ea15c79
More verbosity. The 16bit limit on Grid.y, Grid.z is annoying
2021-03-09 04:29:37 +01:00
Peter Boyle
a76cb005e0
Update Tensor_exp.h
2021-03-08 13:37:57 -05:00
Peter Boyle
a9604367c1
Merge pull request #336 from lehner/feature/gpt
...
Make ShmDims configurable; adjust GRID_MAX_SIMD to allow for 128 byte width on GPUs
2021-03-05 13:17:19 -05:00
Peter Boyle
d7065023cc
Merge pull request #332 from mmphys/feature/mres_schur
...
Optional changes to Test_cayley_mres e.g. Schur solver
2021-03-05 12:47:07 -05:00
Peter Boyle
89d299ceec
Merge pull request #333 from mmphys/bugfix/LatTransfer
...
Fix convertType for GPU in Lattice_transfer.h
2021-03-05 12:46:33 -05:00
Peter Boyle
e34eda66df
Merge pull request #344 from felixerben/feature/XiToSigma
...
Feature/xi to sigma
2021-03-05 12:45:44 -05:00
Christoph Lehner
b24181aa4f
Update Coordinate.h
...
Revert GRID_MAX_SIMD change
2021-03-05 16:56:58 +01:00
Peter Boyle
aa173e2998
Update README.md
2021-03-05 10:25:33 -05:00