mirror of https://github.com/paboyle/Grid.git synced 2025-06-17 07:17:06 +01:00
58 Commits

SHA1 Message Date
9b85bf9402 better projection test 2023-05-02 15:42:20 +01:00
86b02c3cd8 cleaning up requested by Julian 2023-05-02 13:31:17 +01:00
7b3b7093fa cleaning up requested by Ed 2023-05-02 12:50:57 +01:00
881b08a465 Correct implementation of SpTa 2023-04-27 18:17:06 +01:00
dace904c10 fix typo 2023-04-14 18:06:18 +01:00
778291230a expand ProjecOnGaugeGroup, change ProjectOnSp2nAlgebra into SpTa, fixing some of its issues 2023-04-04 17:48:13 +01:00
026e736dfa Projection on algebra can now be templated. Fix #12 2023-04-03 16:31:19 +01:00
cbc053c3db Revert "projection on Sp2n algebra, to be used instead of Ta"
This reverts commit ba7f9d7b70.
2023-03-17 11:36:58 +00:00
ba7f9d7b70 projection on Sp2n algebra, to be used instead of Ta 2023-03-15 15:55:12 +00:00
4044536eea add projection on sp2n algebra 2021-10-26 10:20:44 +01:00
4d8ae6221c fix projection 2021-10-22 10:44:54 +01:00
283f14b7c1 fix sp2n projection 2021-10-12 16:00:32 +01:00
7e3b298d3d project on sp2n 2021-10-12 16:00:32 +01:00
af98525766 Merge pull request #359 from paboyle/feature/serialisation-update
Feature/serialisation update
2021-09-16 10:24:52 -04:00
4c5440fb06 const happy for sycl 2021-06-15 21:45:07 +00:00
76af169f05 Add global namespace to Writer<T> and Reader<T> inside GRID_SERIALIZABLE_CLASS_MEMBERS (so that "using Grid" not necessary).
Fix issue with output of Grid::iMatrix so that M<3>{{148,149,150,} {151,152,153,} {154155156}} becomes M<3>{{148,149,150} {151,152,153} {154,155,156}}
2021-05-31 08:43:02 +01:00
15ae317858 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2021-05-04 08:40:38 -07:00
834f536b5f Fastest option on SyCL is now std::complex 2021-05-04 08:40:18 -07:00
e947992957 Improved force terms 2021-03-29 20:04:06 +02:00
a76cb005e0 Update Tensor_exp.h 2021-03-08 13:37:57 -05:00
679d1d22f7 Sycl happier 2021-03-03 11:21:43 -08:00
f9b1f240f6 Better SIMD usage/coalescence 2021-02-26 17:51:41 +01:00
99445673f6 Gparity fix, and plaquette IO 2021-01-14 21:00:36 -05:00
5adae5d6ff Unused variable remove 2020-11-19 19:22:12 +01:00
cc9c993f74 Project on group fix on GPU tracked to reciprocal sqrt collision between CUDA and Grid rsqrt 2020-10-31 18:12:47 -04:00
6c31b99f1f I knew coupling Eigen Tensor to Grid serialisation was a bad idea.
Now the complex is different on GPU creates problems
2020-08-31 23:49:19 -04:00
968a90633a Zero -> zeroit in Tensor_index 2020-07-31 02:07:17 -04:00
936c5ecf69 Reduction GPU no compile fix 2020-06-24 17:28:31 -04:00
cdf0a04fc5 Merge branch 'develop' into sycl 2020-06-09 04:00:12 -04:00
f8b8e00090 Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
Aim to reduce the amount of cuda and other code variations floating around all over the place.

Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
28a1fcaaff First compile against SYCL 2020-05-05 11:13:27 -07:00
ddb192bac7 re-work double precision promotion for summit 2020-04-30 16:09:57 -04:00
f1fe444d4f blocked precision promotion infrastructure upgrade 2020-04-24 06:27:20 -04:00
091d5c605e towards more precise blocking 2020-04-17 04:25:28 -04:00
b473405652 Tensor ambiguous fix 2019-08-29 09:36:41 -05:00
28d6be2a4e Fix GCC complaint 2019-08-22 18:56:37 +01:00
96ac56cace Double precision variants for summation accuracy 2019-08-14 13:08:01 +01:00
a23dc295ac Remove compiler errors and warnings 2019-07-18 14:47:02 +01:00
08904f830e Merge develop 2019-07-16 11:59:56 +01:00
fa9cd50c5b Merge branch 'develop' into feature/gpu-port 2019-07-16 11:55:17 +01:00
d6ffadb33b Coalesced write 2019-07-02 17:25:13 +01:00
b8f7bfbb26 Don't stream as poor perf in some cases 2019-07-01 07:30:25 +01:00
d976e5c514 Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug. 2019-06-16 12:05:11 +01:00
b285138be4 Better checking on types 2019-06-15 08:27:48 +01:00
29a244e423 Test of using a lane variable instead of repeated reference to threadIdx.y 2019-06-08 13:46:26 +01:00
0ee6e77cbc Compiles GPU and CPU, still gives good performance on CPU 2019-06-05 13:28:16 +01:00
8794d35c78 GPU 2019-06-04 20:52:27 +01:00
6e2e904a0e NVCC compiles happy. Start to develop strategy for writing generic
code for GPU kernels and CPU kernels.
2019-06-04 20:46:35 +01:00
ffde81f22a Nsimd() and coalesced support 2019-05-25 12:44:07 +01:00
d8098f1ecd coalesced support 2019-05-25 12:43:31 +01:00