mirror of https://github.com/paboyle/Grid.git synced 2026-04-29 15:06:00 +01:00

31 Commits

Author SHA1 Message Date
Peter Boyle 936c5ecf69 Reduction GPU no compile fix 2020-06-24 17:28:31 -04:00
Peter Boyle cdf0a04fc5 Merge branch 'develop' into sycl 2020-06-09 04:00:12 -04:00
Peter Boyle f8b8e00090 Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
Aim to reduce the amount of cuda and other code variations floating around all over the place.

Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle 28a1fcaaff First compile against SYCL 2020-05-05 11:13:27 -07:00
Christoph Lehner ddb192bac7 re-work double precision promotion for summit 2020-04-30 16:09:57 -04:00
Christoph Lehner f1fe444d4f blocked precision promotion infrastructure upgrade 2020-04-24 06:27:20 -04:00
Christoph Lehner 091d5c605e towards more precise blocking 2020-04-17 04:25:28 -04:00
Peter Boyle b473405652 Tensor ambiguous fix 2019-08-29 09:36:41 -05:00
Peter Boyle 28d6be2a4e Fix GCC complaint 2019-08-22 18:56:37 +01:00
Peter Boyle 96ac56cace Double precision variants for summation accuracy 2019-08-14 13:08:01 +01:00
Peter Boyle a23dc295ac Remove compiler errors and warnings 2019-07-18 14:47:02 +01:00
Peter Boyle 08904f830e Merge develop 2019-07-16 11:59:56 +01:00
Peter Boyle fa9cd50c5b Merge branch 'develop' into feature/gpu-port 2019-07-16 11:55:17 +01:00
Peter Boyle d6ffadb33b Coalesced write 2019-07-02 17:25:13 +01:00
Peter Boyle b8f7bfbb26 Don't stream as poor perf in some cases 2019-07-01 07:30:25 +01:00
Peter Boyle d976e5c514 Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug. 2019-06-16 12:05:11 +01:00
Peter Boyle b285138be4 Better checking on types 2019-06-15 08:27:48 +01:00
Peter Boyle 29a244e423 Test of using a lane variable instead of repeated reference to threadIdx.y 2019-06-08 13:46:26 +01:00
Peter Boyle 0ee6e77cbc Compiles GPU and CPU, still gives good performance on CPU 2019-06-05 13:28:16 +01:00
Peter Boyle 8794d35c78 GPU 2019-06-04 20:52:27 +01:00
Peter Boyle 6e2e904a0e NVCC compiles happy. Start to develop strategy for writing generic
code for GPU kernels and CPU kernels.
2019-06-04 20:46:35 +01:00
Peter Boyle ffde81f22a Nsimd() and coalesced support 2019-05-25 12:44:07 +01:00
Peter Boyle d8098f1ecd coalesced support 2019-05-25 12:43:31 +01:00
Michael Marshall 12d8bf1ced Eigen::Tensor serialisation. Tested on single and double precision builds 2019-03-20 22:27:41 +00:00
Mike 91cffef883 Updates after review with Peter. 2019-03-07 14:30:35 +00:00
Mike b7db99967a Recommendations for Traits classes 2019-02-28 20:06:59 +00:00
Peter Boyle e73b909a48 Make tests running past nvcc. Different NVCC versions proving tricky to keep happy. This is 9.2 2019-01-02 12:05:30 +00:00
Peter Boyle 8c91e82ee8 GPU clean up, remove parallel_for. Split into accelerator_loop, thread_loop
cases, and collides with parallel_for in thrust
2019-01-01 15:06:46 +00:00
Peter Boyle 422764757d Updates in tests to make all of Grid compile 2018-12-14 16:55:54 +00:00
Peter Boyle b57a4d32aa Merge branch 'develop' into feature/gpu-port 2018-12-13 05:11:34 +00:00
portelli fb7d021b9d Hadrons: moving Hadrons to root directory, build system improvements 2018-08-28 15:00:40 +01:00