1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-06-22 17:52:02 +01:00

Commit Graph

  • 1c881ce23c HIP does not like half2 visible members x and y so must define own Half2 Peter Boyle 2020-09-16 02:28:33 +01:00
  • dacbbdd051 Hip Happy Birthday Peter Boyle 2020-09-16 00:37:02 +01:00
  • 2859955a03 HIP requires "inline" Peter Boyle 2020-09-16 00:36:13 +01:00
  • cc220abd1d inline for HIP Peter Boyle 2020-09-16 00:35:38 +01:00
  • d1c0c0197e HipCC requires inline on definition Peter Boyle 2020-09-16 00:35:06 +01:00
  • fd9424ef27 innlines required to make HIP happy Peter Boyle 2020-09-16 00:34:32 +01:00
  • a5c35c4024 Make HIP / Vega happy Peter Boyle 2020-09-16 00:33:53 +01:00
  • e03b64dc06 HIP default flaags to work on ROCM Peter Boyle 2020-09-16 00:33:09 +01:00
  • 4677c40195 HIP improvements Peter Boyle 2020-09-16 00:32:27 +01:00
  • 288c615782 Hip improvements Peter Boyle 2020-09-16 00:31:50 +01:00
  • 48e81cf6f8 Hip Pragmas Peter Boyle 2020-09-16 00:31:03 +01:00
  • 5cffa05c7e remove slab allocator file Christoph Lehner 2020-09-13 14:06:25 -04:00
  • d50a2164d7 remove slab allocator Christoph Lehner 2020-09-13 14:06:06 -04:00
  • 32ff766dbd fix evict scheme, slab alloc Christoph Lehner 2020-09-13 14:02:53 -04:00
  • 01652d8cfe SlabAllocator Christoph Lehner 2020-09-13 05:56:02 -04:00
  • 4d2dc7ba03 Enable even-odd for CoarsenedMatrix Daniel Richtmann 2020-09-07 17:57:07 +02:00
  • 3064c9a6e2 Improve the matching of stencil coarsening feature/hw-multigrid Peter Boyle 2020-09-08 15:36:58 -04:00
  • 729882827c Improve the coarse matrix calc Peter Boyle 2020-09-08 15:36:33 -04:00
  • 51d1beb1f3 Merge pull request #15 from paboyle/develop Christoph Lehner 2020-09-07 14:20:33 +02:00
  • baa668d3ac Merge branch 'develop' into feature/hw-multigrid Peter Boyle 2020-09-03 22:16:50 -04:00
  • 3c82d16ed8 4D multigrid Peter Boyle 2020-09-03 22:11:17 -04:00
  • 5c8c0c2d7c BiCG Peter Boyle 2020-09-03 22:10:40 -04:00
  • e5a100846c Useful tthing to preserve Peter Boyle 2020-09-03 22:09:57 -04:00
  • a74e2dc12e Printing mem info Peter Boyle 2020-09-03 22:08:44 -04:00
  • 595f512a6e G5 for coarse space too Peter Boyle 2020-09-03 22:07:40 -04:00
  • a6499b22ff Stats printing Peter Boyle 2020-09-03 22:00:46 -04:00
  • b4e42a59c6 Stats traacking improvement Peter Boyle 2020-09-03 22:00:14 -04:00
  • 8c913e0edd Clearer UVM ttreatment Peter Boyle 2020-09-03 21:59:05 -04:00
  • fd3f93d8d3 Zero changes Peter Boyle 2020-09-03 21:57:11 -04:00
  • e9543cdacd Time deflation Peter Boyle 2020-09-03 21:56:02 -04:00
  • 98f7b3d298 Pcg Peter Boyle 2020-09-03 21:55:05 -04:00
  • b7b164ea24 Test operator and deebug code Peter Boyle 2020-09-03 21:54:20 -04:00
  • 77124d99d5 Merge branch 'develop' into feature/hw-multigrid Peter Boyle 2020-09-03 21:52:04 -04:00
  • 65b724bb5f 2 level hddcr Peter Boyle 2020-09-03 21:46:43 -04:00
  • 6dbd117aa5 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-09-03 20:30:49 -04:00
  • 198b29f618 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2020-09-03 20:29:54 -04:00
  • a8309638d4 UVM check in MPI calls Peter Boyle 2020-09-03 20:29:26 -04:00
  • f98a4e880e Merge pull request #310 from kostrzewa/accelerator_vector_stream_op_no_backspace Peter Boyle 2020-09-03 20:24:59 -04:00
  • 8244caff25 Remove the asynchronous non-Stencil calls. Peter Boyle 2020-09-03 18:52:55 -04:00
  • bcd7895362 Include cuda.h Peter Boyle 2020-09-03 15:49:13 -04:00
  • 85b1c5df39 A never hit case that is not 100% confident is asserted for safety Peter Boyle 2020-09-03 15:48:16 -04:00
  • b4255140d6 Stale data member eliminated Peter Boyle 2020-09-03 15:47:46 -04:00
  • 0c3095e173 Comms buffers to device memory Peter Boyle 2020-09-03 15:45:35 -04:00
  • d3ce60713d UVM, Device and Lattice/aligned allocators Peter Boyle 2020-09-03 15:44:13 -04:00
  • eac1f08b7b Close expressions passed as an argument Peter Boyle 2020-09-01 15:30:33 -04:00
  • 1654c4f3c0 Closure improved Peter Boyle 2020-09-01 15:29:45 -04:00
  • 8807d998bc closure improved Peter Boyle 2020-09-01 15:29:11 -04:00
  • 5791021dcd Speed up Cshift more with coalesced Peter Boyle 2020-09-01 15:28:15 -04:00
  • c273fb051c Peek poke laattice Peter Boyle 2020-09-01 15:27:59 -04:00
  • c545530170 little worry large Nbasis doesnt compile GPU Peter Boyle 2020-09-01 00:14:33 -04:00
  • d982a5b6d5 Fix coaarsened Peter Boyle 2020-09-01 00:14:04 -04:00
  • 15ca8637f3 No norms in HermOp Peter Boyle 2020-09-01 00:13:32 -04:00
  • cbc995b74c Made better interface Peter Boyle 2020-09-01 00:12:54 -04:00
  • 8b74174d74 Eigen tensor serialisatiino happy undeer GPU. Regret agreeing to let us couple Eigen types to Grid IO Peter Boyle 2020-09-01 00:03:26 -04:00
  • e21fef17df real and imag part not in ET Peter Boyle 2020-08-31 23:56:26 -04:00
  • 3d27708f07 Basic where test Peter Boyle 2020-08-31 23:55:49 -04:00
  • b918744184 Prettificatoin Peter Boyle 2020-08-31 23:54:46 -04:00
  • 7d14a3c086 Where working Peter Boyle 2020-08-31 23:53:46 -04:00
  • e14a84317d GPU math unary calls Peter Boyle 2020-08-31 23:50:49 -04:00
  • 6c31b99f1f I knew coupling Eigen Tensor to Grid serialisation was a bad iddea. Now the complex is different on GPU creates probblems Peter Boyle 2020-08-31 23:49:19 -04:00
  • 9522dcd611 Remove dead commented ouot coode Peter Boyle 2020-08-31 23:40:29 -04:00
  • ed469898dc coalesced ET expressions Peter Boyle 2020-08-31 23:38:40 -04:00
  • 1eee94a809 Sorting real/im in read coalesced GPU ET Peter Boyle 2020-08-31 23:36:49 -04:00
  • 54523369a3 do not use backspace in Coordinate output stream operator Bartosz Kostrzewa 2020-08-31 19:39:36 +02:00
  • a98c91c2a5 Merge pull request #309 from kostrzewa/format_benchmark_wilson_sweep Peter Boyle 2020-08-31 12:43:46 -04:00
  • a9b92867a8 use tabulator Bartosz Kostrzewa 2020-08-31 18:41:17 +02:00
  • 65920faeba correct formatting of Benchmark_wilson_sweep output Bartosz Kostrzewa 2020-08-31 18:39:27 +02:00
  • 249e2db87d Merge pull request #14 from DanielRichtmann/feature/gpt-coarsenedmatrix Christoph Lehner 2020-08-27 15:18:56 +02:00
  • cf3535d16e Expose more functions in CMat Daniel Richtmann 2020-08-27 14:06:48 +02:00
  • d61ee817f4 Merge pull request #13 from DanielRichtmann/feature/gpt-coarsenedmatrix Christoph Lehner 2020-08-27 12:11:06 +02:00
  • 3448b7387c Almost there to coalesced ET Peter Boyle 2020-08-26 17:04:49 -04:00
  • 47b89d2739 Pragma protection improvementt Peter Boyle 2020-08-26 17:04:27 -04:00
  • 2a75516330 state MPI/SLURM message only on world_rank zero Christoph Lehner 2020-08-26 12:34:17 -04:00
  • b2087f14c4 Fix CoarsenedMatrix regarding illegal memory accesses Daniel Richtmann 2020-08-24 16:54:36 +02:00
  • dd1ba266b2 Fix mapping between dir + disp and point in CMat Daniel Richtmann 2020-07-17 11:58:02 +02:00
  • 1292d59563 Add a typedef + broaden interface of CMat Daniel Richtmann 2020-06-11 13:16:00 +02:00
  • 9877ed9bf8 Merge pull request #12 from paboyle/develop Christoph Lehner 2020-08-22 16:35:35 +02:00
  • f0dc0f3621 fix compile issue on Qpace3 Christoph Lehner 2020-08-22 13:57:33 +02:00
  • 1efe30d6cc SLurm stop nodes using same GPU Peter Boyle 2020-08-21 02:02:53 +02:00
  • 0b787e9fe0 Avoid namespaec collision to make gcc happy Peter Boyle 2020-08-20 22:23:29 +02:00
  • 37ec4b241c Default thread count sensible Peter Boyle 2020-08-20 22:12:31 +02:00
  • 63b0a19f37 Merge pull request #11 from paboyle/develop Christoph Lehner 2020-08-20 20:53:39 +02:00
  • 90ea7dfa99 Accelerator loops for device resident comms buf Peter Boyle 2020-08-19 22:40:44 +02:00
  • f866d7c33e Merge pull request #307 from lehner/feature/gpt Peter Boyle 2020-08-18 23:27:21 -04:00
  • 542bdef198 cleanup comments Christoph Lehner 2020-08-14 18:39:44 +02:00
  • 06007db3d9 true shm_none implementation with GPUs that disables the use of device shared memory for the stencils Christoph Lehner 2020-08-14 18:37:00 +02:00
  • 12e6059a70 Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt Christoph Lehner 2020-08-13 16:16:52 +02:00
  • dbaa24ebf6 further GPU memory access fixes (with this GPT passes all single-rank tests on non-summit GPUs) Christoph Lehner 2020-08-13 16:14:15 +02:00
  • d8c0c0ba0a Fix and compiles feature/a2a-offload Peter Boyle 2020-08-12 14:35:08 -04:00
  • c6cf918d4c Typo Peter Boyle 2020-08-12 14:24:39 -04:00
  • 6d0a907c5c first try at A2A four quark offload Peter Boyle 2020-08-12 14:17:46 -04:00
  • 3276aa67dc Update Peter Boyle 2020-08-12 14:15:53 -04:00
  • f91e3af97f Checking in before trying to reduce memory footprint Chulwoo Jung 2020-08-08 22:11:14 -04:00
  • 3b30b9f0c0 Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt Christoph Lehner 2020-08-06 16:59:17 +02:00
  • 69db4816f7 fix variable capture in Scatter_plane_merge on accelerators Christoph Lehner 2020-08-06 16:57:16 +02:00
  • 3abe09025a when using SHM_NONE allow multiple ranks per node but without using shared memory Christoph Lehner 2020-08-06 14:42:38 +02:00
  • e33878e0de Trigger re-run of CI Christoph Lehner 2020-08-06 11:50:24 +02:00
  • 27b4fbf3f0 assert for forbidden code path and fix check for faster CPU codepath in basisRotate Christoph Lehner 2020-08-03 07:57:33 -04:00
  • 968a90633a Zero -> zeroit in Tensor_index Christoph Lehner 2020-07-31 02:07:17 -04:00
  • 6365a89ba3 create separate InitMessage for MemoryManager that can be called after communicator setup Christoph Lehner 2020-07-30 07:25:05 -04:00