1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-05-13 05:34:30 +01:00

Commit Graph

  • f98a4e880e Merge pull request #310 from kostrzewa/accelerator_vector_stream_op_no_backspace Peter Boyle 2020-09-03 20:24:59 -04:00
  • 8244caff25 Remove the asynchronous non-Stencil calls. Peter Boyle 2020-09-03 18:52:55 -04:00
  • bcd7895362 Include cuda.h Peter Boyle 2020-09-03 15:49:13 -04:00
  • 85b1c5df39 A never hit case that is not 100% confident is asserted for safety Peter Boyle 2020-09-03 15:48:16 -04:00
  • b4255140d6 Stale data member eliminated Peter Boyle 2020-09-03 15:47:46 -04:00
  • 0c3095e173 Comms buffers to device memory Peter Boyle 2020-09-03 15:45:35 -04:00
  • d3ce60713d UVM, Device and Lattice/aligned allocators Peter Boyle 2020-09-03 15:44:13 -04:00
  • eac1f08b7b Close expressions passed as an argument Peter Boyle 2020-09-01 15:30:33 -04:00
  • 1654c4f3c0 Closure improved Peter Boyle 2020-09-01 15:29:45 -04:00
  • 8807d998bc closure improved Peter Boyle 2020-09-01 15:29:11 -04:00
  • 5791021dcd Speed up Cshift more with coalesced Peter Boyle 2020-09-01 15:28:15 -04:00
  • c273fb051c Peek poke laattice Peter Boyle 2020-09-01 15:27:59 -04:00
  • c545530170 little worry large Nbasis doesnt compile GPU Peter Boyle 2020-09-01 00:14:33 -04:00
  • d982a5b6d5 Fix coaarsened Peter Boyle 2020-09-01 00:14:04 -04:00
  • 15ca8637f3 No norms in HermOp Peter Boyle 2020-09-01 00:13:32 -04:00
  • cbc995b74c Made better interface Peter Boyle 2020-09-01 00:12:54 -04:00
  • 8b74174d74 Eigen tensor serialisatiino happy undeer GPU. Regret agreeing to let us couple Eigen types to Grid IO Peter Boyle 2020-09-01 00:03:26 -04:00
  • e21fef17df real and imag part not in ET Peter Boyle 2020-08-31 23:56:26 -04:00
  • 3d27708f07 Basic where test Peter Boyle 2020-08-31 23:55:49 -04:00
  • b918744184 Prettificatoin Peter Boyle 2020-08-31 23:54:46 -04:00
  • 7d14a3c086 Where working Peter Boyle 2020-08-31 23:53:46 -04:00
  • e14a84317d GPU math unary calls Peter Boyle 2020-08-31 23:50:49 -04:00
  • 6c31b99f1f I knew coupling Eigen Tensor to Grid serialisation was a bad iddea. Now the complex is different on GPU creates probblems Peter Boyle 2020-08-31 23:49:19 -04:00
  • 9522dcd611 Remove dead commented ouot coode Peter Boyle 2020-08-31 23:40:29 -04:00
  • ed469898dc coalesced ET expressions Peter Boyle 2020-08-31 23:38:40 -04:00
  • 1eee94a809 Sorting real/im in read coalesced GPU ET Peter Boyle 2020-08-31 23:36:49 -04:00
  • 54523369a3 do not use backspace in Coordinate output stream operator Bartosz Kostrzewa 2020-08-31 19:39:36 +02:00
  • a98c91c2a5 Merge pull request #309 from kostrzewa/format_benchmark_wilson_sweep Peter Boyle 2020-08-31 12:43:46 -04:00
  • a9b92867a8 use tabulator Bartosz Kostrzewa 2020-08-31 18:41:17 +02:00
  • 65920faeba correct formatting of Benchmark_wilson_sweep output Bartosz Kostrzewa 2020-08-31 18:39:27 +02:00
  • 249e2db87d Merge pull request #14 from DanielRichtmann/feature/gpt-coarsenedmatrix Christoph Lehner 2020-08-27 15:18:56 +02:00
  • cf3535d16e Expose more functions in CMat Daniel Richtmann 2020-08-27 14:06:48 +02:00
  • d61ee817f4 Merge pull request #13 from DanielRichtmann/feature/gpt-coarsenedmatrix Christoph Lehner 2020-08-27 12:11:06 +02:00
  • 3448b7387c Almost there to coalesced ET Peter Boyle 2020-08-26 17:04:49 -04:00
  • 47b89d2739 Pragma protection improvementt Peter Boyle 2020-08-26 17:04:27 -04:00
  • 2a75516330 state MPI/SLURM message only on world_rank zero Christoph Lehner 2020-08-26 12:34:17 -04:00
  • b2087f14c4 Fix CoarsenedMatrix regarding illegal memory accesses Daniel Richtmann 2020-08-24 16:54:36 +02:00
  • dd1ba266b2 Fix mapping between dir + disp and point in CMat Daniel Richtmann 2020-07-17 11:58:02 +02:00
  • 1292d59563 Add a typedef + broaden interface of CMat Daniel Richtmann 2020-06-11 13:16:00 +02:00
  • 9877ed9bf8 Merge pull request #12 from paboyle/develop Christoph Lehner 2020-08-22 16:35:35 +02:00
  • f0dc0f3621 fix compile issue on Qpace3 Christoph Lehner 2020-08-22 13:57:33 +02:00
  • 1efe30d6cc SLurm stop nodes using same GPU Peter Boyle 2020-08-21 02:02:53 +02:00
  • 0b787e9fe0 Avoid namespaec collision to make gcc happy Peter Boyle 2020-08-20 22:23:29 +02:00
  • 37ec4b241c Default thread count sensible Peter Boyle 2020-08-20 22:12:31 +02:00
  • 63b0a19f37 Merge pull request #11 from paboyle/develop Christoph Lehner 2020-08-20 20:53:39 +02:00
  • 90ea7dfa99 Accelerator loops for device resident comms buf Peter Boyle 2020-08-19 22:40:44 +02:00
  • f866d7c33e Merge pull request #307 from lehner/feature/gpt Peter Boyle 2020-08-18 23:27:21 -04:00
  • 542bdef198 cleanup comments Christoph Lehner 2020-08-14 18:39:44 +02:00
  • 06007db3d9 true shm_none implementation with GPUs that disables the use of device shared memory for the stencils Christoph Lehner 2020-08-14 18:37:00 +02:00
  • 12e6059a70 Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt Christoph Lehner 2020-08-13 16:16:52 +02:00
  • dbaa24ebf6 further GPU memory access fixes (with this GPT passes all single-rank tests on non-summit GPUs) Christoph Lehner 2020-08-13 16:14:15 +02:00
  • d8c0c0ba0a Fix and compiles feature/a2a-offload Peter Boyle 2020-08-12 14:35:08 -04:00
  • c6cf918d4c Typo Peter Boyle 2020-08-12 14:24:39 -04:00
  • 6d0a907c5c first try at A2A four quark offload Peter Boyle 2020-08-12 14:17:46 -04:00
  • 3276aa67dc Update Peter Boyle 2020-08-12 14:15:53 -04:00
  • f91e3af97f Checking in before trying to reduce memory footprint Chulwoo Jung 2020-08-08 22:11:14 -04:00
  • 3b30b9f0c0 Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt Christoph Lehner 2020-08-06 16:59:17 +02:00
  • 69db4816f7 fix variable capture in Scatter_plane_merge on accelerators Christoph Lehner 2020-08-06 16:57:16 +02:00
  • 3abe09025a when using SHM_NONE allow multiple ranks per node but without using shared memory Christoph Lehner 2020-08-06 14:42:38 +02:00
  • e33878e0de Trigger re-run of CI Christoph Lehner 2020-08-06 11:50:24 +02:00
  • 27b4fbf3f0 assert for forbidden code path and fix check for faster CPU codepath in basisRotate Christoph Lehner 2020-08-03 07:57:33 -04:00
  • 968a90633a Zero -> zeroit in Tensor_index Christoph Lehner 2020-07-31 02:07:17 -04:00
  • 6365a89ba3 create separate InitMessage for MemoryManager that can be called after communicator setup Christoph Lehner 2020-07-30 07:25:05 -04:00
  • ddbb008694 Merge pull request #10 from lehner/feature/gpt-sycl Christoph Lehner 2020-07-30 13:12:09 +02:00
  • 7997e0a449 Merge branch 'feature/gpt' into feature/gpt-sycl Christoph Lehner 2020-07-30 13:11:31 +02:00
  • 197612bc7a fast cpu basisRotate and other small cleanups Christoph Lehner 2020-07-30 07:08:54 -04:00
  • 0e88bf4bff remove Nils's default pragma Christoph Lehner 2020-07-29 10:24:35 -04:00
  • 3e64d78469 include versions.h again and add back asserts in Test_simd Christoph Lehner 2020-07-29 10:18:05 -04:00
  • 2004611def Merge pull request #9 from nmeyer-ur/feature/a64fx-2 Christoph Lehner 2020-07-29 14:54:20 +02:00
  • a2868c96a4 Merge pull request #8 from paboyle/develop Christoph Lehner 2020-07-29 14:10:07 +02:00
  • 69bd7082f1 Merge branch 'rmhmc_fix' into feature/rmhmc feature/rmhmc Chulwoo Jung 2020-07-23 14:47:24 -04:00
  • 873f039c72 Test_hmc_WilsonGauge_Implicit compiles Chulwoo Jung 2020-07-23 14:45:47 -04:00
  • 16303e5f16 T_hmc_WilsonGauge_Implicit not compiling with latest develop Chulwoo Jung 2020-07-22 23:00:58 -04:00
  • 88683fa648 Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rmhmc Chulwoo Jung 2020-07-22 15:54:07 -04:00
  • 43298ef681 Make precision switchable for cublas routines Chulwoo Jung 2020-07-22 14:49:32 -04:00
  • 7cf7f11e1a Doc recompile Peter Boyle 2020-07-22 14:44:11 -04:00
  • ea7f8fda5e fix typo nmeyer-ur 2020-07-22 09:34:05 +02:00
  • 906b78811b exit in Init when using --comms-overlap nmeyer-ur 2020-07-22 08:57:01 +02:00
  • 7e70df27e4 Confirmed double precision working Chulwoo Jung 2020-07-21 00:48:46 -04:00
  • c55d657736 Merge branch 'dev-BlockLanczosOpt' of https://github.com/yongchull/Grid into feature/block_lanczos Chulwoo Jung 2020-07-20 16:36:46 -04:00
  • fe5b23e144 RMHMC implementation, originaly from Guido Cossu Chulwoo Jung 2020-07-19 01:42:31 -04:00
  • e1327e7ea0 Optional bounds check debug code Peter Boyle 2020-07-16 16:57:46 -04:00
  • 569f78c2cf Stenccil improvement Peter Boyle 2020-07-16 16:57:13 -04:00
  • 488c79d5a1 Bound improvement Peter Boyle 2020-07-15 19:58:08 -04:00
  • 97703b181b Merge pull request #7 from paboyle/develop Christoph Lehner 2020-07-12 16:24:53 +02:00
  • d9474c6cb6 compiler-independent build using --enable-simd=A64FX nmeyer-ur 2020-07-09 10:07:02 +02:00
  • bbd145382b enable --enable-simd=A64FX in configure nmeyer-ur 2020-07-08 12:43:51 +02:00
  • 1b08cb7300 Merge branch 'develop' into feature/a64fx-2 nmeyer-ur 2020-07-08 08:18:18 +02:00
  • 337d9dc043 move barrier in Benchmark_wilson nmeyer-ur 2020-07-08 08:13:40 +02:00
  • 8726e94ea7 merge upstream develop nmeyer-ur 2020-07-07 20:26:47 +02:00
  • 67db4993c2 reset head, update SVE readme nmeyer-ur 2020-07-07 19:54:52 +02:00
  • f1f655d92b Merge pull request #304 from Heinrich-BR/develop syck portelli 2020-07-06 10:16:03 +01:00
  • 43334e88c3 Tiny change in a comment for clarity h.b.rocha 2020-07-04 16:11:16 +01:00
  • 4f1e66b044 Fixed HMC SU(N) integrator which was causing fields to leave Lie Algebra manifold for N>2 h.b.rocha 2020-07-04 03:53:06 +01:00
  • dc6b0f20b2 Fixed array bounds Peter Boyle 2020-07-02 12:20:20 -04:00
  • c0badc3e16 Summit bounce back to git Peter Boyle 2020-07-02 10:48:39 -04:00
  • fd3c8b0e85 correct build instructions qp4 nmeyer-ur 2020-07-01 09:00:38 +02:00
  • 58f6529b55 Slowly piecing together Peter Boyle 2020-06-30 16:42:03 -04:00
  • e3f056dfbb Hw multigrid operator Peter Boyle 2020-06-30 16:10:16 -04:00
  • da0ffa7a79 Two spin update defer commit to repository Peter Boyle 2020-06-30 16:09:48 -04:00