1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-04-03 18:55:56 +01:00

Commit Graph

  • ccf147d6c1 Select the compiler that gives better performance on sunspot Peter Boyle 2024-05-07 18:45:56 +00:00
  • 7aa12b446f New config command for sunspot Peter Boyle 2024-05-07 18:45:40 +00:00
  • c293228102 layout control Peter Boyle 2024-05-07 18:45:21 +00:00
  • 5c4c9f721a Remove pbs file and replace with bench1 and bench2 for 1 and 2 nodes Peter Boyle 2024-05-07 18:44:49 +00:00
  • 057f86c1de 2 queues works ok in performance Peter Boyle 2024-05-07 18:42:50 +00:00
  • cd52e3cbc2 Jobs on subspot Peter Boyle 2024-05-07 18:38:15 +00:00
  • 24602e1259 Accidental synchronise Peter Boyle 2024-05-07 17:28:38 +00:00
  • 8a098889fc
    Update FlightRecorder.cc Peter Boyle 2024-04-30 21:15:08 +01:00
  • 5c3ace7c3e Merge branch 'develop' into feature/scidac-wp1 Peter Boyle 2024-04-30 05:26:06 -04:00
  • aa148455b7 Updated todo list Peter Boyle 2024-04-30 05:24:39 -04:00
  • 98cf247f33 prepare to switch to mixed precision Peter Boyle 2024-04-30 05:23:45 -04:00
  • 0cf16522d1 Refine with HDCG choice Peter Boyle 2024-04-30 05:22:14 -04:00
  • 7b7c75f9e5 Setup Peter Boyle 2024-04-30 05:21:02 -04:00
  • aefd255a3c Verbose Peter Boyle 2024-04-30 05:20:41 -04:00
  • 1c5aa939fd Subspace setup changes Peter Boyle 2024-04-30 05:19:09 -04:00
  • 3a0ff17be0 Verbose changes Peter Boyle 2024-04-30 05:17:28 -04:00
  • 47829ae5cc Verbose changes Peter Boyle 2024-04-30 05:16:46 -04:00
  • bfa7b69aff Verbose changes Peter Boyle 2024-04-16 15:42:46 -04:00
  • 2aaa959b5f Printing changes Peter Boyle 2024-04-16 15:41:25 -04:00
  • ce2970b93a Printing changes Peter Boyle 2024-04-16 15:40:38 -04:00
  • 7b76970d10 Verbose changes Peter Boyle 2024-04-16 15:40:10 -04:00
  • 9fd41882d2 Herm Op update Peter Boyle 2024-04-16 15:39:27 -04:00
  • ff2ea5de18
    Update Tensor_traits.h Peter Boyle 2024-04-11 14:25:45 -04:00
  • 5147a42818 Updated hdcg Peter Boyle 2024-04-05 01:05:57 -04:00
  • 57552d8ca3 Assign from non-lattice made accelerator resident Peter Boyle 2024-04-05 01:05:12 -04:00
  • 13713b2a76 Much faster little dirac operator calculation Peter Boyle 2024-04-05 01:04:40 -04:00
  • 36a14e4ee3 Best setup and introduce an HDCG refine method Peter Boyle 2024-04-05 01:03:33 -04:00
  • b4cc788b8c First version used in mrhsHDCG Need to consolidate files. Plan: Make this version able to go virtual base, then absorb chulwoos version when it is proven Peter Boyle 2024-04-05 01:02:21 -04:00
  • 0f0e7512f3 Keep MRHS in a different file Peter Boyle 2024-04-05 00:59:53 -04:00
  • 1196b1a161 Less verbose Peter Boyle 2024-04-05 00:58:58 -04:00
  • 2c8c3be9ee Adef2Mrhs Peter Boyle 2024-04-05 00:57:13 -04:00
  • 5b79d51c22 Improvements Peter Boyle 2024-04-01 14:18:40 -04:00
  • da890dc293 Verbose changes Peter Boyle 2024-04-01 14:18:00 -04:00
  • 93d0a1e73a HISQ view call Peter Boyle 2024-04-01 14:16:47 -04:00
  • f0a8c7d045 Playing with chebyshevs Peter Boyle 2024-04-01 14:16:11 -04:00
  • db8793777c Logging/verbose Peter Boyle 2024-04-01 14:15:41 -04:00
  • c745484e65 9.5x speed up version Peter Boyle 2024-04-01 14:14:30 -04:00
  • da59379612 Large reg file for double feature/gparity-merge-april24 Peter Boyle 2024-03-26 17:03:20 +00:00
  • 3ef2a41518 ifdef guard ommitted Peter Boyle 2024-03-26 14:50:32 +00:00
  • aa96f420c6 Acclerator ware MPI guard on the Unix domain sockets Peter Boyle 2024-03-26 14:41:25 +00:00
  • 49e9e4ed0e Fences Peter Boyle 2024-03-26 14:14:06 +00:00
  • f7b8163016 Deterministic MPI reduce options Peter Boyle 2024-03-26 14:11:40 +00:00
  • 93769eacd3 Updated configure for bounce through host Peter Boyle 2024-03-26 14:10:24 +00:00
  • 59b0cc11df REduce the time in single Peter Boyle 2024-03-26 00:42:40 +00:00
  • f32c275376 Updated config options for MPI not being aware of GPU Peter Boyle 2024-03-26 00:42:00 +00:00
  • 5404fc66ab Merge needs a fence on SYCL Peter Boyle 2024-03-26 00:38:41 +00:00
  • 1f53458af8 Options to bounce through a host buffer if --disable-accelerator-aware-mpi Peter Boyle 2024-03-26 00:37:19 +00:00
  • 434c3e7f1d We have a choice of GET or PUT across NVlink Peter Boyle 2024-03-25 14:32:44 +00:00
  • 500b119f3d Deterministic MPI Peter Boyle 2024-03-22 15:55:23 +00:00
  • 4b87259c1b New config command for sunspot Peter Boyle 2024-03-22 15:43:49 +00:00
  • 503dec34ef This appears working now on Sunspot Peter Boyle 2024-03-22 15:43:30 +00:00
  • d1e9fe50d2 Xor csum for repro testing Peter Boyle 2024-03-22 15:42:57 +00:00
  • d01e5fa838 Improved FlightRecorder Peter Boyle 2024-03-22 15:42:32 +00:00
  • a477c25e8c Sunspot repro tests Peter Boyle 2024-03-22 15:42:11 +00:00
  • 1bd20cd9e8 FlightRecorder Peter Boyle 2024-03-22 15:40:01 +00:00
  • e49e95b037 Upgrade of the Britney test with flight recorder and fast xor checksum Peter Boyle 2024-03-22 15:39:27 +00:00
  • 6f59fed563 Flight recorder, resurrecting the "world famous" Britney test Peter Boyle 2024-03-22 15:32:32 +00:00
  • 60b7f6c99d Flight recorder, resurrecting the "world famous" Britney test Peter Boyle 2024-03-22 15:32:26 +00:00
  • b92dfcc8d3 Flight recorder, resurrecting the "world famous" Britney test Peter Boyle 2024-03-22 15:30:27 +00:00
  • f6fd6dd053 Flight recorder, resurrecting the "world famous" Britney test Peter Boyle 2024-03-22 15:30:01 +00:00
  • 79ad567dd5 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-03-19 15:43:42 +00:00
  • fab1efb48c More britney logging improvements Peter Boyle 2024-03-19 14:36:21 +00:00
  • 660eb76d93 FFTW from OneAPI Peter Boyle 2024-03-19 14:28:33 +00:00
  • 461cd045c6 sliceSum cleanup dbollweg 2024-03-13 18:18:44 -04:00
  • fee65d7a75
    Merge branch 'paboyle:develop' into sycl_slicesum_update dbollweg 2024-03-13 18:06:17 -04:00
  • 31f9971dbf avoid PI_ERROR_OUT_OF_RESOURCES in sycl sliceSum dbollweg 2024-03-13 13:39:26 -04:00
  • 62e7bf024a Updated flight logging for Britney test Peter Boyle 2024-03-12 20:10:04 +00:00
  • 95f3d69cf9 Extra hardware test hook Peter Boyle 2024-03-12 20:09:37 +00:00
  • 89c0519f83 Repro test Peter Boyle 2024-03-12 16:11:33 +00:00
  • 2704b82084 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-03-12 15:16:24 +00:00
  • cf8632bbac Britney test option Peter Boyle 2024-03-12 15:15:35 +00:00
  • d224297972 PBS scripts Peter Boyle 2024-03-12 15:15:16 +00:00
  • a4d11a630f
    Merge pull request #458 from paboyle/fix/HOST_NAME_MAX Peter Boyle 2024-03-07 07:50:25 -05:00
  • 2b4399f8b1 more HOST_NAME_MAX fix fix/HOST_NAME_MAX Antonin Portelli 2024-03-07 15:26:01 +09:00
  • f17b8de907 fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined Antonin Portelli 2024-03-07 15:22:08 +09:00
  • d87296f3e8 Merge branch 'develop' of https://github.com/dbollweg/Grid into develop dbollweg 2024-03-06 16:54:22 -05:00
  • be94cf1c6f Fewer wait-calls in sycl slicesum dbollweg 2024-03-06 16:53:13 -05:00
  • cc04dc42dc Merge branch 'develop' into feature/scidac-wp1 Peter Boyle 2024-03-06 14:55:21 -05:00
  • 070b61f08f Simplifying the MultiRHS solver to make it do SRHS *and* MRHS Peter Boyle 2024-03-06 14:04:33 -05:00
  • 7e5bd46dd3 Booster update Peter Boyle 2024-03-06 19:03:45 +01:00
  • 228bbb9d81 Benchmark results Peter Boyle 2024-03-06 19:03:35 +01:00
  • b812a7b4c6 Staggered launch script Peter Boyle 2024-03-06 01:32:40 +00:00
  • 891a366f73 Repro CG script Peter Boyle 2024-03-06 01:22:55 +00:00
  • 10116b3be8 Force device copyable and tell SYCL to shut it. Peter Boyle 2024-03-06 01:13:27 +00:00
  • a46a0f0882 force device copyable and don't take crap from SYCL Peter Boyle 2024-03-06 01:12:49 +00:00
  • a26a8a38f4 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2024-03-06 00:05:00 +00:00
  • 7435315d50 More blasted shell variables Peter Boyle 2024-03-06 00:03:59 +00:00
  • 9b5f741e85 Reproducing CG can be more useful now Peter Boyle 2024-03-06 00:03:16 +00:00
  • 517822fdd2 SPR HBM benchmarking right and also PVC batched GEMM Peter Boyle 2024-03-06 00:02:27 +00:00
  • 1b93a9be88 Print out the hostname Peter Boyle 2024-03-06 00:01:58 +00:00
  • 783a66b348 Deterministic reduction please Peter Boyle 2024-03-06 00:01:37 +00:00
  • 976c3e9b59 Hack for flight logging CG inner products. Can be made to work, but could put in some more serious infrastructure for repro testing and blame attribution (Britney test) if necessary Peter Boyle 2024-03-05 23:59:57 +00:00
  • f8ca971dae Use of a bare PRECISION macro is not namespace safe and collides with SYCL Peter Boyle 2024-03-05 23:59:13 +00:00
  • 21bc8c24df OneMKL batched blas starting Peter Boyle 2024-03-05 23:58:20 +00:00
  • 30228214f7 SYCL conflict with Eigen Peter Boyle 2024-03-05 23:56:10 +00:00
  • 2ae980ae43
    Update sourceme.sh Peter Boyle 2024-03-05 13:39:18 -05:00
  • 6153dec2e4
    Update setup.sh Peter Boyle 2024-03-05 13:38:32 -05:00
  • c805f86343 USQCD benchmark Peter Boyle 2024-03-01 00:05:04 -05:00
  • 04ca065281 Only one rank opens Peter Boyle 2024-02-29 20:09:11 -05:00
  • 88d8fa43d7 Benchmark development Peter Boyle 2024-02-29 20:01:44 -05:00