1
0
mirror of https://github.com/paboyle/Grid.git synced 2025-08-09 16:07:06 +01:00

Commit Graph

  • b281b0166e Put the barrier in the subroutine Peter Boyle 2023-04-04 07:36:03 -07:00
  • 6a21f694ff Apply barrier in Gather kernel sequence. Could place before comms, or in Gather, but decided to insist Gather means Gather is done Peter Boyle 2023-04-04 07:33:24 -07:00
  • fc4db5e963 Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-04-03 18:26:11 -04:00
  • 6252ffaf76 No unified Peter Boyle 2023-04-03 18:25:22 -04:00
  • 026e736dfa Projection on algebra can now be templated. Fix #12 Alessandro Lupo 2023-04-03 16:31:19 +01:00
  • 4275b3f431 Fix typo and remove unnecessary lines Alessandro Lupo 2023-04-03 12:01:52 +01:00
  • af64c1c6b6 Had managed to drop the accelerator_barrier() in the Wilson Compressor gather Peter Boyle 2023-03-30 17:34:44 -04:00
  • 866f48391a Temporary fix for develop incorrect results Peter Boyle 2023-03-30 17:10:13 -04:00
  • a4df527d74 Merge pull request #428 from mmphys/bugfix/comm_none Peter Boyle 2023-03-30 08:38:14 -04:00
  • 5764d21161 Fixes for --enable-comms=none Michael Marshall 2023-03-30 10:15:28 +01:00
  • 496d04cd85 Weaken the Fence Peter Boyle 2023-03-29 18:58:51 -04:00
  • 12d20d8e15 Merge branch 'release/0.10.0' master Peter Boyle 2023-03-29 16:35:33 -04:00
  • 10e6d7c6ce Merge branch 'feature/dirichlet' into develop Peter Boyle 2023-03-29 16:26:47 -04:00
  • c42e25e5b8 Dirichlet remove Peter Boyle 2023-03-29 16:25:52 -04:00
  • 25777e5967 Merge branch 'release/0.9.0' Peter Boyle 2023-03-29 15:27:58 -04:00
  • a00ae981e0 Fence propagation from SYCL release/0.9.1 Peter Boyle 2023-03-29 15:00:40 -04:00
  • 58e020b62a Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-29 14:37:40 -04:00
  • a7e1aceeca Compile fix on Nvidia Peter Boyle 2023-03-29 14:36:50 -04:00
  • 7212432f43 More careful fencing Peter Boyle 2023-03-28 20:10:22 -07:00
  • 4a261fab30 Changes premerge to develop Peter Boyle 2023-03-28 20:04:21 -07:00
  • 6af97069b9 Preparing for close of feature/dirichlet Initial code change review complete Peter Boyle 2023-03-28 13:39:44 -07:00
  • 5068413cdb Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-28 08:35:38 -07:00
  • 71c6960eea Commet Peter Boyle 2023-03-28 08:34:24 -07:00
  • ddf6d5c9e3 Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-28 11:33:05 -04:00
  • 39214702f6 feat: indentation fixed. Fabian Joswig 2023-03-28 16:30:34 +02:00
  • 3e4614c63a feat: draft for bug-report issue template added. Fabian Joswig 2023-03-28 16:24:35 +02:00
  • 900e01f49b Temporary Peter Boyle 2023-03-27 21:35:06 -07:00
  • 2376156fbc Merge branch 'develop' into feature/dirichlet Peter Boyle 2023-03-27 21:33:50 -07:00
  • 3f2fd49db4 Merge branch 'develop' of https://github.com/paboyle/Grid into develop Peter Boyle 2023-03-27 17:29:54 -07:00
  • 0efa107cb6 Script update Peter Boyle 2023-03-27 17:29:43 -07:00
  • 8feedb4f6f Include files moved Peter Boyle 2023-03-27 17:29:21 -07:00
  • 05e562e3d7 Move the copy synch out to stencil and do one per call instead of one per packet Peter Boyle 2023-03-27 17:28:38 -07:00
  • dd3bbb8fa2 MOve the synchronise out to the stencil so one call instead of one call per packet Peter Boyle 2023-03-27 17:27:45 -07:00
  • 2fbcf13c46 SYCL fix Peter Boyle 2023-03-27 14:25:14 -07:00
  • 4ea48ef0c4 Merge pull request #419 from lehner/feature/gpt Peter Boyle 2023-03-24 15:42:16 -04:00
  • 5c85774ee3 Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-24 15:40:57 -04:00
  • d8a9a745d8 stream synchronise Peter Boyle 2023-03-24 15:40:30 -04:00
  • dcf172da3b Merge pull request #415 from paboyle/feature/block_lanczos22 Peter Boyle 2023-03-24 12:08:16 -04:00
  • d57ed25071 Merge branch 'feature/dirichlet' into feature/block_lanczos22 feature/block_lanczos22 Peter Boyle 2023-03-24 12:08:09 -04:00
  • 546be724e7 Merge pull request #421 from UniOfLeicester/feature/accel_Copy_plane Peter Boyle 2023-03-24 12:04:06 -04:00
  • 8a1b9073f9 Mshift update Peter Boyle 2023-03-23 15:39:30 -04:00
  • 1a7114d4b9 Temporary algorithm while sorting out mixed prec Peter Boyle 2023-03-23 15:38:35 -04:00
  • 3f385f717c Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-23 14:52:53 -04:00
  • 481bbaf1fc Interface to query memory use Peter Boyle 2023-03-23 12:55:31 -04:00
  • 281488611a WriteDiscard on construct Peter Boyle 2023-03-23 10:28:50 -04:00
  • c180a52518 Merge branch 'feature/dirichlet' of https://www.github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-23 10:28:01 -04:00
  • 90130e25e9 TODO list Peter Boyle 2023-03-23 10:27:02 -04:00
  • 23298acb81 Merge pull request #424 from giltirn/feature/dirichlet-precchange Peter Boyle 2023-03-22 23:04:52 -04:00
  • 52384e34cf Discard on construct Peter Boyle 2023-03-22 19:40:32 -04:00
  • d0bb033ea2 Device resident GPU block buffer instead of UVM as hit likely UVM bug. Code worked on CUDA 11.4 but fails on later drivers (certainly 530.30.02, but need to find the perlmutter driver version). Peter Boyle 2023-03-22 19:07:32 -04:00
  • c6621806ca Compiling on laptop and running Peter Boyle 2023-03-21 17:27:09 -04:00
  • 0b6f0f6d2f Merge branch 'feature/dirichlet' of https://www.github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-21 16:06:55 -04:00
  • b5b759df73 Merge branch 'develop' into feature/dirichlet Peter Boyle 2023-03-21 16:05:46 -04:00
  • 7db8dd7a95 Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet Peter Boyle 2023-03-21 16:04:27 -04:00
  • 8b43be39c0 Config command Peter Boyle 2023-03-21 16:00:52 -04:00
  • f17f879206 Test update Peter Boyle 2023-03-21 15:59:29 -04:00
  • 68428fceab Integrator update Peter Boyle 2023-03-21 15:58:49 -04:00
  • 4135f2dcd1 Compressor Peter Boyle 2023-03-21 15:41:41 -04:00
  • c5bdf61215 AUdit fix Peter Boyle 2023-03-21 15:38:39 -04:00
  • 88e218e8ee Stencil updates Peter Boyle 2023-03-21 15:37:58 -04:00
  • 0f2b786436 Vector -> vector Peter Boyle 2023-03-21 15:36:11 -04:00
  • e1c326558a COmms improvements Peter Boyle 2023-03-21 08:53:56 -07:00
  • bae0f8ea99 Merge pull request #425 from rrhodgson/feature/CacheLogging Peter Boyle 2023-03-21 08:59:08 -04:00
  • bbbcd36ae5 Merge pull request #426 from rrhodgson/feature/LCDeflation Peter Boyle 2023-03-21 08:58:40 -04:00
  • 39c0815d9e WriteDiscard Peter Boyle 2023-03-21 08:57:29 -04:00
  • 1b8176e2c0 fix code duplication Alessandro Lupo 2023-03-17 14:58:00 +00:00
  • cbc053c3db Revert "projection on Sp2n algebra, to be used instead of Ta" Alessandro Lupo 2023-03-17 11:36:58 +00:00
  • cdf3f6ef6e Merge branch 'refactoring_sp2n' of https://github.com/LupoA/Grid into refactoring_sp2n Alessandro Lupo 2023-03-15 15:59:50 +00:00
  • ba7f9d7b70 projection on Sp2n algebra, to be used instead of Ta Alessandro Lupo 2023-03-15 15:55:12 +00:00
  • a997d24743 Remove nofma Peter Boyle 2023-03-14 12:10:31 -07:00
  • 861e5d7f4c SYCL version update. Why do they keep making incompatible changes Peter Boyle 2023-03-14 12:10:02 -07:00
  • 14cc142a14 Warning remove Peter Boyle 2023-03-14 12:09:26 -07:00
  • f36b87deb5 syscall fix Peter Boyle 2023-03-14 12:09:00 -07:00
  • eeb6e0a6e3 Renable cache blocking and efficient UPI type SHM comms Peter Boyle 2023-03-14 09:10:27 -07:00
  • cad5b187dd Cleanup Peter Boyle 2023-03-14 09:08:16 -07:00
  • 87697eb07e SHared compile Peter Boyle 2023-03-14 09:07:36 -07:00
  • 371fd123fb consequence of iSUnMatrix being no longer a member of the SU class Alessandro Lupo 2023-03-14 10:47:07 +00:00
  • d6ff644aab Towards the day all tests compile Alessandro Lupo 2023-03-14 10:43:25 +00:00
  • 29586f6b5e Deactivate some tests for Nc!=3 Julian Lenz 2023-03-13 08:17:14 +00:00
  • fd057c838f add ProjectOnGaugeGroup and ProjectGn to allow future templating in GaugeImplTypes Alessandro Lupo 2023-03-10 12:10:46 +00:00
  • f51222086c Move functions from GaugeGroup to group specific implementations Alessandro Lupo 2023-03-09 16:22:20 +00:00
  • a3e935c902 Batched block project/promote size checks Raoul Hodgson 2023-02-27 11:38:16 +00:00
  • 7731c7db8e Add huge cache type and allow Ncache==0 Raoul Hodgson 2023-02-26 14:15:28 +00:00
  • ff97340324 Expose cached bytes Raoul Hodgson 2023-02-26 12:22:45 +00:00
  • 83d86943db Fixed compile bug in MemoryManagerShared caused by Audit function not being passed a string Christopher Kelly 2023-02-23 13:09:45 -05:00
  • e82cf1d311 Further prec-change improvements Christopher Kelly 2023-02-23 09:45:29 -05:00
  • 1db58a8acc Precision change improvements Christopher Kelly 2023-02-21 10:52:42 -05:00
  • 920a51438d Added batched Mixed precision CG Raoul Hodgson 2023-02-14 17:04:13 +00:00
  • be528b6d27 Add batched block project/promote functions Raoul Hodgson 2023-02-14 14:37:10 +00:00
  • f73691ec47 Merge pull request #18 from nickforce989/sp2n/newbranch Alessandro Lupo 2023-02-13 10:22:27 +01:00
  • ccd21f96ff Plaquette agreeing and moving to final form (slowly) need to optimise Peter Boyle 2023-02-01 22:57:44 -05:00
  • 4b90cb8888 First cut passes combining padded cell with general stencil towards fast plaquette and staggered force Peter Boyle 2023-02-01 22:14:10 -05:00
  • 7ebda3e9ec Merge commit 'b10e1b7bc8bec809f874e9e48a3ccc7b2619c9d1' into sp2n/newbranch Niccolo Forzano 2023-01-19 12:10:18 +00:00
  • b10e1b7bc8 Fixed files giving zero force computation on GPU, issue #8 Niccolo Forzano 2023-01-18 18:04:47 +00:00
  • 796abfad80 Merge pull request #422 from fjosw/fix/NVCC_DIAG_PRAGMA_SUPPORT Peter Boyle 2023-01-17 09:34:49 -05:00
  • ad0270ac8c fix: diagnostic pragma warnings fixed for CUDA 12+ Fabian Joswig 2023-01-12 12:36:30 +00:00
  • 7d62f1d6d2 Populate the Cshift_table in the GPU Makis Kappas 2023-01-11 21:26:25 +00:00
  • 458c943987 merged upstream Christoph Lehner 2022-12-31 11:16:21 +02:00
  • 88015b0858 Split sum in rankSum and GlobalSum Christoph Lehner 2022-12-26 10:01:32 +01:00
  • 8b91b61b61 First cut at faster GPU slice sum via atomics feature/felix-slice-sum-fast Peter Boyle 2022-12-22 15:13:45 -05:00