1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-06-19 02:13:42 +01:00
Commit Graph

97 Commits

Author SHA1 Message Date
paboyle 5adf2657dd Updated to compile and run fast on CUDA 2025-08-10 00:00:13 +01:00
paboyle 9e6a4a4737 Assertion updates to macros (mostly) with backtrace.
WIlson flow to include options for DBW2, Iwasaki, Symanzik.
View logging for data assurance
2025-08-07 15:48:38 +00:00
paboyle 41f344bbd3 Merge with Christoph GPT checksum debug 2025-07-15 03:06:09 +00:00
paboyle bffb83c46e std::cout<<GridLogMessage<<"Debug:"<<std::endl;
std::cout<<GridLogMessage<<"  --dylib-map     : print dynamic library map, useful for interpreting signal backtraces "<<std::endl;
    std::cout<<GridLogMessage<<"  --heartbeat     : periodic itimer wakeup (interrupts stuck system calls!) "<<std::endl;
    std::cout<<GridLogMessage<<"  --signal-delay n : pause for n seconds after signal handling (useful to get ALL nodes in stuck state) "<<std::endl;
    std::cout<<GridLogMessage<<"  --debug-stdout  : print stdout from EVERY node to file Grid.stdout/err.rank "<<std::endl;
    std::cout<<GridLogMessage<<"  --debug-signals : catch sigsegv and print a blame report, handle SIGHUP with a backtrace to stderr"<<std::endl;
    std::cout<<GridLogMessage<<"  --debug-heartbeat : periodically report backtrace "<<std::endl;

--dylib-map : Grid prints its dylib regions
--heartbeat : itimer based / SIGALRM wake up which seems to make Aurora
more stable
--debug-heartbeat : periodically report to stderr where we are in code

Now have libunwind option (configure: --with-unwind=<prefix>) to give an
Asynch-Signal safe backtrace. Avoid glibc backtrace due to mallocs.
2025-06-27 06:08:54 +00:00
paboyle b78fc73d19 Better signal handler 2025-06-27 06:07:34 +00:00
paboyle d418f78352 Making running on Aurora more debuggable 2025-05-23 20:58:16 +00:00
paboyle 1d22841811 Working on aurora, GPT issue turned up is fixed 2025-03-06 03:20:18 +00:00
paboyle 6ae809ed40 Print not liked on GPT compile 2025-02-27 20:12:49 +00:00
paboyle b2ce760cf4 Verbose issue with GPT 2025-02-25 16:55:23 +00:00
Peter Boyle 5ae77876a8 Meson field and Aslash field on GPU; some compiler warning removed 2024-10-18 19:08:06 -04:00
paboyle fd58f0b669 Return ok 2024-10-11 03:21:21 +00:00
paboyle ec1395a304 Better flight logging 2024-10-10 22:01:57 +00:00
paboyle 02c8178f16 Almost working on Aurora 2024-09-23 09:43:50 +00:00
paboyle 066544281f Deprecate UVM 2024-09-17 13:34:27 +00:00
Peter Boyle 9a1ad6a5eb Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-05-17 11:33:46 -04:00
Peter Boyle 0ac85fa70b Serialisation removal 2024-05-16 10:49:04 -04:00
Peter Boyle 24602e1259 Accidental synchronise 2024-05-07 17:28:38 +00:00
Peter Boyle 8a098889fc Update FlightRecorder.cc 2024-04-30 21:15:08 +01:00
Peter Boyle 5c3ace7c3e Merge branch 'develop' into feature/scidac-wp1 2024-04-30 05:26:06 -04:00
Peter Boyle 1bd20cd9e8 FlightRecorder 2024-03-22 15:40:01 +00:00
Peter Boyle 6f59fed563 Flight recorder, resurrecting the "world famous" Britney test 2024-03-22 15:32:32 +00:00
Peter Boyle 60b7f6c99d Flight recorder, resurrecting the "world famous" Britney test 2024-03-22 15:32:26 +00:00
Peter Boyle b92dfcc8d3 Flight recorder, resurrecting the "world famous" Britney test 2024-03-22 15:30:27 +00:00
Peter Boyle f6fd6dd053 Flight recorder, resurrecting the "world famous" Britney test 2024-03-22 15:30:01 +00:00
Peter Boyle fab1efb48c More britney logging improvements 2024-03-19 14:36:21 +00:00
Peter Boyle 62e7bf024a Updated flight logging for Britney test 2024-03-12 20:10:04 +00:00
paboyle 2704b82084 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2024-03-12 15:16:24 +00:00
paboyle cf8632bbac Britney test option 2024-03-12 15:15:35 +00:00
portelli f17b8de907 fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined 2024-03-07 15:22:08 +09:00
Peter Boyle cc04dc42dc Merge branch 'develop' into feature/scidac-wp1 2024-03-06 14:55:21 -05:00
paboyle 1b93a9be88 Print out the hostname 2024-03-06 00:01:58 +00:00
Peter Boyle 839f9f1bbe Don't log memory by default 2024-01-17 16:25:50 -05:00
Peter Boyle 82fc4b1e94 Finalise 2023-11-23 18:19:41 -05:00
Peter Boyle b4f1740380 Finalise message 2023-11-23 18:19:16 -05:00
Peter Boyle b3925df9c3 Verbose on CPU-GPU xfer, remove performance by default 2023-10-24 10:25:01 -04:00
Peter Boyle afe10ba2a2 More digits 2023-10-20 19:27:13 -04:00
Peter Boyle 6a87487544 Running on Frontier, fix RNG big volume y2k, affecting 5D RNG 2023-10-05 16:50:59 -04:00
Peter Boyle 755002da9c Comparison convenience 2023-09-25 17:16:33 -04:00
Peter Boyle d57ed25071 Merge branch 'feature/dirichlet' into feature/block_lanczos22 2023-03-24 12:08:09 -04:00
Chulwoo Jung 82c1ecf60f Block lanczos added 2022-11-30 16:08:40 -05:00
Peter Boyle 1177b8f661 Merge branch 'develop' into feature/dirichlet 2022-08-31 19:05:57 -04:00
Gurtej Kanwar 554c238359 Update OpenSSL digest to use high-level methods
This avoids deprecation warnings when compiling against OpenSSL 3.0
but should still be backwards compatible. It is the recommended way
to use the digest API going forward.
2022-08-21 17:28:57 +02:00
Peter Boyle f73db8f1f3 Synch clocks 2022-07-11 13:47:39 -04:00
Peter Boyle 63dbaeefaa Extra barrier prior to finalize just in case it fixes an issue on Tursa 2022-02-16 14:01:43 +00:00
Daniel Richtmann b4f8e87982 Have Grid's cli interface understand floats 2022-02-01 23:09:06 +01:00
Peter Boyle 5398b7e7e3 Max 128 size 2021-10-26 09:16:29 -07:00
Peter Boyle 68650b61fe Options controlling behaviour 2021-09-21 00:51:01 +02:00
Peter Boyle 5b3c530aa7 Return value 2021-08-23 15:30:45 +01:00
Peter Boyle c6a5499c8b Fail on non-apple 2021-08-22 18:40:55 +01:00
Peter Boyle ffbdd91e0e Apple happiness 2021-08-20 01:15:00 +01:00