Thomas Blum
ce8b52749d
Merge remote-tracking branch 'origin/develop' into feature/staggered-hdcg
2026-05-27 16:20:47 -04:00
Peter Boyle
86c7f29183
Config command update
2026-05-27 16:19:33 -04:00
Thomas Blum
bbdc8e95f4
mac-arm: disable Sp, fermion-reps, gparity for faster dev builds
...
Reduces compile time significantly by skipping representations not
needed for the staggered HDCG work.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-05-27 16:19:28 -04:00
Thomas Blum
1284acf37a
Merge remote-tracking branch 'origin/develop' into feature/staggered-hdcg
2026-05-27 16:19:19 -04:00
Peter Boyle
b0c99f876e
Configure on mac update
2026-05-27 16:16:55 -04:00
Thomas Blum
520b90259d
Add staggered HDCG multigrid test and mac-arm Homebrew build scripts
...
Test_staggered_hdcg.cc implements a two-level ADEF2 multigrid solver for
NaiveStaggeredFermion using SchurStaggeredOperator, following the mrhs
hermitian multigrid approach of arXiv:2409.03904. Uses a 33-point coarse
stencil (NextToNearestStencilGeometry4D) with nbasis=24, block={4,4,4,4},
and Chebyshev subspace generation with hi=5.0 (lambda_max ~4.6).
Also adds systems/mac-arm/sourceme-homebrew.sh and config-command-homebrew
for building Grid on Apple Silicon with Homebrew-installed dependencies.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com >
2026-05-27 15:52:49 -04:00
Peter Boyle
b58a1508fa
Perlmutter cuda version update
2026-05-21 13:25:13 -07:00
Peter Boyle
6140ac6864
Hip Happy
2026-05-15 12:13:01 -04:00
Peter Boyle
856545a1db
Support ROCM 7.0.2
2026-05-15 11:30:29 -04:00
Peter Boyle
b37390bb5a
4 node usqcd run
2026-04-27 14:40:11 -07:00
Peter Boyle
829dc8cceb
32 node
2026-04-27 14:38:02 -07:00
Peter Boyle
13cc2c39f5
FOM run
2026-04-27 14:20:49 -07:00
Peter Boyle
d293b58a20
384 node baseline run
2026-04-27 13:54:40 -07:00
Peter Boyle
e4404efe5a
Perlmutter compile update
2026-04-27 13:53:28 -07:00
paboyle
c54d87a472
Aurora compile fix for new compiler
2025-11-06 18:17:33 +00:00
Peter Boyle
fe0db53842
FFT offload to GPU and MUCH faster comms.
...
40x speed up on Frontier
2025-08-21 16:45:38 -04:00
paboyle
9e6a4a4737
Assertion updates to macros (mostly) with backtrace.
...
WIlson flow to include options for DBW2, Iwasaki, Symanzik.
View logging for data assurance
2025-08-07 15:48:38 +00:00
paboyle
41f344bbd3
Merge with Christoph GPT checksum debug
2025-07-15 03:06:09 +00:00
paboyle
a77cd50b2f
Update comms logging in Cshift
2025-07-11 14:36:10 +00:00
Peter Boyle
fce6e1f135
Kill core files for quota reasons
2025-06-13 05:08:15 +02:00
Peter Boyle
9203126aa5
Scripts
2025-06-11 15:30:16 +02:00
Peter Boyle
f90ba4712a
Update for Jupiter
2025-06-11 15:24:34 +02:00
Peter Boyle
dc546aaa4b
Updated config options for BNL cluster
2025-05-13 18:44:47 -04:00
Peter Boyle
d60a80c098
Fixes and visualisation
2025-04-29 18:04:23 -04:00
Peter Boyle
677b4cc5b0
Make all tests compile
2025-04-24 20:33:26 -04:00
Peter Boyle
be565ffab6
update mac config command
2025-04-24 14:50:06 -04:00
Peter Boyle
a49fa3f8d0
ROCM 6.3.1 appears to work
2025-04-07 11:50:59 -04:00
Peter Boyle
cd452a2f91
Slurm update
2025-04-04 18:40:20 -04:00
Peter Boyle
938c47480f
Updated compile on frontier.
...
Unsatisfactory hacsk
2025-04-04 18:35:06 -04:00
Peter Boyle
0c3cb60135
Script update
2025-04-04 18:35:05 -04:00
Peter Boyle
882a217074
Example of Useful prerequisite installs with spack
2025-03-26 11:28:53 -04:00
Peter Boyle
3d014864e2
Makinig LLVM happy
2025-03-06 14:19:25 -05:00
Peter Boyle
a1cdda833f
Update WorkArounds.txt
2025-03-05 14:04:23 -05:00
Peter Boyle
ad6db92690
Update WorkArounds.txt
2025-03-05 14:00:26 -05:00
Peter Boyle
e8ff9d8e50
Update WorkArounds.txt
2025-03-05 14:00:04 -05:00
Peter Boyle
795769c636
Update WorkArounds.txt
2025-03-05 13:50:41 -05:00
Peter Boyle
267a39d943
Update WorkArounds.txt
2025-03-05 13:49:43 -05:00
Peter Boyle
3624bd3d22
Update WorkArounds.txt
2025-03-05 13:45:09 -05:00
Peter Boyle
bc12dbbb38
Update WorkArounds.txt
2025-03-05 12:48:56 -05:00
Peter Boyle
eb8a008a8f
Create WorkArounds.txt
2025-03-05 12:41:59 -05:00
paboyle
c4d9aa1a21
Config command that makes GPT happier
2025-02-27 20:12:49 +00:00
paboyle
0baaddbe98
Pipeline mode commit on Aurora. 5+ TF/s on 16^3x32 per tile at 384
...
nodes.
More concurrency/fine grained scheduling is possible.
2025-02-04 19:27:26 +00:00
paboyle
b50fb34e71
Perf on Aurora
2025-02-01 18:39:34 +00:00
paboyle
de84d730ff
Fastest run config on Aurora to date
2025-02-01 18:08:40 +00:00
paboyle
c4fc972fec
Merge branch 'feature/deprecate-uvm' into develop
2025-01-31 16:32:36 +00:00
paboyle
8cf809e231
Best results on Aurora so far
2025-01-31 16:14:45 +00:00
paboyle
94019a922e
Significantly better performance on Aurora without using pipeline mode
2025-01-30 16:36:46 +00:00
paboyle
d6b2727f86
Pipeline mode getting better -- 2 nodes @ 10TF/s per node on Aurora
2025-01-29 09:22:21 +00:00
paboyle
74a4f43946
Optional host buffer bounce for no CUDA aware MPI
2025-01-28 15:22:46 +00:00
paboyle
1caf8b0f86
Rename
2025-01-28 15:22:37 +00:00