1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-06-04 11:14:38 +01:00
Files
Grid/systems/Frontier/benchmarks/Benchmark_usqcd.csv
T
Peter Boyle 42cd9eda71 Some improvements that should have been there if in synch with develop,
and also some staggered hdcg type work
2026-05-29 13:36:57 -04:00

2.1 KiB

1Per node summary table
2L , Wilson, DWF4, Staggered, NaiveStag
38 , 90, 933, 38, 23
412 , 403, 1688, 178, 113
516 , 188, 1647, 449, 295
624 , 947, 1574, 674, 553
732 , 931, 1371, 718, 643
8Memory Bandwidth
9Bytes, GB/s per node
10786432, 40.271620
1112582912, 433.611792
1263700992, 905.374321
13201326592, 1114.979152
14491520000, 1180.241898
15Communications
16Packet bytes, direction, GB/s per node
17GEMM
18 M, N, K, BATCH, GF/s per rank fp64
1916, 8, 16, 4096, 693.316363
2016, 12, 16, 4096, 657.277058
2116, 16, 16, 4096, 711.992616
2232, 8, 32, 4096, 821.084324
2332, 12, 32, 4096, 1279.852719
2432, 16, 32, 4096, 2647.096674
2564, 8, 64, 4096, 2630.192325
2664, 12, 64, 4096, 3338.071321
2764, 16, 64, 4096, 3950.899281
2816, 8, 256, 4096, 1638.362501
2916, 12, 256, 4096, 2377.502234
3016, 16, 256, 4096, 3048.328833
3132, 8, 256, 4096, 2917.384276
3232, 12, 256, 4096, 4103.085151
3332, 16, 256, 4096, 5102.971860
3464, 8, 256, 4096, 3222.258206
3564, 12, 256, 4096, 4619.456391
3664, 16, 256, 4096, 5847.916650
378, 256, 16, 4096, 1728.073337
3812, 256, 16, 4096, 2356.653970
3916, 256, 16, 4096, 2676.876038
408, 256, 32, 4096, 2611.531990
4112, 256, 32, 4096, 3451.573106
4216, 256, 32, 4096, 3966.915301
438, 256, 64, 4096, 3436.248737
4412, 256, 64, 4096, 4539.497945
4516, 256, 64, 4096, 5307.992323
46GEMM
47 M, N, K, BATCH, GF/s per rank fp32
4816, 8, 16, 4096, 499.017445
4916, 12, 16, 4096, 731.543385
5016, 16, 16, 4096, 958.800786
5132, 8, 32, 4096, 1549.813550
5232, 12, 32, 4096, 2147.907502
5332, 16, 32, 4096, 2601.698596
5464, 8, 64, 4096, 3785.446233
5564, 12, 64, 4096, 5116.694843
5664, 16, 64, 4096, 6109.345016
5716, 8, 256, 4096, 1206.627737
5816, 12, 256, 4096, 1809.699599
5916, 16, 256, 4096, 2412.014053
6032, 8, 256, 4096, 2406.114488
6132, 12, 256, 4096, 3605.531907
6232, 16, 256, 4096, 4798.444037
6364, 8, 256, 4096, 4688.711196
6464, 12, 256, 4096, 6990.696301
6564, 16, 256, 4096, 9214.749925
668, 256, 16, 4096, 2596.307289
6712, 256, 16, 4096, 3439.892562
6816, 256, 16, 4096, 3907.201036
698, 256, 32, 4096, 3012.752067
7012, 256, 32, 4096, 3904.217583
7116, 256, 32, 4096, 4599.047092
728, 256, 64, 4096, 3721.999042
7312, 256, 64, 4096, 5098.573927
7416, 256, 64, 4096, 6159.080872