Peter Boyle
|
413312f9a9
|
Benchmark the halo construction.
THe bye counts are out and should be doubled for SIMD directions
|
2022-10-04 11:12:59 -07:00 |
|
|
4b1997e2f3
|
wilson sweep test
|
2022-05-16 15:58:33 +01:00 |
|
|
8939d5dc73
|
bugfix: eo operator called in correct location
|
2022-05-16 00:28:28 +01:00 |
|
Christoph Lehner
|
e2fc3a0f04
|
Merge pull request #28 from paboyle/develop
Sync with Upstream
|
2022-03-08 09:58:51 +01:00 |
|
Christoph Lehner
|
9616811c3d
|
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt
|
2022-02-24 22:03:05 +01:00 |
|
Christoph Lehner
|
8a3002c03b
|
separate left and right masses for CayleyFermion5D
|
2022-02-24 22:02:56 +01:00 |
|
Peter Boyle
|
135808dcfa
|
Less verbose
|
2021-12-07 16:24:24 -05:00 |
|
Peter Boyle
|
2bf3b4d576
|
Update to reduce memory footpring in benchmark test
|
2021-12-07 09:02:02 -08:00 |
|
Peter Boyle
|
ba7e371b90
|
Warning free compile on Tursa.
Hopefully got all reqd virtual dtors
|
2021-10-21 19:56:52 +01:00 |
|
Peter Boyle
|
8bd70ad8b5
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-09-16 10:22:38 -07:00 |
|
Peter Boyle
|
b4690e6091
|
Adding build basics for different systems
|
2021-09-16 00:00:38 +01:00 |
|
Peter Boyle
|
c7baeb5bae
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-09-14 08:31:11 -07:00 |
|
Peter Boyle
|
361bb8a101
|
Remove half prec comms
|
2021-09-14 15:06:29 +01:00 |
|
Peter Boyle
|
7efdb3cd2b
|
Remove half prec comms
|
2021-09-14 15:06:06 +01:00 |
|
Peter Boyle
|
bcfa9cf068
|
Improvement of output
|
2021-08-28 08:08:15 -07:00 |
|
Peter Boyle
|
75030637cc
|
Improved comms benchmark, same as benchmark_comms_host_device
|
2021-08-10 05:16:30 -07:00 |
|
Peter Boyle
|
fe5aaf7677
|
Make comms benchmark same as Benchmark_comms_host_device
|
2021-08-09 04:06:30 -07:00 |
|
Peter Boyle
|
1eea9d73b9
|
Pass serial RNG around
|
2021-03-03 23:50:01 +01:00 |
|
Peter Boyle
|
cf76741ec6
|
Intel DPCPP Gold happy now (compiles all, runs Benchmark_dwf_fp32 )
|
2020-12-03 03:47:11 -08:00 |
|
Peter Boyle
|
147dc15d26
|
Update
|
2020-11-20 13:13:59 -05:00 |
|
Peter Boyle
|
8fcb392e24
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-11-17 04:51:31 -08:00 |
|
Peter Boyle
|
dd8d70eeff
|
Build without LIME
|
2020-11-17 04:41:15 -08:00 |
|
Peter Boyle
|
3aab983760
|
Flop count set as in DiRAC-ITT-2020 (mistaken 20% low, but must maintain consistency)
|
2020-11-16 17:13:58 +01:00 |
|
Peter Boyle
|
9c4dcc5ea3
|
Merge branch 'master' into develop
|
2020-11-16 16:34:57 +01:00 |
|
Peter Boyle
|
e9bc748828
|
Useful GPU machine benchmark for GDR used to shakeout Booster at Juelich - see slack earlyaccess channel
|
2020-11-13 03:58:34 +01:00 |
|
Peter Boyle
|
f48156529b
|
Work on 2,2,2,8 ranks
|
2020-11-13 03:57:58 +01:00 |
|
Peter Boyle
|
f16c2665f5
|
Host memory explict
|
2020-11-12 20:29:58 +01:00 |
|
Peter Boyle
|
41e28015ae
|
Volume divisible guarantee
|
2020-11-07 13:32:16 +01:00 |
|
Peter Boyle
|
3f06209720
|
Pretty print
|
2020-10-13 22:18:51 -04:00 |
|
|
c2b688abc9
|
Benchmark_IO: reducing max local volume to 32^4
|
2020-10-10 16:52:56 +01:00 |
|
|
b0d61b9687
|
Benchmark_IO cleaner output
|
2020-10-09 21:46:45 +01:00 |
|
|
5f893bf9af
|
Benchmark_IO procurement sizes
|
2020-10-09 21:31:59 +01:00 |
|
|
0e17bd6597
|
I/O benchmark cleanup
|
2020-10-09 20:29:57 +01:00 |
|
|
22caa158cc
|
multi-pass I/O benchmark, with statistic and robustness summary
|
2020-10-09 20:29:40 +01:00 |
|
Peter Boyle
|
992ef6e9fc
|
more runtime
|
2020-10-08 22:19:20 -04:00 |
|
Peter Boyle
|
f32a320bc3
|
Single prec benchmark in double prec compile
|
2020-10-08 19:52:08 -04:00 |
|
Peter Boyle
|
5f0fe029d2
|
Improve meemory benchmarks for GPU (avoid host mem ping pong)
|
2020-10-08 19:51:28 -04:00 |
|
Peter Boyle
|
3f9c427a3a
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-10-07 13:12:57 -04:00 |
|
Peter Boyle
|
d201277652
|
Expose Nc as a compile time configure option.
Remove precision option
|
2020-10-07 13:07:00 -04:00 |
|
|
e22d30f715
|
Merge branch 'develop' into feature/benchmark-io-update
|
2020-10-07 15:56:39 +01:00 |
|
|
1ba25a0d8c
|
more I/O benchmark code cleaning
|
2020-10-07 15:38:41 +01:00 |
|
|
9ba3647bdf
|
script to convert I/O benchmark logs to CSV
|
2020-10-07 15:35:03 +01:00 |
|
|
5ee832f738
|
I/O benchmark code cleaning
|
2020-10-07 15:31:51 +01:00 |
|
Peter Boyle
|
35a69a5133
|
SU4 x SU4
|
2020-10-06 21:48:35 -04:00 |
|
|
acac2d6938
|
standard C/C++ I/O in benchmark
|
2020-10-06 17:57:00 +01:00 |
|
Peter Boyle
|
81441e98f4
|
HIP runs sensible
|
2020-09-16 03:35:03 +01:00 |
|
Peter Boyle
|
8244caff25
|
Remove the asynchronous non-Stencil calls.
|
2020-09-03 18:52:55 -04:00 |
|
Bartosz Kostrzewa
|
a9b92867a8
|
use tabulator
|
2020-08-31 18:41:17 +02:00 |
|
Bartosz Kostrzewa
|
65920faeba
|
correct formatting of Benchmark_wilson_sweep output
|
2020-08-31 18:39:27 +02:00 |
|
nmeyer-ur
|
337d9dc043
|
move barrier in Benchmark_wilson
|
2020-07-08 08:13:40 +02:00 |
|