|
066544281f
|
Deprecate UVM
|
2024-09-17 13:34:27 +00:00 |
|
Peter Boyle
|
cd5cf6d614
|
Tracing replaces self timing hooks
|
2022-08-31 17:33:41 -04:00 |
|
Peter Boyle
|
ba7e371b90
|
Warning free compile on Tursa.
Hopefully got all reqd virtual dtors
|
2021-10-21 19:56:52 +01:00 |
|
Peter Boyle
|
75030637cc
|
Improved comms benchmark, same as Benchmark_comms_host_device
|
2021-08-10 05:16:30 -07:00 |
|
Peter Boyle
|
fe5aaf7677
|
Make comms benchmark same as Benchmark_comms_host_device
|
2021-08-09 04:06:30 -07:00 |
|
Peter Boyle
|
147dc15d26
|
Update
|
2020-11-20 13:13:59 -05:00 |
|
Peter Boyle
|
3aab983760
|
Flop count set as in DiRAC-ITT-2020 (mistakenly 20% low, but must maintain consistency)
|
2020-11-16 17:13:58 +01:00 |
|
Peter Boyle
|
9c4dcc5ea3
|
Merge branch 'master' into develop
|
2020-11-16 16:34:57 +01:00 |
|
Peter Boyle
|
f48156529b
|
Work on 2,2,2,8 ranks
|
2020-11-13 03:57:58 +01:00 |
|
Peter Boyle
|
41e28015ae
|
Volume divisible guarantee
|
2020-11-07 13:32:16 +01:00 |
|
Peter Boyle
|
3f06209720
|
Pretty print
|
2020-10-13 22:18:51 -04:00 |
|
Peter Boyle
|
992ef6e9fc
|
more runtime
|
2020-10-08 22:19:20 -04:00 |
|
Peter Boyle
|
5f0fe029d2
|
Improve memory benchmarks for GPU (avoid host mem ping pong)
|
2020-10-08 19:51:28 -04:00 |
|
Peter Boyle
|
d201277652
|
Expose Nc as a compile time configure option.
Remove precision option
|
2020-10-07 13:07:00 -04:00 |
|
Peter Boyle
|
35a69a5133
|
SU4 x SU4
|
2020-10-06 21:48:35 -04:00 |
|
Peter Boyle
|
cdf0a04fc5
|
Merge branch 'develop' into sycl
|
2020-06-09 04:00:12 -04:00 |
|
Peter Boyle
|
1a4c8c3387
|
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
|
2020-06-05 18:52:35 -04:00 |
|
Peter Boyle
|
cf2938688a
|
Sycl unhappy fix
|
2020-05-25 08:36:53 -07:00 |
|
Peter Boyle
|
a7abda89e2
|
View location & access mode
|
2020-05-21 16:13:59 -04:00 |
|
Peter Boyle
|
ee1de82a53
|
Working ITT benchmark again
|
2020-05-08 18:54:50 -04:00 |
|
Peter Boyle
|
0561c2edeb
|
Benchmarks modified for new GPU constructs
|
2019-06-15 12:52:56 +01:00 |
|
Peter Boyle
|
47c063f984
|
Remove Ls Vec cases from benchmarks
|
2019-06-04 20:45:35 +01:00 |
|
paboyle
|
aead94e9a7
|
View introduced
|
2018-03-04 16:39:29 +00:00 |
|
paboyle
|
36ea5f6b77
|
gpu friendly coordinates ; no std::vector on GPU
|
2018-02-24 22:20:14 +00:00 |
|
paboyle
|
604c05f4b8
|
parallel_for elimination -> thread_loop
|
2018-01-28 01:01:36 +00:00 |
|
paboyle
|
ce4da83bc2
|
Zero changes, literally
|
2018-01-27 23:51:10 +00:00 |
|
paboyle
|
c4f82e072b
|
_grid becomes private ; use Grid()
|
2018-01-27 00:04:12 +00:00 |
|
paboyle
|
2a4a0e43c1
|
Hide internals
|
2018-01-26 23:08:27 +00:00 |
|
paboyle
|
918c105c57
|
NVCC warning elimination
|
2018-01-24 13:23:59 +00:00 |
|
paboyle
|
d74c21a386
|
Global edit for QCD namespace removal & NAMESPACE macros
|
2018-01-15 09:37:58 +00:00 |
|
paboyle
|
17c5b0f152
|
Patching comparison point
|
2017-09-16 18:18:07 +01:00 |
|
Peter Boyle
|
b331be9101
|
Better reporting
|
2017-08-31 11:32:57 +01:00 |
|
Peter Boyle
|
49c20a9fa8
|
Patch to reporting
|
2017-08-31 11:32:21 +01:00 |
|
paboyle
|
7359df3501
|
Full reporting for benchmark; save robustness factor
|
2017-08-31 10:42:35 +01:00 |
|
Peter Boyle
|
5b9267e88d
|
Cleaner comms benchmark treatment for one node runs
|
2017-08-27 18:24:48 -04:00 |
|
paboyle
|
15fd4003ef
|
Improving presentation of results
|
2017-08-27 13:46:02 +01:00 |
|
paboyle
|
ad89abb018
|
Fix
|
2017-08-25 20:43:37 +01:00 |
|
paboyle
|
80c5bce5bb
|
Merge branch 'develop' into feature/multi-communicator
|
2017-08-25 20:21:26 +01:00 |
|
Peter Boyle
|
d0f3d525d5
|
Optimal block size for KNL
|
2017-08-25 19:33:54 +01:00 |
|
Peter Boyle
|
3a58217405
|
Updated
|
2017-08-25 14:29:53 +01:00 |
|
Peter Boyle
|
c289699d9a
|
updated from cambridge mpi3 shakeout
|
2017-08-25 11:41:01 +01:00 |
|
Peter Boyle
|
c3b1263e75
|
Benchmark prep
|
2017-08-25 09:25:54 +01:00 |
|
paboyle
|
383ca7d392
|
Switch off comms for now until feature/multi-communicator is merged
|
2017-08-20 01:27:48 +01:00 |
|
paboyle
|
a446d95c33
|
Trying to pass TeamCity and Travis
|
2017-08-20 01:10:50 +01:00 |
|
paboyle
|
bfef525ed2
|
New benchmark prep
|
2017-08-19 23:10:12 +01:00 |
|