nmeyer-ur
|
337d9dc043
|
move barrier in Benchmark_wilson
|
2020-07-08 08:13:40 +02:00 |
|
nmeyer-ur
|
8726e94ea7
|
merge upstream develop
|
2020-07-07 20:26:47 +02:00 |
|
nmeyer-ur
|
a87e45ba25
|
SVE readme update
|
2020-06-18 11:23:08 +02:00 |
|
Peter Boyle
|
cdf0a04fc5
|
Merge branch 'develop' into sycl
|
2020-06-09 04:00:12 -04:00 |
|
Peter Boyle
|
14fcd0912a
|
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
|
2020-06-05 19:14:17 -04:00 |
|
Peter Boyle
|
3111c0bd4f
|
Single precisiono hardwire
|
2020-06-05 19:13:27 -04:00 |
|
Peter Boyle
|
1a4c8c3387
|
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
|
2020-06-05 18:52:35 -04:00 |
|
nmeyer-ur
|
5f52804907
|
update calculation of data
|
2020-05-30 10:55:17 +02:00 |
|
nmeyer-ur
|
936071773e
|
correct throughput in wilson and dwf
|
2020-05-29 22:15:59 +02:00 |
|
nmeyer-ur
|
1732f9319e
|
more mods; counters seem to work correctly
|
2020-05-29 18:44:00 +02:00 |
|
nmeyer-ur
|
5cb3530c34
|
enable counters in Benchmark_wilson
|
2020-05-29 15:44:52 +02:00 |
|
Peter Boyle
|
006cc8a8f1
|
Staggereed move to accelerator
|
2020-05-28 08:33:06 -04:00 |
|
Peter Boyle
|
cf2938688a
|
Sycl unhappy fix
|
2020-05-25 08:36:53 -07:00 |
|
Peter Boyle
|
a7abda89e2
|
View location & access mode
|
2020-05-21 16:13:59 -04:00 |
|
nmeyer-ur
|
015d8bb38a
|
introduced assertions in Benchmark_wilson, removed data output from Benchmark_dwf
|
2020-05-15 09:15:50 +02:00 |
|
Peter Boyle
|
ea08f193e7
|
Allocator cache spliit into large/small pools
|
2020-05-10 05:24:26 -04:00 |
|
Peter Boyle
|
ee1de82a53
|
Working ITT benchmark again
|
2020-05-08 18:54:50 -04:00 |
|
Peter Boyle
|
2b576fc185
|
Comment deadd codde remove
|
2020-05-08 18:54:29 -04:00 |
|
Peter Boyle
|
6859a3e1d4
|
Schur operator
|
2020-05-08 09:19:12 -04:00 |
|
Peter Boyle
|
28a1fcaaff
|
First compile against SYCL
|
2020-05-05 11:13:27 -07:00 |
|
u37294
|
59c51d2c35
|
Make compile if HAVE_LIME=0
|
2020-05-04 10:26:20 -07:00 |
|
nils meyer
|
64b72fc17f
|
testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only
|
2020-04-19 01:25:40 +02:00 |
|
Peter Boyle
|
e279b2be29
|
Merge develop
|
2019-08-14 23:01:59 +01:00 |
|
Peter Boyle
|
48e6efc7c9
|
Merge branch 'develop' into feature/gpu-port
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
|
2019-08-14 18:56:54 +01:00 |
|
Peter Boyle
|
263dcbabab
|
Simplify the comms benchmark
|
2019-07-30 22:51:04 +01:00 |
|
Peter Boyle
|
d85dcc72df
|
Multinode fix
|
2019-07-20 07:13:28 +01:00 |
|
Peter Boyle
|
0561c2edeb
|
Benchmarks modified for new GPU constructs
|
2019-06-15 12:52:56 +01:00 |
|
Peter Boyle
|
3e41b1055c
|
Remove Gpu only kernels.
|
2019-06-09 11:20:01 +01:00 |
|
Peter Boyle
|
da8d87e9da
|
Cuda switch off
|
2019-06-08 17:11:38 +01:00 |
|
Peter Boyle
|
6d77941990
|
Drop the 5D vec actions
|
2019-06-08 13:38:05 +01:00 |
|
Peter Boyle
|
47c063f984
|
Remove Ls Vec cases from benchmarks
|
2019-06-04 20:45:35 +01:00 |
|
Peter Boyle
|
ee6f96d85c
|
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
Cayley fermion functions for GPUs
|
2019-05-18 19:06:20 +01:00 |
|
Peter Boyle
|
4e9df9e93c
|
GPU patches
|
2019-05-18 17:43:11 +01:00 |
|
gfilaci
|
e3c56fd9b3
|
CayleyZeroCounters before benchmark loop
|
2019-05-13 15:52:00 +01:00 |
|
gfilaci
|
d9438627d9
|
M5D benchmark without vector copy overhead
|
2019-05-02 11:10:57 +01:00 |
|
gfilaci
|
6da9aa9971
|
replace std::vector with Vector in benchmark
|
2019-05-02 10:56:22 +01:00 |
|
gfilaci
|
b52fa38f8c
|
seed initialisation of RNG5
|
2019-05-02 10:36:09 +01:00 |
|
Peter Boyle
|
c43a2b599a
|
GPU support
|
2019-01-01 15:07:29 +00:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
|
0ba3d469c7
|
Benchmark IO in single and double precision
|
2018-10-17 20:27:34 +01:00 |
|
|
291bc2a1f0
|
IO benchmark on a list of directories
|
2018-10-15 17:25:08 +01:00 |
|
Peter Boyle
|
adbdc4e65b
|
Half comms not working on GPU yet, so disable.
|
2018-09-11 05:15:22 +01:00 |
|
Peter Boyle
|
f4bfeb835d
|
Drop back to smaller Ls
|
2018-09-09 14:25:06 +01:00 |
|
|
a15a2dfd29
|
Merge branch 'develop' into feature/hadrons
|
2018-08-10 16:08:22 +01:00 |
|
paboyle
|
27cdb79063
|
Sha used to seed from a unique string
|
2018-08-10 15:11:01 +01:00 |
|
Peter Boyle
|
00b92a91b5
|
Optimising
|
2018-07-28 23:46:22 +01:00 |
|
paboyle
|
65533741f7
|
7 moms
|
2018-07-28 16:17:47 +01:00 |
|
Peter Boyle
|
131a6785d4
|
Merge branch 'feature/hadrons-a2a' into feature/hadrons-a2a
|
2018-07-27 23:03:42 +01:00 |
|
paboyle
|
44f4f5c8e2
|
Momentum loop
|
2018-07-27 23:00:16 +01:00 |
|
fionnoh
|
2679df034f
|
Changes to meson field benchmark. Now includes the gammas in the final part of the naive method, both methods compute
lhs^dag*Gamma*rhs (previously Gamma*lhs^dag*rhs), and checks results.
|
2018-07-27 18:31:10 +01:00 |
|