mirror of https://github.com/paboyle/Grid.git synced 2026-04-19 02:01:02 +01:00
Commit Graph

286 Commits

Author SHA1 Message Date
nmeyer-ur 337d9dc043 move barrier in Benchmark_wilson 2020-07-08 08:13:40 +02:00
nmeyer-ur 8726e94ea7 merge upstream develop 2020-07-07 20:26:47 +02:00
nmeyer-ur a87e45ba25 SVE readme update 2020-06-18 11:23:08 +02:00
Peter Boyle cdf0a04fc5 Merge branch 'develop' into sycl 2020-06-09 04:00:12 -04:00
Peter Boyle 14fcd0912a Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl 2020-06-05 19:14:17 -04:00
Peter Boyle 3111c0bd4f Single precision hardwire 2020-06-05 19:13:27 -04:00
Peter Boyle 1a4c8c3387 Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes. 2020-06-05 18:52:35 -04:00
nmeyer-ur 5f52804907 update calculation of data 2020-05-30 10:55:17 +02:00
nmeyer-ur 936071773e correct throughput in wilson and dwf 2020-05-29 22:15:59 +02:00
nmeyer-ur 1732f9319e more mods; counters seem to work correctly 2020-05-29 18:44:00 +02:00
nmeyer-ur 5cb3530c34 enable counters in Benchmark_wilson 2020-05-29 15:44:52 +02:00
Peter Boyle 006cc8a8f1 Staggered move to accelerator 2020-05-28 08:33:06 -04:00
Peter Boyle cf2938688a Sycl unhappy fix 2020-05-25 08:36:53 -07:00
Peter Boyle a7abda89e2 View location & access mode 2020-05-21 16:13:59 -04:00
nmeyer-ur 015d8bb38a introduced assertions in Benchmark_wilson, removed data output from Benchmark_dwf 2020-05-15 09:15:50 +02:00
Peter Boyle ea08f193e7 Allocator cache split into large/small pools 2020-05-10 05:24:26 -04:00
Peter Boyle ee1de82a53 Working ITT benchmark again 2020-05-08 18:54:50 -04:00
Peter Boyle 2b576fc185 Comment dead code remove 2020-05-08 18:54:29 -04:00
Peter Boyle 6859a3e1d4 Schur operator 2020-05-08 09:19:12 -04:00
Peter Boyle 28a1fcaaff First compile against SYCL 2020-05-05 11:13:27 -07:00
u37294 59c51d2c35 Make compile if HAVE_LIME=0 2020-05-04 10:26:20 -07:00
nils meyer 64b72fc17f testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only 2020-04-19 01:25:40 +02:00
Peter Boyle e279b2be29 Merge develop 2019-08-14 23:01:59 +01:00
Peter Boyle 48e6efc7c9 Merge branch 'develop' into feature/gpu-port. Conflicts: Grid/qcd/action/fermion/WilsonKernelsAsm.cc, Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h, Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h, benchmarks/Benchmark_comms.cc 2019-08-14 18:56:54 +01:00
Peter Boyle 263dcbabab Simplify the comms benchmark 2019-07-30 22:51:04 +01:00
Peter Boyle d85dcc72df Multinode fix 2019-07-20 07:13:28 +01:00
Peter Boyle 0561c2edeb Benchmarks modified for new GPU constructs 2019-06-15 12:52:56 +01:00
Peter Boyle 3e41b1055c Remove Gpu only kernels. 2019-06-09 11:20:01 +01:00
Peter Boyle da8d87e9da Cuda switch off 2019-06-08 17:11:38 +01:00
Peter Boyle 6d77941990 Drop the 5D vec actions 2019-06-08 13:38:05 +01:00
Peter Boyle 47c063f984 Remove Ls Vec cases from benchmarks 2019-06-04 20:45:35 +01:00
Peter Boyle ee6f96d85c Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop: Cayley fermion functions for GPUs 2019-05-18 19:06:20 +01:00
Peter Boyle 4e9df9e93c GPU patches 2019-05-18 17:43:11 +01:00
gfilaci e3c56fd9b3 CayleyZeroCounters before benchmark loop 2019-05-13 15:52:00 +01:00
gfilaci d9438627d9 M5D benchmark without vector copy overhead 2019-05-02 11:10:57 +01:00
gfilaci 6da9aa9971 replace std::vector with Vector in benchmark 2019-05-02 10:56:22 +01:00
gfilaci b52fa38f8c seed initialisation of RNG5 2019-05-02 10:36:09 +01:00
Peter Boyle c43a2b599a GPU support 2019-01-01 15:07:29 +00:00
Peter Boyle b57a4d32aa Merge branch 'develop' into feature/gpu-port 2018-12-13 05:11:34 +00:00
portelli 0ba3d469c7 Benchmark IO in single and double precision 2018-10-17 20:27:34 +01:00
portelli 291bc2a1f0 IO benchmark on a list of directories 2018-10-15 17:25:08 +01:00
Peter Boyle adbdc4e65b Half comms not working on GPU yet, so disable. 2018-09-11 05:15:22 +01:00
Peter Boyle f4bfeb835d Drop back to smaller Ls 2018-09-09 14:25:06 +01:00
portelli a15a2dfd29 Merge branch 'develop' into feature/hadrons 2018-08-10 16:08:22 +01:00
paboyle 27cdb79063 Sha used to seed from a unique string 2018-08-10 15:11:01 +01:00
Peter Boyle 00b92a91b5 Optimising 2018-07-28 23:46:22 +01:00
paboyle 65533741f7 7 moms 2018-07-28 16:17:47 +01:00
Peter Boyle 131a6785d4 Merge branch 'feature/hadrons-a2a' into feature/hadrons-a2a 2018-07-27 23:03:42 +01:00
paboyle 44f4f5c8e2 Momentum loop 2018-07-27 23:00:16 +01:00
fionnoh 2679df034f Changes to meson field benchmark. Now includes the gammas in the final part of the naive method, both methods compute lhs^dag*Gamma*rhs (previously Gamma*lhs^dag*rhs), and checks results. 2018-07-27 18:31:10 +01:00