Peter Boyle
|
c82b164f6b
|
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
|
2022-10-04 17:41:48 -04:00 |
|
Peter Boyle
|
cd5cf6d614
|
Tracing replaces self timing hooks
|
2022-08-31 17:33:41 -04:00 |
|
Peter Boyle
|
58a86c9164
|
SSC mark removal
|
2022-06-01 19:27:06 -04:00 |
|
Peter Boyle
|
7efdb3cd2b
|
Remove half prec comms
|
2021-09-14 15:06:06 +01:00 |
|
Peter Boyle
|
d201277652
|
Expose Nc as a compile time configure option.
Remove precision option
|
2020-10-07 13:07:00 -04:00 |
|
nmeyer-ur
|
8726e94ea7
|
merge upstream develop
|
2020-07-07 20:26:47 +02:00 |
|
Peter Boyle
|
1a4c8c3387
|
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
|
2020-06-05 18:52:35 -04:00 |
|
nmeyer-ur
|
5f52804907
|
update calculation of data
|
2020-05-30 10:55:17 +02:00 |
|
nmeyer-ur
|
936071773e
|
correct throughput in wilson and dwf
|
2020-05-29 22:15:59 +02:00 |
|
Peter Boyle
|
a7abda89e2
|
View location & access mode
|
2020-05-21 16:13:59 -04:00 |
|
nmeyer-ur
|
015d8bb38a
|
introduced assertions in Benchmark_wilson, removed data output from Benchmark_dwf
|
2020-05-15 09:15:50 +02:00 |
|
Peter Boyle
|
28a1fcaaff
|
First compile against SYCL
|
2020-05-05 11:13:27 -07:00 |
|
nils meyer
|
64b72fc17f
|
testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only
|
2020-04-19 01:25:40 +02:00 |
|
Peter Boyle
|
3e41b1055c
|
Remove Gpu only kernels.
|
2019-06-09 11:20:01 +01:00 |
|
Peter Boyle
|
da8d87e9da
|
Cuda switch off
|
2019-06-08 17:11:38 +01:00 |
|
Peter Boyle
|
6d77941990
|
Drop the 5D vec actions
|
2019-06-08 13:38:05 +01:00 |
|
Peter Boyle
|
47c063f984
|
Remove Ls Vec cases from benchmarks
|
2019-06-04 20:45:35 +01:00 |
|
Peter Boyle
|
b57a4d32aa
|
Merge branch 'develop' into feature/gpu-port
|
2018-12-13 05:11:34 +00:00 |
|
Peter Boyle
|
adbdc4e65b
|
Half comms not working on GPU yet, so disable.
|
2018-09-11 05:15:22 +01:00 |
|
Peter Boyle
|
f4bfeb835d
|
Drop back to smaller Ls
|
2018-09-09 14:25:06 +01:00 |
|
paboyle
|
27cdb79063
|
Sha used to seed from a unique string
|
2018-08-10 15:11:01 +01:00 |
|
Peter Boyle
|
21a1710b43
|
Verbose vector length
|
2018-07-23 06:08:39 -04:00 |
|
Peter Boyle
|
4b04ae3611
|
Printing improvement
|
2018-07-05 06:59:38 -04:00 |
|
paboyle
|
25becc9324
|
GPU tweaks for benchmarking; really necessary?
|
2018-06-13 20:26:07 +01:00 |
|
paboyle
|
aead94e9a7
|
View introduced
|
2018-03-04 16:39:29 +00:00 |
|
paboyle
|
36ea5f6b77
|
gpu friendly coordinates ; no std::vector on GPU
|
2018-02-24 22:20:14 +00:00 |
|
paboyle
|
ce4da83bc2
|
Zero changes, literally
|
2018-01-27 23:51:10 +00:00 |
|
paboyle
|
c4f82e072b
|
_grid becomes private ; use Grid()§
|
2018-01-27 00:04:12 +00:00 |
|
paboyle
|
2a4a0e43c1
|
Hide internals
|
2018-01-26 23:08:27 +00:00 |
|
paboyle
|
e7cba358c2
|
Temporary update to reflect the new dropping of std::vector in Lattice
Will update again to hide the internals in an interface
|
2018-01-25 23:31:41 +00:00 |
|
Guido Cossu
|
cff3bae155
|
Adding support for general Nc in the benchmark outputs
|
2018-01-25 13:46:31 +01:00 |
|
paboyle
|
d74c21a386
|
GLobal edit for QCD namespace removal & NAMESPACE macros
|
2018-01-15 09:37:58 +00:00 |
|
Peter Boyle
|
bfb68e6f02
|
Merge pull request #130 from giltirn/gparity-handunroll
Gparity handunroll
|
2017-09-21 10:11:00 +01:00 |
|
Christopher Kelly
|
d36d2fb40d
|
Added ability to override default Ls in Benchmark_dwf
|
2017-08-28 06:53:56 -07:00 |
|
paboyle
|
ae56e556c6
|
finalise issue on new OPA revert
|
2017-08-20 02:53:12 +01:00 |
|
Peter Boyle
|
7d88198387
|
Merge branch 'develop' into feature/multi-communicator
|
2017-08-19 13:03:35 -04:00 |
|
Peter Boyle
|
14d53e1c9e
|
Threaded MPI calls patches
|
2017-07-29 13:08:10 -04:00 |
|
Peter Boyle
|
b73bd151bb
|
Switch off counters by default
|
2017-06-30 10:16:35 +01:00 |
|
Peter Boyle
|
694b305cab
|
Update to reporting
|
2017-06-30 10:16:13 +01:00 |
|
Guido Cossu
|
20999c1370
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-05-05 12:47:17 +01:00 |
|
Peter Boyle
|
945767c6d8
|
More info
|
2017-05-03 20:26:35 -04:00 |
|
Peter Boyle
|
92e364a35f
|
Better reporting in benchmark for MPI3
|
2017-05-03 15:43:36 -04:00 |
|
Guido Cossu
|
4063238943
|
Adding HMC test file example for Mobius + smearing
|
2017-05-01 13:44:00 +01:00 |
|
Guido Cossu
|
3344788fa1
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-05-01 12:13:56 +01:00 |
|
paboyle
|
738c1a11c2
|
longer nloop
|
2017-04-26 08:43:20 +01:00 |
|
paboyle
|
ab66bac4e6
|
Think I'm getting on top of the reduced cost exterior precomputed list of links
|
2017-04-25 08:50:26 +01:00 |
|
paboyle
|
c429ace748
|
Cleaner OpenMP use
|
2017-04-22 20:28:42 +01:00 |
|
Peter Boyle
|
1d1b225497
|
Hand unrolled Nc=3 kernels support split phase compute (on-node, off-node).
|
2017-04-22 09:05:28 -04:00 |
|
paboyle
|
fc4ab9ccd5
|
Working half precision comms
|
2017-04-20 11:20:26 +01:00 |
|
Guido Cossu
|
8c540333d5
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-04-05 14:41:04 +01:00 |
|