Peter Boyle
7440cde92f
No half prec comms; coalesced access on GPU
2021-09-14 05:04:56 +01:00
Peter Boyle
0fc662bb24
Dirac cuda 11.4 happy ; force host for functions accessing mult table
...
ET runs these on host BEFORE lodging result in AST for kernel
2021-09-14 05:00:44 +01:00
Peter Boyle
8195890640
Force MPI over NVLINK
2021-09-14 05:00:17 +01:00
Peter Boyle
4c88104a73
Fix compile warns
2021-09-11 23:08:05 +01:00
Peter Boyle
73b944c152
Drop half prec comms for now.
2021-09-11 23:07:18 +01:00
Peter Boyle
d1b0b7f5c6
Half prec comms dropping
2021-09-11 23:05:40 +01:00
Peter Boyle
381d8797d0
Drop half prec comms for now
2021-09-11 23:05:02 +01:00
Christopher Kelly
d184b8c921
Merge branch 'develop' into gparity_HMC
2021-09-08 06:14:08 -07:00
Christopher Kelly
c92e390b08
Added initial main binary code for 40ID and 48ID Gparity HMC
2021-09-08 09:00:13 -04:00
Peter Boyle
b06526bc1e
Comment update
2021-08-30 21:15:39 -04:00
Peter Boyle
3044419111
Some sample code
2021-08-30 20:32:11 -04:00
Peter Boyle
bcfa9cf068
Improvement of output
2021-08-28 08:08:15 -07:00
Peter Boyle
114920b8de
Some example clean up
2021-08-25 12:24:17 +01:00
Peter Boyle
0d588b95f4
Bug fix to Example_Laplacian test
2021-08-23 23:14:26 +01:00
Peter Boyle
5b3c530aa7
Return value
2021-08-23 15:30:45 +01:00
Peter Boyle
c6a5499c8b
Fail on non-apple
2021-08-22 18:40:55 +01:00
Peter Boyle
ec9c3fe77a
Remove the file
2021-08-22 18:28:39 +01:00
Peter Boyle
6135ad530e
Extra examples / solutions
2021-08-22 18:25:07 +01:00
Peter Boyle
40098424c7
Examples
2021-08-22 14:17:12 +01:00
Peter Boyle
7163b31a26
Examples
2021-08-20 01:15:23 +01:00
Peter Boyle
ffbdd91e0e
Apple happiness
2021-08-20 01:15:00 +01:00
Peter Boyle
5d29e175d8
Typo fix
2021-08-10 18:25:43 +01:00
Peter Boyle
417dbfa257
Fix
2021-08-10 08:55:35 -07:00
peterx.a.boyle
1eda4d8e0b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-08-10 05:41:18 -07:00
peterx.a.boyle
50181f16e5
Level 0 IPC set up
2021-08-10 05:35:15 -07:00
Peter Boyle
75030637cc
Improved comms benchmark, same as benchmark_comms_host_device
2021-08-10 05:16:30 -07:00
Peter Boyle
fe5aaf7677
Make comms benchmark same as Benchmark_comms_host_device
2021-08-09 04:06:30 -07:00
Peter Boyle
80ac2a73ca
Check is wrong (HtoD / DtoH)
2021-08-05 18:33:20 -04:00
Christopher Kelly
5b36a8af54
Added a CshiftLink function to the GaugeImplementations and boundary condition classes that offers a boundary aware C-shift
...
Modified gauge fixing code to use CshiftLink internally such that the steepest descent algorithm is universal
Modified gauge transformation code to use CshiftLink for a universal definition
Improved comprehensibility of Test_fft_gfix and generalized to use either periodic or charge conjugation BCs based on cmdline option
Added cmdline options to Test_fft_gfix to tune alpha and optionally disable the Fourier acceleration tests
2021-07-12 17:13:40 -04:00
d75a66a3e6
test done
2021-07-06 11:42:36 +01:00
fcc4374d7b
i/o done
2021-07-05 14:52:00 +01:00
67c3c16fe5
working test
2021-07-05 14:41:52 +01:00
25e9be50b5
created test file
2021-07-02 15:51:19 +01:00
Christopher Kelly
75a1f85162
Added method to compute and return the Wilson flow energy density over some number of steps
2021-06-30 17:24:00 -04:00
Ed Bennett
323cf6c038
make message consistent with configure script
2021-06-23 17:00:43 +01:00
Peter Boyle
29a22ae603
Simpler SYCL setup
2021-06-22 17:57:20 +00:00
Peter Boyle
403bff1a47
Force reqd subgroup size fo SYCL
2021-06-22 17:56:10 +00:00
Christoph Lehner
c50f27e68b
Make FFT play nice with split grid
2021-06-20 11:34:38 +02:00
Peter Georg
80afacec5b
nvcc: Add -fopenmp to LDFLAGS
2021-06-17 13:05:13 +02:00
Peter Boyle
6cd9224dd7
SYCL comms buffer allocate
2021-06-16 17:10:55 +00:00
Peter Boyle
4bf8196ff1
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-06-15 21:45:36 +00:00
Peter Boyle
4c5440fb06
const happy for sycl
2021-06-15 21:45:07 +00:00
Christopher Kelly
ac4f2d9798
Fixed EOFA approx test square rooting the result inappropriately thus failing when it shouldn't
...
To MDWF+ID GPBC evol main program, added routine to compute the lower bound of the EOFA using the power method with a command line toggle
2021-06-09 09:08:37 -04:00
a269a3d919
Merge pull request #358 from mmphys/feature/serialisation-test
...
Add a ragged std::vector to the serialisation test
2021-06-09 10:16:25 +01:00
Michael Marshall
0c4f585496
Test nested std::vector<grid tensor>
2021-06-08 00:05:35 +01:00
Michael Marshall
33d2df46a0
Merge branch 'develop' into feature/serialisation-test
...
* develop:
Update README.md
removing Travis CI constantly failing due to overtime (no way we can compile Grid on free time anymore)
2021-06-07 23:25:38 +01:00
Michael Marshall
2df308f649
Add a ragged vector to the serialisation tests. NB: Already had nested (regular) std::vector<std::vector<...>>
2021-06-07 23:25:07 +01:00
Peter Boyle
92def28bd3
Update README.md
2021-06-06 04:52:05 -04:00
ca10bfa1c7
removing Travis CI constantly failing due to overtime (no way we can compile Grid on free time anymore)
2021-06-04 11:12:22 +01:00
298a6ec51e
Merge pull request #357 from mmphys/bugfix/ragged
...
Bugfix/ragged Multi-dimensional ragged vectors
2021-06-04 10:34:46 +01:00