Peter Boyle
|
bfa7b69aff
|
Verbose changes
|
2024-04-16 15:42:46 -04:00 |
|
Peter Boyle
|
2aaa959b5f
|
Printing changes
|
2024-04-16 15:41:25 -04:00 |
|
Peter Boyle
|
ce2970b93a
|
Printing changes
|
2024-04-16 15:40:38 -04:00 |
|
Peter Boyle
|
7b76970d10
|
Verbose changes
|
2024-04-16 15:40:10 -04:00 |
|
Peter Boyle
|
9fd41882d2
|
Herm Op update
|
2024-04-16 15:39:27 -04:00 |
|
Peter Boyle
|
ff2ea5de18
|
Update Tensor_traits.h
|
2024-04-11 14:25:45 -04:00 |
|
Peter Boyle
|
5147a42818
|
Updated hdcg
|
2024-04-05 01:05:57 -04:00 |
|
Peter Boyle
|
57552d8ca3
|
Assign from non-lattice made accelerator resident
|
2024-04-05 01:05:12 -04:00 |
|
Peter Boyle
|
13713b2a76
|
Much faster little dirac operator calculation
|
2024-04-05 01:04:40 -04:00 |
|
Peter Boyle
|
36a14e4ee3
|
Best setup and introduce an HDCG refine method
|
2024-04-05 01:03:33 -04:00 |
|
Peter Boyle
|
b4cc788b8c
|
First version used in mrhsHDCG.
Need to consolidate files.
Plan: make this version able to use a virtual base, then absorb Chulwoo's
version once it is proven.
|
2024-04-05 01:02:21 -04:00 |
|
Peter Boyle
|
0f0e7512f3
|
Keep MRHS in a different file
|
2024-04-05 00:59:53 -04:00 |
|
Peter Boyle
|
1196b1a161
|
Less verbose
|
2024-04-05 00:58:58 -04:00 |
|
Peter Boyle
|
2c8c3be9ee
|
Adef2Mrhs
|
2024-04-05 00:57:13 -04:00 |
|
Peter Boyle
|
5b79d51c22
|
Improvements
|
2024-04-01 14:18:40 -04:00 |
|
Peter Boyle
|
da890dc293
|
Verbose changes
|
2024-04-01 14:18:00 -04:00 |
|
Peter Boyle
|
93d0a1e73a
|
HISQ view call
|
2024-04-01 14:16:47 -04:00 |
|
Peter Boyle
|
f0a8c7d045
|
Playing with chebyshevs
|
2024-04-01 14:16:11 -04:00 |
|
Peter Boyle
|
db8793777c
|
Logging/verbose
|
2024-04-01 14:15:41 -04:00 |
|
Peter Boyle
|
c745484e65
|
9.5x speed up version
|
2024-04-01 14:14:30 -04:00 |
|
Peter Boyle
|
da59379612
|
Large reg file for double
|
2024-03-26 17:03:20 +00:00 |
|
Peter Boyle
|
3ef2a41518
|
ifdef guard omitted
|
2024-03-26 14:50:32 +00:00 |
|
Peter Boyle
|
aa96f420c6
|
Accelerator-aware MPI guard on the Unix domain sockets
|
2024-03-26 14:41:25 +00:00 |
|
Peter Boyle
|
49e9e4ed0e
|
Fences
|
2024-03-26 14:14:06 +00:00 |
|
Peter Boyle
|
f7b8163016
|
Deterministic MPI reduce options
|
2024-03-26 14:11:40 +00:00 |
|
Peter Boyle
|
93769eacd3
|
Updated configure for bounce through host
|
2024-03-26 14:10:24 +00:00 |
|
Peter Boyle
|
59b0cc11df
|
Reduce the time in single
|
2024-03-26 00:42:40 +00:00 |
|
Peter Boyle
|
f32c275376
|
Updated config options for MPI not being aware of GPU
|
2024-03-26 00:42:00 +00:00 |
|
Peter Boyle
|
5404fc66ab
|
Merge needs a fence on SYCL
|
2024-03-26 00:38:41 +00:00 |
|
Peter Boyle
|
1f53458af8
|
Options to bounce through a host buffer if
--disable-accelerator-aware-mpi
|
2024-03-26 00:37:19 +00:00 |
|
Peter Boyle
|
434c3e7f1d
|
We have a choice of GET or PUT across NVlink
|
2024-03-25 14:32:44 +00:00 |
|
Peter Boyle
|
500b119f3d
|
Deterministic MPI
|
2024-03-22 15:55:23 +00:00 |
|
Peter Boyle
|
4b87259c1b
|
New config command for sunspot
|
2024-03-22 15:43:49 +00:00 |
|
Peter Boyle
|
503dec34ef
|
This appears working now on Sunspot
|
2024-03-22 15:43:30 +00:00 |
|
Peter Boyle
|
d1e9fe50d2
|
Xor csum for repro testing
|
2024-03-22 15:42:57 +00:00 |
|
Peter Boyle
|
d01e5fa838
|
Improved FlightRecorder
|
2024-03-22 15:42:32 +00:00 |
|
Peter Boyle
|
a477c25e8c
|
Sunspot repro tests
|
2024-03-22 15:42:11 +00:00 |
|
Peter Boyle
|
1bd20cd9e8
|
FlightRecorder
|
2024-03-22 15:40:01 +00:00 |
|
Peter Boyle
|
e49e95b037
|
Upgrade of the Britney test with flight recorder and fast xor checksum
|
2024-03-22 15:39:27 +00:00 |
|
Peter Boyle
|
6f59fed563
|
Flight recorder, resurrecting the "world famous" Britney test
|
2024-03-22 15:32:32 +00:00 |
|
Peter Boyle
|
60b7f6c99d
|
Flight recorder, resurrecting the "world famous" Britney test
|
2024-03-22 15:32:26 +00:00 |
|
Peter Boyle
|
b92dfcc8d3
|
Flight recorder, resurrecting the "world famous" Britney test
|
2024-03-22 15:30:27 +00:00 |
|
Peter Boyle
|
f6fd6dd053
|
Flight recorder, resurrecting the "world famous" Britney test
|
2024-03-22 15:30:01 +00:00 |
|
Peter Boyle
|
79ad567dd5
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2024-03-19 15:43:42 +00:00 |
|
Peter Boyle
|
fab1efb48c
|
More Britney logging improvements
|
2024-03-19 14:36:21 +00:00 |
|
Peter Boyle
|
660eb76d93
|
FFTW from OneAPI
|
2024-03-19 14:28:33 +00:00 |
|
dbollweg
|
461cd045c6
|
sliceSum cleanup
|
2024-03-13 18:18:44 -04:00 |
|
dbollweg
|
fee65d7a75
|
Merge branch 'paboyle:develop' into sycl_slicesum_update
|
2024-03-13 18:06:17 -04:00 |
|
dbollweg
|
31f9971dbf
|
avoid PI_ERROR_OUT_OF_RESOURCES in sycl sliceSum
|
2024-03-13 13:39:26 -04:00 |
|
Peter Boyle
|
62e7bf024a
|
Updated flight logging for Britney test
|
2024-03-12 20:10:04 +00:00 |
|