|
5bfa88be85
|
Aurora MPI standalone benchmake and options that work well
|
2024-02-06 16:28:40 +00:00 |
|
Dennis Bollweg
|
5af8da76d7
|
Fix cuda compilation of Lattice_slicesum_gpu.h
|
2024-02-01 18:02:30 -05:00 |
|
Dennis Bollweg
|
b8b9dc952d
|
Async memcpy's and cleanup
|
2024-02-01 17:55:35 -05:00 |
|
Dennis Bollweg
|
79a6ed32d8
|
Use accelerator_for2d and DeviceSegmentedRecude to avoid kernel launch latencies
|
2024-02-01 16:41:03 -05:00 |
|
dbollweg
|
caa5f97723
|
Add sliceSum gpu using cub/hipcub
|
2024-01-31 16:50:06 -05:00 |
|
david clarke
|
4924b3209e
|
projectU3 yields a unitary matrix
|
2024-01-23 14:43:58 -07:00 |
|
david clarke
|
00f24f8765
|
already found some bugs in projection, still needs testing
|
2024-01-22 05:50:16 -07:00 |
|
david clarke
|
f5b3d582b0
|
first attempt at U3 projection
|
2024-01-22 02:49:40 -07:00 |
|
david clarke
|
981c93d67a
|
update Test_fatLinks to accept Naik
|
2024-01-21 21:09:19 -07:00 |
|
david clarke
|
c020b78e02
|
Merge branch 'develop' into hisq_fat_links
|
2024-01-21 20:21:08 -07:00 |
|
|
2a0d75bac2
|
Aurora files
|
2023-12-21 23:20:17 +00:00 |
|
Peter Boyle
|
f48298ad4e
|
Bug fix
|
2023-12-11 20:57:02 -05:00 |
|
root
|
645e47c1ba
|
Config for Ampere Altra ARM
|
2023-12-08 16:17:56 -05:00 |
|
Peter Boyle
|
d1d9827263
|
Integrator logging update
|
2023-12-08 12:14:00 -05:00 |
|
Peter Boyle
|
14643c0aab
|
SDCC benchmarking scripts for A100 nodes and IceLake nodes (AVX512)
|
2023-12-04 15:45:57 -05:00 |
|
Peter Boyle
|
b77a9b8947
|
SDDC compiles starting
|
2023-11-30 14:31:51 -05:00 |
|
Peter Boyle
|
7d077fe493
|
Frontier compiel
|
2023-11-09 13:58:44 -05:00 |
|
david clarke
|
9cd4128833
|
fix naik bug
|
2023-11-03 14:11:38 -06:00 |
|
david clarke
|
c8b17c9526
|
Naik to CShift
|
2023-11-02 12:43:22 -06:00 |
|
david clarke
|
2ae2a81e85
|
attempt to fix Naik
|
2023-10-31 13:54:55 -06:00 |
|
david clarke
|
69c869d345
|
fixed stupid typo
|
2023-10-30 17:41:52 -06:00 |
|
david clarke
|
df9b958c40
|
naik now returns separately
|
2023-10-30 17:40:53 -06:00 |
|
david clarke
|
3d3376d1a3
|
LePage works, trying Naik
|
2023-10-27 16:26:31 -06:00 |
|
Christoph Lehner
|
f2648e94b9
|
getHostPointer added to Lattice
|
2023-10-23 13:47:41 +02:00 |
|
david clarke
|
21ed6ac0f4
|
added floating-point support
|
2023-10-20 13:54:26 -06:00 |
|
david clarke
|
7bb8ab7000
|
improve smearing templating
|
2023-10-20 08:41:02 -06:00 |
|
david clarke
|
2c824c2641
|
Merge branch 'develop' into hisq_fat_links
|
2023-10-17 16:03:59 -06:00 |
|
david clarke
|
391fd9cc6a
|
try lepage term
|
2023-10-17 14:57:15 -06:00 |
|
Peter Boyle
|
51051df62c
|
3GeV run setup
|
2023-10-16 20:49:52 +03:00 |
|
Peter Boyle
|
33097681b9
|
FTHMC compiled and merged to develop
|
2023-10-14 00:42:55 +03:00 |
|
Peter Boyle
|
07e4900218
|
FTHMC commit
|
2023-10-13 18:21:57 +03:00 |
|
Peter Boyle
|
36ab567d67
|
FTHMC 3 Gev
|
2023-10-13 18:21:57 +03:00 |
|
Peter Boyle
|
e19171523b
|
FTHMC Status at lattice conference commit
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
9626a2c7c0
|
Asynch handling
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
e936f5b80b
|
IfGridTensor shorthand
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
ffc0639cb9
|
Running in HMC tests
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
c5b43b322c
|
traceProduct eliminates non-contributing intermediate terms
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
c9c4576237
|
Improved frontier cshift
|
2023-10-13 18:21:56 +03:00 |
|
david clarke
|
bf4369f72d
|
clean up HISQSmear with decltypes
|
2023-10-12 12:41:06 -06:00 |
|
david clarke
|
36600899e2
|
working 7-link; Grid_log; generalShift
|
2023-10-12 11:11:39 -06:00 |
|
david clarke
|
b9c70d156b
|
Merge branch 'develop' into hisq_fat_links
|
2023-10-10 22:44:17 -06:00 |
|
david clarke
|
eb89579fe7
|
Merge remote-tracking branch 'origin/develop' into develop
|
2023-10-10 22:43:51 -06:00 |
|
david clarke
|
0cfd13d18b
|
7-link working
|
2023-10-10 22:41:52 -06:00 |
|
Christoph Lehner
|
e6ed516052
|
merged
|
2023-10-08 09:00:37 +02:00 |
|
Christoph Lehner
|
e2a3dae1f2
|
Option for multiple simultaneous CartesianStencils
|
2023-10-08 08:58:44 +02:00 |
|
Peter Boyle
|
6d0c2de399
|
Deprecate teh PVC directory and make a PVC-OEM generic PVC target with
no queueing system dependency -- just interactive scripts
|
2023-10-03 17:04:20 +00:00 |
|
Peter Boyle
|
7786ea9921
|
Bug fix in script
|
2023-10-03 09:58:44 -07:00 |
|
Peter Boyle
|
d93eac7b1c
|
Performance regressed and is OK in icpx 2023.2
|
2023-10-03 15:53:14 +00:00 |
|
Peter Boyle
|
afc316f501
|
Rename headers
|
2023-10-02 16:25:11 -04:00 |
|
Peter Boyle
|
f14bfd5c1b
|
Relocate sub includes
|
2023-10-02 16:23:38 -04:00 |
|