Peter Boyle
|
0e6fa6f6b8
|
DOn't need the Cshift for the period optimisation
|
2023-10-24 10:56:31 -04:00 |
|
Peter Boyle
|
38b87de53f
|
This works around a stacksize limit on AMD GPU
|
2023-10-24 10:56:07 -04:00 |
|
Peter Boyle
|
aa5047a9e4
|
Faster blockProject blockPromote
|
2023-10-24 10:49:55 -04:00 |
|
Peter Boyle
|
24b6ee0df9
|
M4 file
|
2023-10-24 10:36:48 -04:00 |
|
Peter Boyle
|
1e79cc9cbe
|
Avoid compiler error
|
2023-10-24 10:36:09 -04:00 |
|
Peter Boyle
|
b3925df9c3
|
Verbose on CPU-GPU xfer, remove performance by default
|
2023-10-24 10:25:01 -04:00 |
|
Christoph Lehner
|
f2648e94b9
|
getHostPointer added to Lattice
|
2023-10-23 13:47:41 +02:00 |
|
Peter Boyle
|
351795ac3a
|
Better messaging
|
2023-10-20 19:33:04 -04:00 |
|
Peter Boyle
|
9c9c42d0df
|
Tests on frontier with real speed up . 3.5x on 16^3 at mq=0.01
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
b6ad1bafc7
|
Normal memory SendToRecvFrom asynchronous for use in general stencil
code
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
a5ca40f446
|
Better verbose -- track CPU GPU motion under --log Memory, others go to
debug output stream
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
9ab54c5565
|
Overlap comms & data copy/buffer assembly in Ghost zone exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
4341d96bde
|
Massively sped up coarse grid mult, comms
Save 3ms spend (60% of time !) on cudaMalloc !!
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
5fac47a26d
|
Faster halo exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
e064f17346
|
Faster halo exchange
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
afe10ba2a2
|
More digits
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
7cc3435ba8
|
Imporved General coarsened matrix
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
541772313c
|
Verbosity
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
3747494a09
|
Notify delet public
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
f2b98d0dcc
|
Const safety
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
80471bf762
|
Alternate implementation involving face operations
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
a06f63c110
|
Improved I/O and non-lexico option exposed to SciDAC format
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
0ae4478cd9
|
Checkpoint the subspace and ldop
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
ae4e705e09
|
Use random vec as easier for debug
|
2023-10-20 19:27:13 -04:00 |
|
Peter Boyle
|
f5dcea9dbf
|
Updates for Frontier
|
2023-10-20 19:27:12 -04:00 |
|
david clarke
|
21ed6ac0f4
|
added floating-point support
|
2023-10-20 13:54:26 -06:00 |
|
david clarke
|
7bb8ab7000
|
improve smearing templating
|
2023-10-20 08:41:02 -06:00 |
|
david clarke
|
2c824c2641
|
Merge branch 'develop' into hisq_fat_links
|
2023-10-17 16:03:59 -06:00 |
|
david clarke
|
391fd9cc6a
|
try lepage term
|
2023-10-17 14:57:15 -06:00 |
|
Peter Boyle
|
2207309f8a
|
Spack rules
|
2023-10-16 18:38:24 -04:00 |
|
Peter Boyle
|
51051df62c
|
3GeV run setup
|
2023-10-16 20:49:52 +03:00 |
|
Peter Boyle
|
33097681b9
|
FTHMC compiled and merged to develop
|
2023-10-14 00:42:55 +03:00 |
|
Peter Boyle
|
07e4900218
|
FTHMC commit
|
2023-10-13 18:21:57 +03:00 |
|
Peter Boyle
|
36ab567d67
|
FTHMC 3 Gev
|
2023-10-13 18:21:57 +03:00 |
|
Peter Boyle
|
e19171523b
|
FTHMC Status at lattice conference commit
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
9626a2c7c0
|
Asynch handling
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
e936f5b80b
|
IfGridTensor shorthand
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
ffc0639cb9
|
Running in HMC tests
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
c5b43b322c
|
traceProduct eliminates non-contributing intermediate terms
|
2023-10-13 18:21:56 +03:00 |
|
Peter Boyle
|
c9c4576237
|
Improved frontier cshift
|
2023-10-13 18:21:56 +03:00 |
|
david clarke
|
bf4369f72d
|
clean up HISQSmear with decltypes
|
2023-10-12 12:41:06 -06:00 |
|
david clarke
|
36600899e2
|
working 7-link; Grid_log; generalShift
|
2023-10-12 11:11:39 -06:00 |
|
david clarke
|
b9c70d156b
|
Merge branch 'develop' into hisq_fat_links
|
2023-10-10 22:44:17 -06:00 |
|
david clarke
|
eb89579fe7
|
Merge remote-tracking branch 'origin/develop' into develop
|
2023-10-10 22:43:51 -06:00 |
|
david clarke
|
0cfd13d18b
|
7-link working
|
2023-10-10 22:41:52 -06:00 |
|
Christoph Lehner
|
e6ed516052
|
merged
|
2023-10-08 09:00:37 +02:00 |
|
Christoph Lehner
|
e2a3dae1f2
|
Option for multiple simultaneous CartesianStencils
|
2023-10-08 08:58:44 +02:00 |
|
Peter Boyle
|
2111e7ab5f
|
Run at physical mass
|
2023-10-06 21:20:21 -04:00 |
|
Peter Boyle
|
d29abfdcaf
|
Transfer code to Frontier now
|
2023-10-06 21:03:34 -04:00 |
|
Peter Boyle
|
a751c42cc5
|
Checkpoint restore the setup
|
2023-10-06 21:03:08 -04:00 |
|