Peter Boyle
|
d83beaa890
|
Update perlmutter
|
2022-05-24 18:25:00 -07:00 |
|
Peter Boyle
|
f9f05e995b
|
Update perlmutter
|
2022-05-24 18:24:38 -07:00 |
|
Peter Boyle
|
e651b9e7ab
|
Clean up stencil with better intranode Dirichlet / DDHMC support.
14TF/s on a Perlmutter node
|
2022-05-24 18:23:39 -07:00 |
|
Peter Boyle
|
47b4e91473
|
Verbose change
|
2022-05-24 18:19:18 -07:00 |
|
Peter Boyle
|
3f31afa4fc
|
Clean up verbose
|
2022-05-24 18:18:51 -07:00 |
|
Peter Boyle
|
f82ce67624
|
Dirichlet improved
|
2022-05-19 19:17:11 -07:00 |
|
Peter Boyle
|
b52e8ef65a
|
Dirichlet changes
|
2022-05-19 16:45:41 -07:00 |
|
Peter Boyle
|
2594e3c230
|
Dirichlet option
|
2022-05-19 16:45:19 -07:00 |
|
Peter Boyle
|
8cedb45af2
|
Dirichlet BCs
|
2022-05-19 16:45:02 -07:00 |
|
Peter Boyle
|
aa008cbe99
|
Updated for new Dirichlet interface
|
2022-05-19 16:44:39 -07:00 |
|
Peter Boyle
|
6fb6ca5b6b
|
Merge branch 'develop' into feature/dirichlet
|
2022-05-17 09:09:00 -07:00 |
|
Peter Boyle
|
b8ee19691c
|
Updated config for PM
|
2022-05-17 09:08:12 -07:00 |
|
Peter Boyle
|
ef820a26cd
|
Bcopy on crusher compile
|
2022-04-05 16:49:02 -04:00 |
|
Peter Boyle
|
5012adfebf
|
Merge branch 'develop' into feature/dirichlet
|
2022-04-05 16:26:19 -04:00 |
|
Peter Boyle
|
605cf401e1
|
Merge branch 'feature/sumd-npr' into develop
|
2022-03-16 22:43:12 +00:00 |
|
Peter Boyle
|
f99c3660d2
|
Merge branch 'feature/cpu-threaded-smp' into develop
|
2022-03-16 22:07:54 +00:00 |
|
Peter Boyle
|
92a83a9eb3
|
Performance improvement for Tesseract
|
2022-03-16 17:14:36 +00:00 |
|
Peter Boyle
|
b615fa0f35
|
Merge pull request #388 from fjosw/feature/sumd-npr
Feature/sumd npr
|
2022-03-15 09:05:57 -04:00 |
|
Peter Boyle
|
bb5c16b97f
|
New scripts
|
2022-03-03 17:00:37 -05:00 |
|
Peter Boyle
|
0d80eeb545
|
small DDHMC update
|
2022-03-03 16:56:02 -05:00 |
|
|
d1decee4cc
|
Cleaned up unused variables in Lattice_reduction_gpu.h
|
2022-03-02 16:54:23 +00:00 |
|
|
d4ae71b880
|
sum_gpu_large and sum_gpu templates added.
|
2022-03-02 15:40:18 +00:00 |
|
Peter Boyle
|
b0f4eee78b
|
New files
|
2022-03-01 19:09:13 -05:00 |
|
Peter Boyle
|
5340e50427
|
HMC running with new formulation
|
2022-03-01 17:10:25 -05:00 |
|
Peter Boyle
|
e16fc5b2e4
|
Threaded intranode comms transfer - ideally between NUMA domains
|
2022-03-01 11:17:24 -05:00 |
|
Peter Boyle
|
694306f202
|
Configure for mac arm
|
2022-03-01 10:53:44 -05:00 |
|
Peter Boyle
|
9aac1e6d64
|
Merge branch 'develop' into feature/sumd-npr
|
2022-03-01 10:51:38 -05:00 |
|
Peter Boyle
|
3e882f555d
|
Large / small sumD options
|
2022-03-01 08:54:45 -05:00 |
|
Peter Boyle
|
0f1c5b08a1
|
Dirichlet filters running on AMD and now integrated in Fermion op
|
2022-02-23 19:29:28 -05:00 |
|
Peter Boyle
|
70988e43d2
|
Passes multinode Dirichlet test with boundaries at
the node boundary or at the single rank boundary
|
2022-02-23 01:42:14 -05:00 |
|
Peter Boyle
|
aab3bcb46f
|
Dirichlet first cut - wrong answers on dagger multiply.
Struggling to get a compute node so changing systems
|
2022-02-22 19:58:33 +00:00 |
|
Peter Boyle
|
da06d15f73
|
Merge branch 'feature/feature/staggered-comms' into develop
|
2022-02-17 04:58:50 +00:00 |
|
Peter Boyle
|
e8b1251b8c
|
Staggered fix finished
|
2022-02-17 04:51:13 +00:00 |
|
Peter Boyle
|
63dbaeefaa
|
Extra barrier prior to finalize just in case it fixes an issue on Tursa
|
2022-02-16 14:01:43 +00:00 |
|
Peter Boyle
|
e8c187b323
|
SYCL happier?
|
2022-02-15 11:24:38 -05:00 |
|
Peter Boyle
|
fad5a74a4b
|
Bug fix to detection case
|
2022-02-15 10:27:39 -05:00 |
|
Peter Boyle
|
e83f6a6ae9
|
Merge branch 'develop' into feature/feature/staggered-comms
|
2022-02-15 08:52:39 -05:00 |
|
Peter Boyle
|
0c1618197f
|
Faster intranode MPI works now
|
2022-02-15 08:52:07 -05:00 |
|
Peter Boyle
|
f49d5c2d22
|
Updated scripts for crusher
|
2022-02-14 17:55:16 -05:00 |
|
Peter Boyle
|
a3b022d469
|
Crusher compile
|
2022-02-14 15:09:08 -05:00 |
|
Peter Boyle
|
48772f0976
|
Merge pull request #384 from jdmaia/hip_launchbounds
Changing thread block order and adding launch_bounds
|
2022-02-14 11:08:28 -05:00 |
|
Peter Boyle
|
c322420580
|
Don't instantiate an Nc=3 and non-GP hardwired code for other implementations
|
2022-02-14 16:04:08 +00:00 |
|
Azusa Yamaguchi
|
6283d11d50
|
Add a comment line noting the existence of copied data/buffer
|
2022-02-08 15:22:06 +00:00 |
|
Julio Maia
|
86f4e17928
|
Changing thread block order and adding launch_bounds
|
2022-02-07 11:29:37 -06:00 |
|
Peter Boyle
|
6616d5d090
|
Commit
|
2022-02-02 16:38:24 -05:00 |
|
Peter Boyle
|
215df671be
|
Merge pull request #382 from DanielRichtmann/feature/compact-clover
Compact Clover Fermions
|
2022-02-01 21:45:38 -05:00 |
|
Daniel Richtmann
|
1b6b12589f
|
Get splitting up into implementation and instantiation files correct
|
2022-02-02 00:51:11 +01:00 |
|
Daniel Richtmann
|
3082ab8252
|
Check in compact version of wilson clover fermions
|
2022-02-02 00:50:05 +01:00 |
|
Daniel Richtmann
|
add86cd7f4
|
Abandon ET for clover application, use construct similar to multLink
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
0b6fd20c54
|
Enable memory coalescing in clover term generation
|
2022-02-01 23:09:06 +01:00 |
|