Christoph Lehner
|
e2fc3a0f04
|
Merge pull request #28 from paboyle/develop
Sync with Upstream
|
2022-03-08 09:58:51 +01:00 |
|
|
451e7972fd
|
Reintroduced explicit inversion of the Clover term in case of the
CompactExpClover because of the open boundary O(a) improvement. Changed
the timing output to GridLogDebug
|
2022-03-07 17:43:33 +00:00 |
|
|
56c089d347
|
Removed leftover comments
|
2022-03-07 16:40:20 +00:00 |
|
|
acf740e44d
|
Merge pull request #1 from FelixPGZiegler/feature/eclover
Feature/eclover
|
2022-03-07 16:25:11 +00:00 |
|
|
182f513404
|
Merge remote-tracking branch 'fjosw/feature/eclover' into feature/eclover
|
2022-03-07 15:22:04 +00:00 |
|
|
d5b2323a57
|
included Cayley-Hamilton exponentiation for the compact Wilson exp clover, bug fix for inverse of exp clover
|
2022-03-07 14:44:24 +00:00 |
|
FelixPGZiegler
|
bad18d4417
|
Merge branch 'paboyle:develop' into feature/eclover
|
2022-03-07 13:54:10 +00:00 |
|
|
d1decee4cc
|
Cleaned up unused variables in Lattice_reduction_gpu.h
|
2022-03-02 16:54:23 +00:00 |
|
|
d4ae71b880
|
sum_gpu_large and sum_gpu templates added.
|
2022-03-02 15:40:18 +00:00 |
|
Peter Boyle
|
e16fc5b2e4
|
Threaded intranode comms transfer - ideally between NUMA domains
|
2022-03-01 11:17:24 -05:00 |
|
Peter Boyle
|
694306f202
|
Configure for mac arm
|
2022-03-01 10:53:44 -05:00 |
|
Peter Boyle
|
9aac1e6d64
|
Merge branch 'develop' into feature/sumd-npr
|
2022-03-01 10:51:38 -05:00 |
|
Peter Boyle
|
3e882f555d
|
Large / small sumD options
|
2022-03-01 08:54:45 -05:00 |
|
|
438caab25f
|
generate_instantiations.sh now correctly produces instantiations for CompactClover variant, redundant instantiations removed.
|
2022-02-27 18:27:18 +00:00 |
|
|
239e2c1ee6
|
tests: wilson clover cg tests now include compact variant as well as
exponential wilson clover operators
|
2022-02-27 18:26:34 +00:00 |
|
|
013dc2ef33
|
tests: core tests for wilson clover and wilson exp clover including
compact version extended/added
|
2022-02-27 18:13:47 +00:00 |
|
Christoph Lehner
|
9616811c3d
|
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt
|
2022-02-24 22:03:05 +01:00 |
|
Christoph Lehner
|
8a3002c03b
|
separate left and right masses for CayleyFermion5D
|
2022-02-24 22:02:56 +01:00 |
|
Mattia Bruno
|
71034f828e
|
attempt to fix broken WilsonExpClover; Compact version still broken will be replaced by F.Joswig
|
2022-02-23 01:02:27 +01:00 |
|
Mattia Bruno
|
11437930c5
|
cleaned up definitions of wilsonclover fermions
|
2022-02-22 10:45:16 +01:00 |
|
Mattia Bruno
|
3d44aa9cb9
|
cleaned up cloverhelpers; fixed test compact_clover which runs
|
2022-02-22 01:10:19 +01:00 |
|
Mattia Bruno
|
2851870d70
|
expClover support via helpers template class
|
2022-02-22 00:05:43 +01:00 |
|
Peter Boyle
|
63dbaeefaa
|
Extra barrier prior to finalize just in case it fixes an issue on Tursa
|
2022-02-16 14:01:43 +00:00 |
|
Peter Boyle
|
e8c187b323
|
SyCL happier?
|
2022-02-15 11:24:38 -05:00 |
|
Peter Boyle
|
0c1618197f
|
Faster intranode MPI works now
|
2022-02-15 08:52:07 -05:00 |
|
Peter Boyle
|
f49d5c2d22
|
Updated scripts for crusher
|
2022-02-14 17:55:16 -05:00 |
|
Peter Boyle
|
a3b022d469
|
Crusher compile
|
2022-02-14 15:09:08 -05:00 |
|
Peter Boyle
|
48772f0976
|
Merge pull request #384 from jdmaia/hip_launchbounds
Changing thread block order and adding launch_bounds
|
2022-02-14 11:08:28 -05:00 |
|
Peter Boyle
|
c322420580
|
Dont instantiate an Nc=3 and non-GP hardwired code for other implementations
|
2022-02-14 16:04:08 +00:00 |
|
Julio Maia
|
86f4e17928
|
Changing thread block order and adding launch_bounds
|
2022-02-07 11:29:37 -06:00 |
|
Peter Boyle
|
215df671be
|
Merge pull request #382 from DanielRichtmann/feature/compact-clover
Compact Clover Fermions
|
2022-02-01 21:45:38 -05:00 |
|
Daniel Richtmann
|
1b6b12589f
|
Get splitting up into implementation and instantiation files correct
|
2022-02-02 00:51:11 +01:00 |
|
Daniel Richtmann
|
3082ab8252
|
Check in compact version of wilson clover fermions
|
2022-02-02 00:50:05 +01:00 |
|
Daniel Richtmann
|
add86cd7f4
|
Abandon ET for clover application, use construct similar to multLink
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
0b6fd20c54
|
Enable memory coalescing in clover term generation
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
e83423fee6
|
Refactor clover to align with other files and prepare for upcoming changes
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
b4f8e87982
|
Have Grid's cli interface understand floats
|
2022-02-01 23:09:06 +01:00 |
|
Peter Boyle
|
135808dcfa
|
Less verbose
|
2021-12-07 16:24:24 -05:00 |
|
Peter Boyle
|
7f7d06d963
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-12-07 09:06:42 -08:00 |
|
Peter Boyle
|
2bf3b4d576
|
Update to reduce memory footpring in benchmark test
|
2021-12-07 09:02:02 -08:00 |
|
RJHudspith
|
0bd83cdbda
|
Fixes for Nc!=3 Nersc IO, Gauge and Gauge_NCxNC compatible with GLU. Trace normalisation changed in places removing explicit threes. Guards against non-su3 tests and tests failing when LIME is not compiled.
|
2021-11-28 21:51:03 +01:00 |
|
Peter Boyle
|
f34d34bd17
|
2 nodes
|
2021-11-22 22:27:16 -05:00 |
|
Peter Boyle
|
e32d5141b4
|
Updated to make MPI reliable still gives good perf, but MPI will be slow
intranode
|
2021-11-22 21:46:31 -05:00 |
|
Peter Boyle
|
6d5277f2d7
|
Update to Spock
|
2021-11-22 20:58:02 -05:00 |
|
Peter Boyle
|
14d82777e0
|
Best modules for spock
|
2021-11-22 20:47:16 -05:00 |
|
Peter Boyle
|
2a4e739513
|
Enable XGMI copy (need to rename nvlink to cover NVLINK/XGMI/XeLink)
|
2021-11-22 20:46:09 -05:00 |
|
Peter Boyle
|
8079dc2a14
|
Cray MPI not working right yet
|
2021-11-22 20:45:44 -05:00 |
|
Peter Boyle
|
6ceb556684
|
Intranode asynch hipMemCopy
|
2021-11-22 20:45:12 -05:00 |
|
Peter Boyle
|
76cde73705
|
HIP improvements on messaging and intranode hipMemCopyAsynch
|
2021-11-22 20:44:39 -05:00 |
|
Peter Boyle
|
cc094366a9
|
Merge pull request #375 from JPRichings/develop
Lattice object ACCcache probe
|
2021-11-09 18:19:32 -05:00 |
|