Peter Boyle
|
a3b022d469
|
Crusher compile
|
2022-02-14 15:09:08 -05:00 |
|
Peter Boyle
|
48772f0976
|
Merge pull request #384 from jdmaia/hip_launchbounds
Changing thread block order and adding launch_bounds
|
2022-02-14 11:08:28 -05:00 |
|
Peter Boyle
|
c322420580
|
Dont instantiate an Nc=3 and non-GP hardwired code for other implementations
|
2022-02-14 16:04:08 +00:00 |
|
Julio Maia
|
86f4e17928
|
Changing thread block order and adding launch_bounds
|
2022-02-07 11:29:37 -06:00 |
|
Peter Boyle
|
215df671be
|
Merge pull request #382 from DanielRichtmann/feature/compact-clover
Compact Clover Fermions
|
2022-02-01 21:45:38 -05:00 |
|
Daniel Richtmann
|
1b6b12589f
|
Get splitting up into implementation and instantiation files correct
|
2022-02-02 00:51:11 +01:00 |
|
Daniel Richtmann
|
3082ab8252
|
Check in compact version of wilson clover fermions
|
2022-02-02 00:50:05 +01:00 |
|
Daniel Richtmann
|
add86cd7f4
|
Abandon ET for clover application, use construct similar to multLink
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
0b6fd20c54
|
Enable memory coalescing in clover term generation
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
e83423fee6
|
Refactor clover to align with other files and prepare for upcoming changes
|
2022-02-01 23:09:06 +01:00 |
|
Daniel Richtmann
|
b4f8e87982
|
Have Grid's cli interface understand floats
|
2022-02-01 23:09:06 +01:00 |
|
Peter Boyle
|
135808dcfa
|
Less verbose
|
2021-12-07 16:24:24 -05:00 |
|
Peter Boyle
|
7f7d06d963
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-12-07 09:06:42 -08:00 |
|
Peter Boyle
|
2bf3b4d576
|
Update to reduce memory footpring in benchmark test
|
2021-12-07 09:02:02 -08:00 |
|
Peter Boyle
|
f34d34bd17
|
2 nodes
|
2021-11-22 22:27:16 -05:00 |
|
Peter Boyle
|
e32d5141b4
|
Updated to make MPI reliable still gives good perf, but MPI will be slow
intranode
|
2021-11-22 21:46:31 -05:00 |
|
Peter Boyle
|
6d5277f2d7
|
Update to Spock
|
2021-11-22 20:58:02 -05:00 |
|
Peter Boyle
|
14d82777e0
|
Best modules for spock
|
2021-11-22 20:47:16 -05:00 |
|
Peter Boyle
|
2a4e739513
|
Enable XGMI copy (need to rename nvlink to cover NVLINK/XGMI/XeLink)
|
2021-11-22 20:46:09 -05:00 |
|
Peter Boyle
|
8079dc2a14
|
Cray MPI not working right yet
|
2021-11-22 20:45:44 -05:00 |
|
Peter Boyle
|
6ceb556684
|
Intranode asynch hipMemCopy
|
2021-11-22 20:45:12 -05:00 |
|
Peter Boyle
|
76cde73705
|
HIP improvements on messaging and intranode hipMemCopyAsynch
|
2021-11-22 20:44:39 -05:00 |
|
Peter Boyle
|
cc094366a9
|
Merge pull request #375 from JPRichings/develop
Lattice object ACCcache probe
|
2021-11-09 18:19:32 -05:00 |
|
|
41a575ff9b
|
Format edit
|
2021-11-09 21:56:23 +00:00 |
|
|
12ef413065
|
fix to deflation.h
|
2021-11-09 21:20:36 +00:00 |
|
|
829a328451
|
remove deflation timing
|
2021-11-09 20:46:57 +00:00 |
|
|
402523c62e
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-11-09 12:57:40 +00:00 |
|
|
d7bef70b5c
|
Helper functions to allow probe of cache state of lattice objects.
|
2021-11-09 12:57:09 +00:00 |
|
|
2ad1811642
|
Added timing to deflation code.
|
2021-11-09 12:33:25 +00:00 |
|
|
a65a497bae
|
Merge branch 'develop' of github.com:paboyle/Grid into develop
|
2021-10-29 13:01:34 +01:00 |
|
|
b27b12828e
|
reverse previous "fix", missing statement was probably intentional, added a comment to that effect
|
2021-10-29 13:01:31 +01:00 |
|
Peter Boyle
|
fe9edf8526
|
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
|
2021-10-29 02:03:27 +01:00 |
|
Peter Boyle
|
44204c7e06
|
Extra code
|
2021-10-29 02:02:56 +01:00 |
|
Peter Boyle
|
33b3789598
|
Merge pull request #364 from AndrewYongZhenNing/develop
CayleyFermion5D Conserved current fix
|
2021-10-27 20:27:20 -04:00 |
|
Peter Boyle
|
195ab2888d
|
Merge branch 'develop' into develop
|
2021-10-27 20:26:57 -04:00 |
|
Peter Boyle
|
85f750d753
|
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
|
2021-10-27 00:28:05 +01:00 |
|
Peter Boyle
|
a4ce6e42c7
|
Warning free compile on make all and make tests under nvcc
|
2021-10-27 00:27:03 +01:00 |
|
Peter Boyle
|
5398b7e7e3
|
Max 128 size
|
2021-10-26 09:16:29 -07:00 |
|
|
fd13a3f2be
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2021-10-26 10:45:46 +01:00 |
|
|
c144b32368
|
deflation timers
|
2021-10-26 10:37:24 +01:00 |
|
Peter Boyle
|
ba7e371b90
|
Warning free compile on Tursa.
Hopefully got all reqd virtual dtors
|
2021-10-21 19:56:52 +01:00 |
|
Peter Boyle
|
99e7a5d18a
|
Merge pull request #371 from edbennett/hmc-documentation-update
update documentation for GenericHMCRunner - thanks
|
2021-10-18 14:36:43 -04:00 |
|
Ed Bennett
|
f824d99059
|
update documentation for GenericHMCRunner
|
2021-10-18 09:50:16 +01:00 |
|
Peter Boyle
|
749b8022a4
|
Linear operator and SparseMatrix virtual destructors
|
2021-10-15 20:47:18 +01:00 |
|
Peter Boyle
|
7e0057d2c4
|
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
|
2021-10-15 20:46:51 +01:00 |
|
Peter Boyle
|
cfe9e870d3
|
Stream
|
2021-10-15 20:46:44 +01:00 |
|
Peter Boyle
|
e9c4f06cbf
|
Merge pull request #370 from fjosw/bugfix/gpu_sum_shm
Error Handling sum_Dgpu large objects
|
2021-10-14 09:12:47 -04:00 |
|
|
1f9688417a
|
Error message added when attempting to sum object which is too large for
the shared memory
|
2021-10-13 20:45:46 +01:00 |
|
Peter Boyle
|
16c2a99965
|
Overlap cudamemcpy - didn't set up stream right
|
2021-10-11 13:31:26 -07:00 |
|
Peter Boyle
|
cda915a345
|
Better options
|
2021-10-07 20:29:09 +01:00 |
|