Peter Boyle
|
f6077f9d48
|
Kernels -> not instantiaed otherwise object ref on GPU
|
2018-03-24 19:33:44 -04:00 |
|
Peter Boyle
|
23c880b009
|
Remove lebesgue order; stick in stencil if need
|
2018-03-22 18:13:41 -04:00 |
|
paboyle
|
3277bda130
|
View introduction to prepare for accelerator offload.
Probably same problem exists for stencil object
|
2018-03-04 16:38:08 +00:00 |
|
paboyle
|
078901278c
|
Coordinate handling gpu friendly
|
2018-02-24 22:22:02 +00:00 |
|
paboyle
|
a308dff410
|
accelerator loop, copy Opt into the GPU
|
2018-02-02 11:34:37 +00:00 |
|
paboyle
|
70e276e1ab
|
parallel_for elimination -> thread_loop
|
2018-01-28 01:01:14 +00:00 |
|
paboyle
|
2d0bcc2606
|
Zero changes, acceleartor on kernels and some thread loop changes
|
2018-01-27 23:47:38 +00:00 |
|
paboyle
|
c4f82e072b
|
_grid becomes private ; use Grid()§
|
2018-01-27 00:04:12 +00:00 |
|
paboyle
|
85771e97e9
|
Hide internal data
|
2018-01-26 23:04:46 +00:00 |
|
paboyle
|
87ee592176
|
Pragma changes and layout and warning elimination for nvcc
|
2018-01-24 13:14:09 +00:00 |
|
paboyle
|
dec39b313d
|
Namespace and format
|
2018-01-14 23:04:37 +00:00 |
|
Vera Guelpers
|
2cfb50cbe5
|
bug fix in sequential insertion of conserved vector current
|
2017-12-08 11:13:39 +00:00 |
|
|
a56e3b40c4
|
Merge branch 'develop' into feature/hadrons
|
2017-08-29 11:03:53 -06:00 |
|
paboyle
|
f68b5de9c8
|
No compile fix on Clang
|
2017-08-25 19:35:21 +01:00 |
|
Peter Boyle
|
c3b1263e75
|
Benchmark prep
|
2017-08-25 09:25:54 +01:00 |
|
paboyle
|
a446d95c33
|
Trying to pass TeamCity and Travis
|
2017-08-20 01:10:50 +01:00 |
|
Lanny91
|
67b34e5789
|
Modified conserved current 5th dimension loop for compatibility with 5D vectorisation.
|
2017-07-31 11:35:01 +01:00 |
|
Peter Boyle
|
14d53e1c9e
|
Threaded MPI calls patches
|
2017-07-29 13:08:10 -04:00 |
|
paboyle
|
54e94360ad
|
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
|
2017-06-24 23:10:24 +01:00 |
|
Lanny91
|
1bd311ba9c
|
Faster sequential conserved current implementation, now compatible with 5D vectorisation & G-parity.
|
2017-06-16 16:43:15 +01:00 |
|
Lanny91
|
41af8c12d7
|
Code cleaning for conserved current contractions. Will now be easier to implement mobius conserved current.
|
2017-06-16 16:38:59 +01:00 |
|
Lanny91
|
5633a2db20
|
Faster implementation of conserved current site contraction. Added 5D vectorised support, but not G-parity.
|
2017-06-12 10:41:02 +01:00 |
|
Lanny91
|
eec79e0a1e
|
Ward Identity test improvements and conserved current bug fixes
|
2017-06-05 11:55:41 +01:00 |
|
Lanny91
|
23135aa58a
|
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
|
2017-05-26 16:00:50 +01:00 |
|
Lanny91
|
ca1077c560
|
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rare_kaon
# Conflicts:
# lib/qcd/action/fermion/WilsonFermion5D.cc
# tests/hadrons/Test_hadrons_rarekaon.cc
|
2017-05-04 16:22:33 +01:00 |
|
Guido Cossu
|
3344788fa1
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-05-01 12:13:56 +01:00 |
|
Lanny91
|
51d84ec057
|
Bugfixes in Wilson 5D sequential conserved current insertion
|
2017-04-28 16:49:14 +01:00 |
|
Peter Boyle
|
99220f6531
|
Fixes and better timing
|
2017-04-26 17:24:11 -04:00 |
|
Lanny91
|
d2003f24f4
|
Corrected incorrect usage of ExtractSlice for conserved current code.
|
2017-04-26 17:25:28 +01:00 |
|
Peter Boyle
|
fd1eb7de13
|
Clean implementation of the exterior faces listing only those points on the boudary
|
2017-04-26 02:34:52 -04:00 |
|
Lanny91
|
44260643f6
|
First conserved current implementation for Wilson fermions only. Not implemented for Gparity or 5D-vectorised Wilson fermions.
|
2017-04-25 18:00:24 +01:00 |
|
paboyle
|
ab66bac4e6
|
Think I'm getting on top of the reduced cost exterior precomputed list of links
|
2017-04-25 08:50:26 +01:00 |
|
paboyle
|
56277a11c8
|
Build a list of whats on the surface
|
2017-04-24 17:06:15 +01:00 |
|
Peter Boyle
|
e3d0e31525
|
Debugged assemply split phase with interior suppression
|
2017-04-23 19:29:27 -04:00 |
|
paboyle
|
b722889234
|
Try a better load balancing loop
|
2017-04-22 19:27:41 +01:00 |
|
paboyle
|
736bf3c866
|
Major rework of stencil. Half precision and MPI3 now working.
|
2017-04-22 11:33:50 +01:00 |
|
paboyle
|
fc4ab9ccd5
|
Working half precision comms
|
2017-04-20 11:20:26 +01:00 |
|
paboyle
|
4a340aa5ca
|
Massive compressor rework to support reduced precision comms
|
2017-04-20 09:28:27 +01:00 |
|
Guido Cossu
|
8c540333d5
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-04-05 14:41:04 +01:00 |
|
paboyle
|
4b17e8eba8
|
Merge branch 'develop' into feature/bgq-asm
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
|
2017-03-28 04:49:30 -04:00 |
|
paboyle
|
18bde08d1b
|
Merge branch 'feature/staggering' into develop
|
2017-03-28 15:25:55 +09:00 |
|
paboyle
|
af230a1fb8
|
Average the time across the whole machine for outliers
|
2017-02-28 17:05:22 -05:00 |
|
paboyle
|
e099dcdae7
|
Merge branch 'develop' into feature/bgq-asm
|
2017-02-23 00:25:29 +00:00 |
|
paboyle
|
4e7ab3166f
|
Refactoring header layout
|
2017-02-22 18:09:33 +00:00 |
|
paboyle
|
3ae92fa2e6
|
Global changes to parallel_for structure.
Move the comms flags to more sensible names
|
2017-02-21 05:24:27 -05:00 |
|
Guido Cossu
|
e0571c872b
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-02-09 16:12:00 +00:00 |
|
paboyle
|
2c246551d0
|
Overlap comms and compute options in wilson kernels
|
2017-02-07 01:37:10 -05:00 |
|
|
a0cfbb6e88
|
Merge branch 'feature/gammas' into feature/hadrons
# Conflicts:
# .gitignore
# lib/qcd/spin/Dirac.cc
# scripts/filelist
|
2017-01-30 09:10:49 -08:00 |
|
|
fad743fbb1
|
Build system sanity check: corrected several headers not in the <Grid/*> format
|
2017-01-26 17:00:41 -08:00 |
|
Guido Cossu
|
17629b8d9e
|
Merge branch 'develop' into feature/hmc_generalise
|
2017-01-25 11:33:53 +00:00 |
|