ferben
|
feb029fb66
|
new utils for baryons
|
2019-07-18 14:24:16 +01:00 |
|
Peter Boyle
|
08904f830e
|
Merge develop
|
2019-07-16 11:59:56 +01:00 |
|
Peter Boyle
|
fa9cd50c5b
|
Merge branch 'develop' into feature/gpu-port
|
2019-07-16 11:55:17 +01:00 |
|
Peter Boyle
|
6179acfda0
|
Put back a call that was required
|
2019-07-14 13:59:54 +01:00 |
|
Peter Boyle
|
07601ac1f5
|
Replace instantiation of Gparity
|
2019-07-12 17:18:12 +01:00 |
|
Peter Boyle
|
705a8098b2
|
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
Conflicts:
Grid/stencil/Stencil.h
|
2019-07-12 17:14:11 +01:00 |
|
Peter Boyle
|
78ebd93281
|
Cuda 9.1 happy
|
2019-07-12 17:11:00 +01:00 |
|
Peter Boyle
|
bd155ca5c0
|
Overlap comms with comput now supported
|
2019-07-12 09:09:40 +01:00 |
|
Felix Erben
|
b7d0cf6751
|
buxfix in diquark sum / baryons
|
2019-07-04 22:06:37 +01:00 |
|
Peter Boyle
|
532e226b22
|
cuda 9.1 fixes
|
2019-07-01 07:29:22 +01:00 |
|
fionnoh
|
67690df3bd
|
Changes nedded to have a current insertion on every second time slice - avoids unnecessary contractions
|
2019-06-28 15:18:28 +08:00 |
|
fionnoh
|
421a0a8a36
|
Changes to A2Autils, A2AMatirx and DiskVector code that is needed for Hadrons 4 quark contraction module
|
2019-06-27 13:45:20 +08:00 |
|
Peter Boyle
|
d7b3efe893
|
Compile fix
|
2019-06-15 17:03:15 +01:00 |
|
Peter Boyle
|
0184719216
|
Change to predicate type
|
2019-06-15 12:52:26 +01:00 |
|
Peter Boyle
|
24202dbc51
|
Thread loop construct change
|
2019-06-15 12:52:07 +01:00 |
|
Peter Boyle
|
d763c303c5
|
Clean acceleerator barrier
|
2019-06-15 12:51:45 +01:00 |
|
Peter Boyle
|
8e394d3bf9
|
New loop construct
|
2019-06-15 12:51:15 +01:00 |
|
Peter Boyle
|
b881d5489b
|
Move SchurDiagTwoKappa to Algorithms
|
2019-06-15 12:50:45 +01:00 |
|
Peter Boyle
|
49f90cc7eb
|
use pragma once
|
2019-06-15 12:45:22 +01:00 |
|
Peter Boyle
|
b77af0210b
|
Thread loop. Probably deprecate this impl
|
2019-06-15 12:44:56 +01:00 |
|
Peter Boyle
|
5254ede2d8
|
New loops. Revisit as accelerator loop in future audit
|
2019-06-15 12:44:29 +01:00 |
|
Peter Boyle
|
16e5d7945e
|
Hard to make 5D vec work with GPU code
|
2019-06-15 12:43:43 +01:00 |
|
Peter Boyle
|
decc99ca76
|
Accelerator version
|
2019-06-15 12:43:00 +01:00 |
|
Peter Boyle
|
464cd65931
|
Still to test this fully
|
2019-06-15 12:35:14 +01:00 |
|
Peter Boyle
|
a1ec2f4723
|
Still to test this routine fully
|
2019-06-15 12:33:55 +01:00 |
|
Peter Boyle
|
ea9662ec85
|
Thread loop changes
|
2019-06-15 09:09:57 +01:00 |
|
Peter Boyle
|
52c74f1cac
|
Thread loop changes
|
2019-06-15 09:08:16 +01:00 |
|
Peter Boyle
|
9a13d2992c
|
lean up
|
2019-06-15 09:05:16 +01:00 |
|
Peter Boyle
|
b0449ae270
|
Thread loop changes
|
2019-06-15 09:04:19 +01:00 |
|
Peter Boyle
|
1299225105
|
Accelerator loop changes
|
2019-06-15 09:03:46 +01:00 |
|
Peter Boyle
|
5925e7f405
|
Thread for changes
|
2019-06-15 09:01:30 +01:00 |
|
Peter Boyle
|
be1fd4930f
|
Template instantiation make happy changes
|
2019-06-15 08:37:34 +01:00 |
|
Peter Boyle
|
377fa5dec1
|
looping construct
|
2019-06-15 08:36:48 +01:00 |
|
Peter Boyle
|
e8b78f596e
|
Looping construct changes
|
2019-06-15 08:35:57 +01:00 |
|
Peter Boyle
|
09720c40cd
|
Coalesced loops
|
2019-06-15 08:35:26 +01:00 |
|
Peter Boyle
|
bb024dd114
|
Loop construct changed
|
2019-06-15 08:30:05 +01:00 |
|
Peter Boyle
|
52456b9ec7
|
New loop construct
|
2019-06-15 08:28:45 +01:00 |
|
Michael Marshall
|
9a8a63467e
|
BC2 now runs. setup() runs twice, which had resulted in doubling up of momenta. Also fixed initialisation of momentum phases.
|
2019-06-12 15:25:59 +01:00 |
|
Peter Boyle
|
36f06555a2
|
Simplify Impl
|
2019-06-09 22:26:27 +01:00 |
|
Peter Boyle
|
d6c0e0756d
|
Remove GPU version
|
2019-06-09 11:23:42 +01:00 |
|
Peter Boyle
|
3e41b1055c
|
Remove Gpu only kernels.
|
2019-06-09 11:20:01 +01:00 |
|
Peter Boyle
|
e78a5e7838
|
ASM instantiation without link errors
|
2019-06-09 01:25:21 +01:00 |
|
Peter Boyle
|
8e3a05d89b
|
Moving the instantiation into a cleaner structure
|
2019-06-08 13:48:33 +01:00 |
|
Peter Boyle
|
c933ac2248
|
Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
accelerator_loop
|
2019-06-08 13:44:27 +01:00 |
|
Peter Boyle
|
ad2c433574
|
Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but doesn't
seem to make a difference. Will revisit this and probably remove the lane parameter from the coalescedRead
|
2019-06-08 13:43:12 +01:00 |
|
Peter Boyle
|
86e7fb6e86
|
Instantiation relocation
|
2019-06-08 13:42:46 +01:00 |
|
Peter Boyle
|
fb91dda7be
|
Hand instantiation moved location
|
2019-06-08 13:42:26 +01:00 |
|
Peter Boyle
|
82cf7bc5ab
|
Move instantiation into fermion/instantiation
|
2019-06-08 13:41:46 +01:00 |
|
Peter Boyle
|
e452cc0a22
|
Move static variables into instantiation .cc file
|
2019-06-08 13:41:20 +01:00 |
|
Peter Boyle
|
4d2b938166
|
Remove explict instantiation from here
|
2019-06-08 13:41:01 +01:00 |
|