|
fdd9b14e82
|
speed up MooeeInvDag for DWF EOFA
|
2019-09-02 14:49:51 +01:00 |
|
|
e66669d300
|
fast MooeeInv for EOFA
|
2019-09-02 14:26:13 +01:00 |
|
|
0efaf3c4fa
|
access M5D coeffs through pointers
|
2019-09-02 11:33:00 +01:00 |
|
|
3ef519aaa4
|
fast MooeeInv
|
2019-09-02 11:18:14 +01:00 |
|
|
e279b2be29
|
Merge develop
|
2019-08-14 23:01:59 +01:00 |
|
|
48e6efc7c9
|
Merge branch 'develop' into feature/gpu-port
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
|
2019-08-14 18:56:54 +01:00 |
|
|
53e3ab4131
|
Fix force term
|
2019-08-11 11:06:13 +01:00 |
|
|
1282e1067f
|
Do the force term on the accelerator too. Needed particularly because comms buffers
are device memory.
|
2019-07-29 22:58:35 +01:00 |
|
|
fe700a183a
|
Getting HMC to run
|
2019-07-26 12:18:29 +01:00 |
|
|
08904f830e
|
Merge develop
|
2019-07-16 11:59:56 +01:00 |
|
|
fa9cd50c5b
|
Merge branch 'develop' into feature/gpu-port
|
2019-07-16 11:55:17 +01:00 |
|
|
6179acfda0
|
Put back a call that was required
|
2019-07-14 13:59:54 +01:00 |
|
|
07601ac1f5
|
Replace instantiation of Gparity
|
2019-07-12 17:18:12 +01:00 |
|
|
705a8098b2
|
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
Conflicts:
Grid/stencil/Stencil.h
|
2019-07-12 17:14:11 +01:00 |
|
|
78ebd93281
|
Cuda 9.1 happy
|
2019-07-12 17:11:00 +01:00 |
|
|
bd155ca5c0
|
Overlap comms with compute now supported
|
2019-07-12 09:09:40 +01:00 |
|
|
532e226b22
|
cuda 9.1 fixes
|
2019-07-01 07:29:22 +01:00 |
|
|
d7b3efe893
|
Compile fix
|
2019-06-15 17:03:15 +01:00 |
|
|
0184719216
|
Change to predicate type
|
2019-06-15 12:52:26 +01:00 |
|
|
24202dbc51
|
Thread loop construct change
|
2019-06-15 12:52:07 +01:00 |
|
|
d763c303c5
|
Clean accelerator barrier
|
2019-06-15 12:51:45 +01:00 |
|
|
8e394d3bf9
|
New loop construct
|
2019-06-15 12:51:15 +01:00 |
|
|
b881d5489b
|
Move SchurDiagTwoKappa to Algorithms
|
2019-06-15 12:50:45 +01:00 |
|
|
49f90cc7eb
|
use pragma once
|
2019-06-15 12:45:22 +01:00 |
|
|
b77af0210b
|
Thread loop. Probably deprecate this impl
|
2019-06-15 12:44:56 +01:00 |
|
|
5254ede2d8
|
New loops. Revisit as accelerator loop in future audit
|
2019-06-15 12:44:29 +01:00 |
|
|
16e5d7945e
|
Hard to make 5D vec work with GPU code
|
2019-06-15 12:43:43 +01:00 |
|
|
decc99ca76
|
Accelerator version
|
2019-06-15 12:43:00 +01:00 |
|
|
464cd65931
|
Still to test this fully
|
2019-06-15 12:35:14 +01:00 |
|
|
a1ec2f4723
|
Still to test this routine fully
|
2019-06-15 12:33:55 +01:00 |
|
|
ea9662ec85
|
Thread loop changes
|
2019-06-15 09:09:57 +01:00 |
|
|
52c74f1cac
|
Thread loop changes
|
2019-06-15 09:08:16 +01:00 |
|
|
9a13d2992c
|
Clean up
|
2019-06-15 09:05:16 +01:00 |
|
|
b0449ae270
|
Thread loop changes
|
2019-06-15 09:04:19 +01:00 |
|
|
1299225105
|
Accelerator loop changes
|
2019-06-15 09:03:46 +01:00 |
|
|
5925e7f405
|
Thread for changes
|
2019-06-15 09:01:30 +01:00 |
|
|
be1fd4930f
|
Template instantiation make happy changes
|
2019-06-15 08:37:34 +01:00 |
|
|
36f06555a2
|
Simplify Impl
|
2019-06-09 22:26:27 +01:00 |
|
|
d6c0e0756d
|
Remove GPU version
|
2019-06-09 11:23:42 +01:00 |
|
|
3e41b1055c
|
Remove GPU-only kernels.
|
2019-06-09 11:20:01 +01:00 |
|
|
e78a5e7838
|
ASM instantiation without link errors
|
2019-06-09 01:25:21 +01:00 |
|
|
8e3a05d89b
|
Moving the instantiation into a cleaner structure
|
2019-06-08 13:48:33 +01:00 |
|
|
c933ac2248
|
Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
accelerator_loop
|
2019-06-08 13:44:27 +01:00 |
|
|
ad2c433574
|
Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but it doesn't
seem to make a difference. Will revisit this and probably remove the lane parameter from coalescedRead
|
2019-06-08 13:43:12 +01:00 |
|
|
86e7fb6e86
|
Instantiation relocation
|
2019-06-08 13:42:46 +01:00 |
|
|
fb91dda7be
|
Hand instantiation moved location
|
2019-06-08 13:42:26 +01:00 |
|
|
82cf7bc5ab
|
Move instantiation into fermion/instantiation
|
2019-06-08 13:41:46 +01:00 |
|
|
e452cc0a22
|
Move static variables into instantiation .cc file
|
2019-06-08 13:41:20 +01:00 |
|
|
4d2b938166
|
Remove explicit instantiation from here
|
2019-06-08 13:41:01 +01:00 |
|
|
10d16ab76c
|
Remove explicit instantiation from here
|
2019-06-08 13:40:32 +01:00 |
|