f999408e92
View location and access mode
2020-05-21 16:14:20 -04:00
a7abda89e2
View location & access mode
2020-05-21 16:13:59 -04:00
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile-time option: --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
ebb60330c9
Automatic data motion options beginning
2020-05-17 16:34:25 -04:00
a9847aa866
Dependence fix
2020-05-12 20:03:37 -04:00
d24d8e8398
Use X-direction as more bits are meaningful on CUDA.
...
2^31-1 should always be enough for the SIMD- and thread-reduced local volume,
e.g. 32*2^31 = 2^36 = (2^9)^4, or 512^4, is big enough,
where 32 is gpu_threads * Nsimd = 8*4.
2020-05-12 10:35:49 -04:00
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
8c31c065b5
Keep the Vector fixed to protect it from realloc
2020-05-11 17:00:30 -04:00
bbbee5660d
First compile on HIP
2020-05-10 05:28:09 -04:00
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
9b2d2d0fc3
Basis rotate stack passing to GPU reduction
2020-04-30 12:31:07 -04:00
5011753f4f
Clean up warning
2020-04-30 10:23:48 -04:00
dd3ebc2ce4
Slow compile on NVCC; switch off conserved current
2020-04-29 08:43:12 -04:00
103e7ae2f0
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-29 03:05:36 -04:00
29ae5615c0
Sequential fix
2020-04-29 03:05:15 -04:00
dae820aa96
Merge pull request #277 from mmphys/bugfix/grid-config
...
Bugfix/grid config
2020-04-23 10:26:54 -04:00
5daf176f4a
Updated to expose GRID_CXXLD in addition to CXXLD.
...
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
e96c86ec14
Make grid-config message more specific for --cxx and --cxxld
2020-04-23 13:10:45 +01:00
c2c3cad20d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-23 04:35:42 -04:00
edec9ee2e2
Conserved current rewrite done. Zmobius working
2020-04-23 04:34:01 -04:00
ed70cce542
Test for 5D DWF observables
2020-04-23 04:29:45 -04:00
4701201b5f
grid-config: Expose CXXLD (for GPU build) and update help
2020-04-22 18:42:30 +01:00
0782b76ed4
Merge pull request #274 from paboyle/feature/zmobius_paramcompute
...
ZMobius parameter computation
2020-04-20 14:39:29 -04:00
0896f2cead
Added missing include guards in bigfloat_double.h
2020-04-20 10:30:38 -04:00
181709bba4
Merge branch 'develop' into feature/zmobius_paramcompute
2020-04-20 09:12:34 -04:00
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
...
feature/CGinfo
2020-04-16 11:46:08 -04:00
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
...
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
3cca10e617
Merge pull request #276 from nils-asmussen/fix/regression_nt
...
fix regression in tests/core/Test_qed.cc
2020-04-16 11:42:39 -04:00
43dc2814dd
fix regression in core/Test_qed.cc
2020-04-15 16:10:15 +01:00
f3a8d039a2
Merge branch 'feature/hdcr' into develop
2020-04-10 22:01:52 -04:00
4e864e56c9
develop pull
2020-04-10 17:19:18 +01:00
014dbfa464
Compile fix with OpDirAll
2020-04-10 11:57:09 -04:00
3b0e07882f
Adding another form of polynomial
2020-04-10 11:28:33 -04:00
8e81a811d0
Merge branch 'feature/hdcr' into develop
2020-04-10 11:14:49 -04:00
aa13118127
Missing conjugate already fixed in develop
2020-04-10 11:11:24 -04:00
6cdb09c884
Faster copy region
2020-04-10 11:10:52 -04:00
a65bc64f10
Accelerator peek poke
2020-04-10 11:09:59 -04:00
11dec4883c
Don't throw assert
2020-04-10 11:09:11 -04:00
afa458c812
Extra solvers
2020-04-10 11:08:19 -04:00
dc50190b8f
Faster GPU basis rotation
...
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
d671a63e78
Update README.md
2020-04-03 19:52:15 +01:00
6235c7ba98
IPP path fix in configure
2020-03-27 17:23:29 +00:00
7e13724882
removing Hadrons
2020-03-27 12:03:32 +00:00