Peter Boyle
cf2938688a
Sycl unhappy fix
2020-05-25 08:36:53 -07:00
Peter Boyle
ee63721bad
int unhappiness sycl fix
2020-05-25 08:36:24 -07:00
Peter Boyle
22c5168d70
Sycl happier
2020-05-25 08:35:56 -07:00
Peter Boyle
949ac3cd24
Must avoid non-trivial copy constructors
2020-05-25 08:35:28 -07:00
Peter Boyle
7bc0166c1c
SYCLL maknig happy - must avoid non ttrivial copy constructors
2020-05-25 08:34:19 -07:00
Peter Boyle
cb0d1b3399
hopefullly fix buildd fail
2020-05-24 21:27:00 -04:00
Peter Boyle
d1f1ccc705
HIP changes
2020-05-24 21:18:49 -04:00
Peter Boyle
c7519a237a
Assertions fail on HIP foor unknown reasons - dedbugging
2020-05-24 14:02:47 -04:00
Peter Boyle
32be2b13d3
Updates for HiP
2020-05-24 14:00:55 -04:00
Peter Boyle
92b342a477
Hip reduction too
2020-05-24 13:50:28 -04:00
Peter Boyle
556da86ac3
HIP fp16
2020-05-24 13:41:58 -04:00
Peter Boyle
8285e41574
View location / access mode
2020-05-21 16:14:41 -04:00
Peter Boyle
f999408e92
View locatoin and access mode
2020-05-21 16:14:20 -04:00
Peter Boyle
a7abda89e2
View location & access mode
2020-05-21 16:13:59 -04:00
Peter Boyle
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile tiime option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
Peter Boyle
ebb60330c9
Automatic data motion options beginning
2020-05-17 16:34:25 -04:00
Peter Boyle
a9847aa866
Dependence fix
2020-05-12 20:03:37 -04:00
Peter Boyle
d24d8e8398
Use X-direction as more bits meaningful on CUDA.
...
2^31-1 shoulddd always bee enough for SIMD and thread reduced local volume
e.g. 32*2^31 = 2^36 = (2^9)^4 or 512^4 ias big enough.
Where 32 is gpu_threads * Nsimd = 8*4
2020-05-12 10:35:49 -04:00
Peter Boyle
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
Peter Boyle
8c31c065b5
Keep the Vector fixed to protect it from realloc
2020-05-11 17:00:30 -04:00
Peter Boyle
bbbee5660d
First compiile on HiP
2020-05-10 05:28:09 -04:00
Peter Boyle
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
u37294
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
u37294
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
u37294
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
u37294
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
u37294
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
Peter Boyle
9b2d2d0fc3
Basis rotate stack passig to GPU reduction
2020-04-30 12:31:07 -04:00
Peter Boyle
5011753f4f
Clean up warning
2020-04-30 10:23:48 -04:00
Peter Boyle
dd3ebc2ce4
Slow compile on NVCC switch off conserved current
2020-04-29 08:43:12 -04:00
Peter Boyle
103e7ae2f0
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-29 03:05:36 -04:00
Peter Boyle
29ae5615c0
Seqeuential fix
2020-04-29 03:05:15 -04:00
Peter Boyle
dae820aa96
Merge pull request #277 from mmphys/bugfix/grid-config
...
Bugfix/grid config
2020-04-23 10:26:54 -04:00
Michael Marshall
5daf176f4a
Updated to expose GRID_CXXLD in addition to CXXLD.
...
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
Michael Marshall
e96c86ec14
Make grid-config message more specific for --cxx and --cxxld
2020-04-23 13:10:45 +01:00
Peter Boyle
c2c3cad20d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-23 04:35:42 -04:00
Peter Boyle
edec9ee2e2
Conserved current rewrite done. Zmobius working
2020-04-23 04:34:01 -04:00
Peter Boyle
ed70cce542
Test for 5D DWF obserevables
2020-04-23 04:29:45 -04:00
Michael Marshall
4701201b5f
grid-config: Expose CXXLD (for GPU build) and update help
2020-04-22 18:42:30 +01:00
Peter Boyle
0782b76ed4
Merge pull request #274 from paboyle/feature/zmobius_paramcompute
...
ZMobius parameter computation
2020-04-20 14:39:29 -04:00
Christopher Kelly
0896f2cead
Added missing include guards in bigfloat_double.h
2020-04-20 10:30:38 -04:00
Christopher Kelly
181709bba4
Merge branch 'develop' into feature/zmobius_paramcompute
2020-04-20 09:12:34 -04:00
Peter Boyle
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
...
feature/CGinfo
2020-04-16 11:46:08 -04:00
Peter Boyle
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
...
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
Peter Boyle
3cca10e617
Merge pull request #276 from nils-asmussen/fix/regression_nt
...
fix regression in tests/core/Test_qed.cc
2020-04-16 11:42:39 -04:00
43dc2814dd
fix regression in core/Test_qed.cc
2020-04-15 16:10:15 +01:00
Peter Boyle
f3a8d039a2
Merge branch 'feature/hdcr' into develop
2020-04-10 22:01:52 -04:00
4e864e56c9
develop pull
2020-04-10 17:19:18 +01:00