Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
Peter Boyle
1d65e2f62c
Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics
...
Lifted from HDCR setup
2020-05-08 09:20:54 -04:00
Peter Boyle
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
Peter Boyle
6859a3e1d4
Schur operator
2020-05-08 09:19:12 -04:00
Peter Boyle
21ca182c36
Remove comments
2020-05-08 09:18:24 -04:00
053b4dd495
Merge pull request #282 from felixerben/baryon-reversal
...
Baryon reversal
2020-05-07 18:09:17 +01:00
ferben
42bb5f0721
assertion
2020-05-07 18:06:12 +01:00
ferben
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
a887206413
Merge pull request #281 from felixerben/feature/baryonSpeedup
...
Feature/baryon speedup
2020-05-07 13:41:29 +01:00
ferben
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
ferben
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
Peter Boyle
525418abfb
Merge pull request #273 from lehner/feature/gpt
...
Feature/gpt
2020-05-06 10:10:51 -04:00
Peter Boyle
5f780806c2
Merge pull request #279 from paboyle/bugfix/nvcc-config
...
configure fix for nvcc with extra arguments as CXX
2020-05-06 10:07:52 -04:00
Christoph Lehner
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
Christoph Lehner
87984ece7d
add Lattice_basis.h
2020-05-06 08:47:18 -04:00
Christoph Lehner
e9b295f967
Synchronize blocking infrastructure with GPT
2020-05-06 08:42:28 -04:00
Peter Boyle
224cbf0453
Merge pull request #280 from mmphys/bugfix/ET_go_home
...
Bugfix/et go home
2020-05-05 17:56:51 -04:00
Michael Marshall
c1e57d4357
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
SYCL prep - no sycl just make it compile through DPC++
dpc++ didn't like rdtsc()
Make compile if HAVE_LIME=0
Lime optional
2020-05-05 22:35:04 +01:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
Christoph Lehner
6b64727161
disable comments
2020-05-05 05:05:36 -04:00
Christoph Lehner
04863f8f38
debug new AcceleratorView
2020-05-04 16:07:03 -04:00
u37294
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
u37294
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
u37294
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
u37294
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
u37294
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
Christoph Lehner
2a1387e992
rankInnerProduct
2020-05-03 17:27:11 -04:00
Christoph Lehner
9bfa51bffb
cleanup comment
2020-05-03 09:12:52 -04:00
Christoph Lehner
38532753f4
interface cleanup
2020-05-03 08:58:32 -04:00
Christoph Lehner
949be9605c
fix pragmas
2020-05-02 16:20:03 -04:00
Christoph Lehner
63cf201ee7
Add AdviseInfrequentUse
2020-05-02 11:38:42 -04:00
Christoph Lehner
c8af498a2a
BinaryIO fix for alternative little-endian format name (used in 96I ensemble)
2020-05-01 03:45:50 -04:00
Christoph Lehner
ddb192bac7
re-work double precision promotion for summit
2020-04-30 16:09:57 -04:00
Michael Marshall
7666300a6f
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
Basis rotate stack passing to GPU reduction
Clean up warning
2020-04-30 20:10:32 +01:00
Michael Marshall
4a4b9e305d
Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0".
2020-04-30 19:40:04 +01:00
Peter Boyle
9b2d2d0fc3
Basis rotate stack passing to GPU reduction
2020-04-30 12:31:07 -04:00
Peter Boyle
5011753f4f
Clean up warning
2020-04-30 10:23:48 -04:00
Michael Marshall
dbaeefaeef
All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized)
2020-04-30 15:02:51 +01:00
Christopher Kelly
dee96cbf82
Added workaround in configure to still catch the CUDA compiler when nvcc with extra arguments (e.g. -ccbin) is used as CXX
2020-04-29 10:37:11 -04:00
Peter Boyle
dd3ebc2ce4
Slow compile on NVCC; switch off conserved current
2020-04-29 08:43:12 -04:00
Peter Boyle
103e7ae2f0
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-29 03:05:36 -04:00
Peter Boyle
29ae5615c0
Sequential fix
2020-04-29 03:05:15 -04:00
ferben
6240e02619
added assertion to avoid potential infinite loop
2020-04-27 18:50:53 +01:00
ferben
f4033ad8cb
baryon speedup by a factor of 2
2020-04-27 17:46:14 +01:00
Christoph Lehner
f1fe444d4f
blocked precision promotion infrastructure upgrade
2020-04-24 06:27:20 -04:00
Peter Boyle
dae820aa96
Merge pull request #277 from mmphys/bugfix/grid-config
...
Bugfix/grid config
2020-04-23 10:26:54 -04:00
Michael Marshall
5daf176f4a
Updated to expose GRID_CXXLD in addition to CXXLD.
...
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
Michael Marshall
e96c86ec14
Make grid-config message more specific for --cxx and --cxxld
2020-04-23 13:10:45 +01:00
Peter Boyle
c2c3cad20d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-23 04:35:42 -04:00