Daniel Richtmann
0c570824f2
Add missing declaration of GridCmdOptionInt
2020-05-08 16:43:51 +02:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
Peter Boyle
1d65e2f62c
Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics
...
Lifteed from HDCR setup
2020-05-08 09:20:54 -04:00
Peter Boyle
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
Peter Boyle
21ca182c36
Comments remove
2020-05-08 09:18:24 -04:00
nmeyer-ur
ceb8b374da
API change v3
2020-05-08 15:04:44 +02:00
nmeyer-ur
4bc2ad2894
API change v2
2020-05-08 15:00:25 +02:00
nmeyer-ur
798af3e68f
retry changing StoD API
2020-05-08 14:34:59 +02:00
nmeyer-ur
b0ef2367f3
testing alternate call to PrecisionChange
2020-05-08 14:22:44 +02:00
nmeyer-ur
71a7350a85
changed 2nd argument in Reduce to native vector type
2020-05-08 12:26:51 +02:00
nmeyer-ur
6f79369955
trying to get rid of macro definition error
2020-05-08 12:19:24 +02:00
nmeyer-ur
f9cb6b979f
corrected more typos
2020-05-08 12:11:01 +02:00
nmeyer-ur
ed4d9d17f8
corrected type
2020-05-08 12:09:22 +02:00
nmeyer-ur
fbed02690d
some changes in breaking out A64FX: use -DA64FXFIXEDSIZE for fixed size, but also define GEN
2020-05-08 12:05:31 +02:00
nmeyer-ur
39f3ae5b1d
corrected more types
2020-05-08 11:07:14 +02:00
nmeyer-ur
e64bec8c8e
pulled SVE typedefs out of Optimization
2020-05-08 11:04:21 +02:00
nmeyer-ur
0893b4e552
fixed typos in PrecisionChange
2020-05-08 10:59:07 +02:00
nmeyer-ur
92f0f29670
fixed double overloading vecf in Div, corrected typos
2020-05-08 10:57:23 +02:00
nmeyer-ur
48a340a9d1
GEN seems to defined by default -> some fixes applied
2020-05-08 10:47:49 +02:00
nmeyer-ur
f45621109b
placed typedefs in Optimization
2020-05-08 10:41:52 +02:00
nmeyer-ur
32d1a0bbea
added even more debug output
2020-05-08 10:39:26 +02:00
nmeyer-ur
267cce66a1
added more debug output
2020-05-08 10:29:28 +02:00
nmeyer-ur
3417147b11
added real fma, corrected typos in tbls; integrated, must supply A64FXGCC with GEN in configure
2020-05-08 10:20:19 +02:00
nmeyer-ur
b338719bc8
first transition to fixed-size done, excl. Exch; next step: integration
2020-05-07 22:33:28 +02:00
nmeyer-ur
2b81cbe2c2
first attempt to introduce tables using fixed-size; still incomplete
2020-05-07 22:01:19 +02:00
nmeyer-ur
acff9d6ed2
transition to fixed size data types almost done; still incomplete
2020-05-07 21:24:07 +02:00
nmeyer-ur
a306a49788
first mods for fixed size; still incomplete
2020-05-07 19:07:49 +02:00
ferben
42bb5f0721
asserrtion
2020-05-07 18:06:12 +01:00
ferben
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
ferben
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
ferben
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
Christoph Lehner
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
Christoph Lehner
87984ece7d
add Lattice_basis.h
2020-05-06 08:47:18 -04:00
Christoph Lehner
e9b295f967
Synchronize blocking infrastructure with GPT
2020-05-06 08:42:28 -04:00
Michael Marshall
c1e57d4357
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
SYCL prep - no sycl just make it compile through DPC++
dpc++ didn't like rdtsc()
Make compile if HAVE_LIME=0
Lime optional
2020-05-05 22:35:04 +01:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
Christoph Lehner
6b64727161
disable comments
2020-05-05 05:05:36 -04:00
Christoph Lehner
04863f8f38
debug new AcceleratorView
2020-05-04 16:07:03 -04:00
u37294
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
u37294
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
Christoph Lehner
2a1387e992
rankInnerProduct
2020-05-03 17:27:11 -04:00
Christoph Lehner
9bfa51bffb
cleanup comment
2020-05-03 09:12:52 -04:00
Christoph Lehner
38532753f4
interface cleanup
2020-05-03 08:58:32 -04:00
Christoph Lehner
949be9605c
fix pragmas
2020-05-02 16:20:03 -04:00
Christoph Lehner
63cf201ee7
Add AdviseInfrequentUse
2020-05-02 11:38:42 -04:00
Christoph Lehner
c8af498a2a
BinaryIO fix for alternative little-endian format name (used in 96I ensemble)
2020-05-01 03:45:50 -04:00
Christoph Lehner
ddb192bac7
re-work double precision promotion for summit
2020-04-30 16:09:57 -04:00
Michael Marshall
7666300a6f
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
Basis rotate stack passig to GPU reduction
Clean up warning
2020-04-30 20:10:32 +01:00
Michael Marshall
4a4b9e305d
Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0".
2020-04-30 19:40:04 +01:00