Peter Boyle
2bb2c68e15
Separate pools for small and large allocations cache
2020-05-09 22:57:21 -04:00
Peter Boyle
efe5bc6a3c
Split allocator cache into two pools of different sizes
2020-05-09 22:27:56 -04:00
Peter Boyle
384da487bd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 18:55:11 -04:00
Peter Boyle
ee1de82a53
Working ITT benchmark again
2020-05-08 18:54:50 -04:00
Peter Boyle
2b576fc185
Comment deadd codde remove
2020-05-08 18:54:29 -04:00
Peter Boyle
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
Peter Boyle
b01b7f761a
Merge pull request #283 from DanielRichtmann/feature/minor-fixes
...
Some small fixes
2020-05-08 10:52:03 -04:00
Daniel Richtmann
c83471bfd0
Fix missing checkerboards for adj und conjugate
2020-05-08 16:44:03 +02:00
Daniel Richtmann
ab0c5d77fb
Correct NonHermitianSchurOperatorBase
2020-05-08 16:44:02 +02:00
Daniel Richtmann
779e3c7442
Const-correctness for retrieval routines of GridStopWatch
2020-05-08 16:43:52 +02:00
Daniel Richtmann
0c570824f2
Add missing declaration of GridCmdOptionInt
2020-05-08 16:43:51 +02:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
Peter Boyle
1d65e2f62c
Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics
...
Lifteed from HDCR setup
2020-05-08 09:20:54 -04:00
Peter Boyle
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
Peter Boyle
6859a3e1d4
Schur operator
2020-05-08 09:19:12 -04:00
Peter Boyle
21ca182c36
Comments remove
2020-05-08 09:18:24 -04:00
053b4dd495
Merge pull request #282 from felixerben/baryon-reversal
...
Baryon reversal
2020-05-07 18:09:17 +01:00
ferben
42bb5f0721
asserrtion
2020-05-07 18:06:12 +01:00
ferben
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
a887206413
Merge pull request #281 from felixerben/feature/baryonSpeedup
...
Feature/baryon speedup
2020-05-07 13:41:29 +01:00
ferben
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
ferben
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
Peter Boyle
525418abfb
Merge pull request #273 from lehner/feature/gpt
...
Feature/gpt
2020-05-06 10:10:51 -04:00
Peter Boyle
5f780806c2
Merge pull request #279 from paboyle/bugfix/nvcc-config
...
configure fix for nvcc with extra arguments as CXX
2020-05-06 10:07:52 -04:00
Christoph Lehner
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
Christoph Lehner
87984ece7d
add Lattice_basis.h
2020-05-06 08:47:18 -04:00
Christoph Lehner
e9b295f967
Synchronize blocking infrastructure with GPT
2020-05-06 08:42:28 -04:00
Peter Boyle
224cbf0453
Merge pull request #280 from mmphys/bugfix/ET_go_home
...
Bugfix/et go home
2020-05-05 17:56:51 -04:00
Michael Marshall
c1e57d4357
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
SYCL prep - no sycl just make it compile through DPC++
dpc++ didn't like rdtsc()
Make compile if HAVE_LIME=0
Lime optional
2020-05-05 22:35:04 +01:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
Christoph Lehner
6b64727161
disable comments
2020-05-05 05:05:36 -04:00
Christoph Lehner
04863f8f38
debug new AcceleratorView
2020-05-04 16:07:03 -04:00
u37294
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
u37294
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
u37294
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
u37294
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
u37294
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
Christoph Lehner
2a1387e992
rankInnerProduct
2020-05-03 17:27:11 -04:00
Christoph Lehner
9bfa51bffb
cleanup comment
2020-05-03 09:12:52 -04:00
Christoph Lehner
38532753f4
interface cleanup
2020-05-03 08:58:32 -04:00
Christoph Lehner
949be9605c
fix pragmas
2020-05-02 16:20:03 -04:00
Christoph Lehner
63cf201ee7
Add AdviseInfrequentUse
2020-05-02 11:38:42 -04:00
Christoph Lehner
c8af498a2a
BinaryIO fix for alternative little-endian format name (used in 96I ensemble)
2020-05-01 03:45:50 -04:00
Christoph Lehner
ddb192bac7
re-work double precision promotion for summit
2020-04-30 16:09:57 -04:00
Michael Marshall
7666300a6f
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
Basis rotate stack passig to GPU reduction
Clean up warning
2020-04-30 20:10:32 +01:00
Michael Marshall
4a4b9e305d
Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0".
2020-04-30 19:40:04 +01:00
Peter Boyle
9b2d2d0fc3
Basis rotate stack passig to GPU reduction
2020-04-30 12:31:07 -04:00
Peter Boyle
5011753f4f
Clean up warning
2020-04-30 10:23:48 -04:00
Michael Marshall
dbaeefaeef
All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized)
2020-04-30 15:02:51 +01:00