07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
8c31c065b5
Keep the Vector fixed to protect it from realloc
2020-05-11 17:00:30 -04:00
b1c86900b2
Merge pull request #4 from paboyle/develop
...
merge
2020-05-11 20:59:29 +02:00
bbbee5660d
First compile on HiP
2020-05-10 05:28:09 -04:00
ea08f193e7
Allocator cache split into large/small pools
2020-05-10 05:24:26 -04:00
2bb2c68e15
Separate pools for small and large allocations cache
2020-05-09 22:57:21 -04:00
efe5bc6a3c
Split allocator cache into two pools of different sizes
2020-05-09 22:27:56 -04:00
384da487bd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 18:55:11 -04:00
ee1de82a53
Working ITT benchmark again
2020-05-08 18:54:50 -04:00
2b576fc185
Comment dead code remove
2020-05-08 18:54:29 -04:00
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
b01b7f761a
Merge pull request #283 from DanielRichtmann/feature/minor-fixes
...
Some small fixes
2020-05-08 10:52:03 -04:00
c83471bfd0
Fix missing checkerboards for adj and conjugate
2020-05-08 16:44:03 +02:00
ab0c5d77fb
Correct NonHermitianSchurOperatorBase
2020-05-08 16:44:02 +02:00
779e3c7442
Const-correctness for retrieval routines of GridStopWatch
2020-05-08 16:43:52 +02:00
0c570824f2
Add missing declaration of GridCmdOptionInt
2020-05-08 16:43:51 +02:00
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
1d65e2f62c
Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics
...
Lifted from HDCR setup
2020-05-08 09:20:54 -04:00
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
6859a3e1d4
Schur operator
2020-05-08 09:19:12 -04:00
21ca182c36
Comments remove
2020-05-08 09:18:24 -04:00
053b4dd495
Merge pull request #282 from felixerben/baryon-reversal
...
Baryon reversal
2020-05-07 18:09:17 +01:00
42bb5f0721
assertion
2020-05-07 18:06:12 +01:00
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
a887206413
Merge pull request #281 from felixerben/feature/baryonSpeedup
...
Feature/baryon speedup
2020-05-07 13:41:29 +01:00
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
525418abfb
Merge pull request #273 from lehner/feature/gpt
...
Feature/gpt
2020-05-06 10:10:51 -04:00
5f780806c2
Merge pull request #279 from paboyle/bugfix/nvcc-config
...
configure fix for nvcc with extra arguments as CXX
2020-05-06 10:07:52 -04:00
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
87984ece7d
add Lattice_basis.h
2020-05-06 08:47:18 -04:00
e9b295f967
Synchronize blocking infrastructure with GPT
2020-05-06 08:42:28 -04:00
224cbf0453
Merge pull request #280 from mmphys/bugfix/ET_go_home
...
Bugfix/et go home
2020-05-05 17:56:51 -04:00
c1e57d4357
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
SYCL prep - no sycl just make it compile through DPC++
dpc++ didn't like rdtsc()
Make compile if HAVE_LIME=0
Lime optional
2020-05-05 22:35:04 +01:00
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
6b64727161
disable comments
2020-05-05 05:05:36 -04:00
04863f8f38
debug new AcceleratorView
2020-05-04 16:07:03 -04:00
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
2a1387e992
rankInnerProduct
2020-05-03 17:27:11 -04:00
9bfa51bffb
cleanup comment
2020-05-03 09:12:52 -04:00
38532753f4
interface cleanup
2020-05-03 08:58:32 -04:00
949be9605c
fix pragmas
2020-05-02 16:20:03 -04:00
63cf201ee7
Add AdviseInfrequentUse
2020-05-02 11:38:42 -04:00
c8af498a2a
BinaryIO fix for alternative little-endian format name (used in 96I ensemble)
2020-05-01 03:45:50 -04:00
ddb192bac7
re-work double precision promotion for summit
2020-04-30 16:09:57 -04:00