4f1e66b044
Fixed HMC SU(N) integrator which was causing fields to leave Lie Algebra manifold for N>2
2020-07-04 03:53:06 +01:00
eb470aa6dc
Update to baryon and added comments/fix whitespace
2020-06-29 09:43:01 +01:00
77af9a3ddc
Baryon revert sign
2020-06-26 10:08:42 +01:00
102089798c
BaryonUtils: update to autoView
2020-06-25 16:41:58 +01:00
39cea8b5a7
Merge branch 'develop' into feature/baryon
2020-06-25 16:24:07 +01:00
a65f66d2db
Merge branch 'feature/baryon3pt' into feature/baryon
2020-06-25 16:20:59 +01:00
4ef50ba31f
Baryon speedup
2020-06-23 11:44:20 +01:00
3e97a26f90
BaryonGamm3pt threads -> accelerator
2020-06-23 11:35:32 +01:00
599f28f6ef
Baryon bug fixes
2020-06-23 11:10:26 +01:00
Peter Boyle
b949cf6b12
PeekLocal needs a view to keep thread safe.
...
ALLOCATION_CACHE reenable
2020-06-19 17:13:27 -04:00
Peter Boyle
1aa988b2af
Comms overlap fix UVM case
2020-06-19 01:21:14 -04:00
f0d17d2b49
Added Baryon3pt code
2020-06-12 11:35:52 +01:00
244c003a1b
Updated Baryon code
2020-06-12 11:00:25 +01:00
Peter Boyle
32b2b59be4
Offload
2020-06-10 20:36:26 -04:00
Peter Boyle
86bb0cc24b
Keep on GPU
2020-06-10 20:00:00 -04:00
Peter Boyle
237ce92540
Offload loops
2020-06-10 19:59:11 -04:00
Peter Boyle
fd97f64612
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-10 12:58:13 -04:00
Peter Boyle
8720aecb80
Offload more loops
2020-06-10 12:57:55 -04:00
Peter Boyle
cdf0a04fc5
Merge branch 'develop' into sycl
2020-06-09 04:00:12 -04:00
Peter Boyle
e97f3688db
Fix the HMC issue - kernel was launching asynchronously
2020-06-08 17:01:15 -04:00
nmeyer-ur
433766ac62
revert Add/SubTimesI and prefetching in stencil
...
This reverts commit 9b2699226c7a3ca8d45f843f4f8e4658fa082163.
2020-06-08 12:02:53 +02:00
Peter Boyle
1a4c8c3387
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
2020-06-05 18:52:35 -04:00
nmeyer-ur
9872c76825
introduce AddTimesI and SubTimesI; slight benefit in operators, but < 1%; breaks all other impls
2020-06-03 15:20:13 +02:00
nmeyer-ur
5ee3ea2144
round-up after testing of prefetches in stencil close
2020-06-03 11:58:20 +02:00
nmeyer-ur
91c81cab30
some corrections; compiles on my laptop; untested
2020-05-29 18:19:22 +02:00
nmeyer-ur
38164f8480
include counters in WilsonFermionImplementation.h
2020-05-29 17:59:26 +02:00
nmeyer-ur
f013979791
add counter support in WilsonFermion.h
2020-05-29 17:13:59 +02:00
Peter Boyle
1d252d0922
Accelerator inline
2020-05-28 11:45:25 -04:00
Peter Boyle
006cc8a8f1
Staggered move to accelerator
2020-05-28 08:33:06 -04:00
Peter Boyle
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile time option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
nmeyer-ur
9e085bd04e
guard prevents multiple A64FX build messages
2020-05-20 19:16:30 +02:00
ferben
6c6812a5ca
GB/s output
2020-05-20 12:26:57 +01:00
ferben
1f154fe652
some cleanup in BaryonUtils
2020-05-19 13:48:56 +01:00
ferben
d708c0258d
some cleanup in BaryonUtils
2020-05-19 13:48:00 +01:00
Peter Boyle
2e652431e5
No compile on Summit fix
2020-05-12 18:56:47 -04:00
Peter Boyle
82f71643a4
Remove the norm in MdagM
2020-05-12 17:55:53 -04:00
nmeyer-ur
20d1941a45
enabled asm kernels for fixed-size A64FXFIXEDSIZE
2020-05-12 19:01:12 +02:00
Peter Boyle
bbbee5660d
First compile on HIP
2020-05-10 05:28:09 -04:00
Peter Boyle
2bb2c68e15
Separate pools for small and large allocations cache
2020-05-09 22:57:21 -04:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit into Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
Peter Boyle
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
ferben
42bb5f0721
assertion
2020-05-07 18:06:12 +01:00
ferben
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
ferben
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
ferben
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
Christoph Lehner
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
Peter Boyle
dd3ebc2ce4
Slow compile on NVCC switch off conserved current
2020-04-29 08:43:12 -04:00
ferben
6240e02619
added assertion to avoid potential infinite loop
2020-04-27 18:50:53 +01:00