|
77af9a3ddc
|
Baryon revert sign
|
2020-06-26 10:08:42 +01:00 |
|
|
102089798c
|
BaryonUtils: update to autoView
|
2020-06-25 16:41:58 +01:00 |
|
|
39cea8b5a7
|
Merge branch 'develop' into feature/baryon
|
2020-06-25 16:24:07 +01:00 |
|
|
a65f66d2db
|
Merge branch 'feature/baryon3pt' into feature/baryon
|
2020-06-25 16:20:59 +01:00 |
|
Peter Boyle
|
936c5ecf69
|
Reduction GPU no compile fix
|
2020-06-24 17:28:31 -04:00 |
|
Peter Boyle
|
22cfbdbbb3
|
Boost precision in inner products in single
|
2020-06-24 12:52:31 -04:00 |
|
Peter Boyle
|
093d1ee21b
|
Force initial values
|
2020-06-24 08:54:49 -04:00 |
|
Peter Boyle
|
d6ba2581ce
|
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
|
2020-06-24 08:25:08 -04:00 |
|
Peter Boyle
|
577c064184
|
Memory manager initialise earlier
|
2020-06-24 08:24:38 -04:00 |
|
Peter Boyle
|
2ff1fa6fad
|
UVM used shared for CPU alloccations andd ddont migrate
|
2020-06-23 22:14:56 -04:00 |
|
Peter Boyle
|
70be1bd8be
|
Adding code under development
|
2020-06-23 10:24:21 -04:00 |
|
|
4ef50ba31f
|
Baryon speedup
|
2020-06-23 11:44:20 +01:00 |
|
|
3e97a26f90
|
BaryonGamm3pt threads -> accelerator
|
2020-06-23 11:35:32 +01:00 |
|
|
599f28f6ef
|
Baryon bug fixes
|
2020-06-23 11:10:26 +01:00 |
|
Peter Boyle
|
c48da35921
|
Memory Vector UVM and Lattice alignedAllocator separate
|
2020-06-22 20:21:53 -04:00 |
|
Peter Boyle
|
6c5fa8dcd8
|
Aligned allocate on CPU put through this interface
|
2020-06-20 14:34:29 -04:00 |
|
Peter Boyle
|
0d2f913a1a
|
String.h for linux
|
2020-06-20 09:37:31 -04:00 |
|
Christoph Lehner
|
5b117865b2
|
Merge pull request #6 from paboyle/sycl
Sycl
|
2020-06-20 09:44:44 +02:00 |
|
Peter Boyle
|
1a74816c25
|
Hopeefully fixed
|
2020-06-19 17:50:52 -04:00 |
|
Peter Boyle
|
73de335256
|
Merge branch 'develop' into sycl
|
2020-06-19 17:44:16 -04:00 |
|
Peter Boyle
|
228fd450ce
|
Typo fix (excusee - my keyboard is starting to break)
|
2020-06-19 17:36:05 -04:00 |
|
Peter Boyle
|
b949cf6b12
|
PeekLocal needs a view to keep thread safe.
ALLOCATION_CACHEE reenable
|
2020-06-19 17:13:27 -04:00 |
|
Peter Boyle
|
11bc1aeadc
|
TThread count defaultt to fastest
|
2020-06-19 14:30:35 -04:00 |
|
Peter Boyle
|
66005929af
|
Set up the cache size on all ranks
|
2020-06-19 12:50:54 -04:00 |
|
Christoph Lehner
|
05bbc49a99
|
Edge case in GetShmDim check
|
2020-06-19 12:01:23 -04:00 |
|
Peter Boyle
|
ff7c847735
|
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
|
2020-06-19 01:22:16 -04:00 |
|
Peter Boyle
|
1aa988b2af
|
Comms overlap fix UVM case
|
2020-06-19 01:21:14 -04:00 |
|
Peter Boyle
|
edf17708a8
|
Range improvement
|
2020-06-18 22:41:06 -04:00 |
|
Christoph Lehner
|
81a8209749
|
ConvertType for blockInnerProduct
|
2020-06-18 11:53:21 -04:00 |
|
nmeyer-ur
|
a87e45ba25
|
SVE readme update
|
2020-06-18 11:23:08 +02:00 |
|
nmeyer-ur
|
465856331a
|
switch back to serialized; wrong results on single too
|
2020-06-15 15:39:39 +02:00 |
|
nmeyer-ur
|
cc958aa9ed
|
switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap
|
2020-06-15 14:21:38 +02:00 |
|
Peter Boyle
|
f46f029dbb
|
Merge pull request #292 from lehner/feature/gpt-sycl
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units …
|
2020-06-14 13:43:27 -04:00 |
|
Christoph Lehner
|
3dccd7aa2c
|
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units to consistent MB in init args; Want last element not past last element in MemoryManagerCache.cc
|
2020-06-14 13:26:01 -04:00 |
|
nmeyer-ur
|
a25e4b3d0c
|
pred 32/64 for float/double instead of 8 in VLA patch
|
2020-06-13 14:44:37 +02:00 |
|
nmeyer-ur
|
d1210ca12a
|
switch to double/float instead of float64_t/float32_t in VLA patch
|
2020-06-13 13:59:32 +02:00 |
|
nmeyer-ur
|
36ea0e222a
|
type traits for ComplexF/D in VLA patch; cosmetics in VLS intrinsics
|
2020-06-13 13:42:35 +02:00 |
|
Peter Boyle
|
65e6e7da6f
|
Merge pull request #291 from lehner/feature/gpt-sycl
Feature/gpt sycl
|
2020-06-12 20:42:32 -04:00 |
|
Christoph Lehner
|
b5e87e8d97
|
summit compile fixes
|
2020-06-12 18:16:12 -04:00 |
|
Christoph Lehner
|
5f5807d60a
|
cleanup
|
2020-06-12 14:48:23 -04:00 |
|
nmeyer-ur
|
92281ec22d
|
add 3 op Mult for VLA
|
2020-06-12 18:49:05 +02:00 |
|
nmeyer-ur
|
87266ce099
|
comment out fcmla in vector types: need also MultAddReal
|
2020-06-12 18:37:19 +02:00 |
|
nmeyer-ur
|
2a23f133e8
|
reenable fcmla for VLA
|
2020-06-12 17:30:38 +02:00 |
|
nmeyer-ur
|
8dbf790f62
|
correct tbl2 for sp
|
2020-06-12 17:12:34 +02:00 |
|
nmeyer-ur
|
2402b4940e
|
vec_imm in float
|
2020-06-12 15:17:38 +02:00 |
|
nmeyer-ur
|
2111052fbe
|
apply VLA patch for memcpy reduction suggested by Arm, CAS-162542-D6W7Z7
|
2020-06-12 14:49:19 +02:00 |
|
Christoph Lehner
|
7974acff54
|
merged sycl to feature-gpt
|
2020-06-12 06:49:38 -04:00 |
|
|
f0d17d2b49
|
Added Baryon3pt code
|
2020-06-12 11:35:52 +01:00 |
|
|
244c003a1b
|
Updated Baryon code
|
2020-06-12 11:00:25 +01:00 |
|
|
0174f5f742
|
look for librt when using shm=shmopen
|
2020-06-11 16:50:43 +01:00 |
|