Peter Boyle
|
6c5fa8dcd8
|
Aligned allocate on CPU put through this interface
|
2020-06-20 14:34:29 -04:00 |
|
Peter Boyle
|
0d2f913a1a
|
String.h for linux
|
2020-06-20 09:37:31 -04:00 |
|
Christoph Lehner
|
5b117865b2
|
Merge pull request #6 from paboyle/sycl
Sycl
|
2020-06-20 09:44:44 +02:00 |
|
Peter Boyle
|
1a74816c25
|
Hopefully fixed
|
2020-06-19 17:50:52 -04:00 |
|
Peter Boyle
|
73de335256
|
Merge branch 'develop' into sycl
|
2020-06-19 17:44:16 -04:00 |
|
Peter Boyle
|
228fd450ce
|
Typo fix (excuse - my keyboard is starting to break)
|
2020-06-19 17:36:05 -04:00 |
|
Peter Boyle
|
b949cf6b12
|
PeekLocal needs a view to keep thread safe.
ALLOCATION_CACHE reenable
|
2020-06-19 17:13:27 -04:00 |
|
Peter Boyle
|
11bc1aeadc
|
Thread count default to fastest
|
2020-06-19 14:30:35 -04:00 |
|
Peter Boyle
|
66005929af
|
Set up the cache size on all ranks
|
2020-06-19 12:50:54 -04:00 |
|
Christoph Lehner
|
05bbc49a99
|
Edge case in GetShmDim check
|
2020-06-19 12:01:23 -04:00 |
|
Peter Boyle
|
ff7c847735
|
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
|
2020-06-19 01:22:16 -04:00 |
|
Peter Boyle
|
1aa988b2af
|
Comms overlap fix UVM case
|
2020-06-19 01:21:14 -04:00 |
|
Peter Boyle
|
edf17708a8
|
Range improvement
|
2020-06-18 22:41:06 -04:00 |
|
Christoph Lehner
|
81a8209749
|
ConvertType for blockInnerProduct
|
2020-06-18 11:53:21 -04:00 |
|
nmeyer-ur
|
a87e45ba25
|
SVE readme update
|
2020-06-18 11:23:08 +02:00 |
|
nmeyer-ur
|
465856331a
|
switch back to serialized; wrong results on single too
|
2020-06-15 15:39:39 +02:00 |
|
nmeyer-ur
|
cc958aa9ed
|
switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap
|
2020-06-15 14:21:38 +02:00 |
|
Peter Boyle
|
f46f029dbb
|
Merge pull request #292 from lehner/feature/gpt-sycl
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units …
|
2020-06-14 13:43:27 -04:00 |
|
Christoph Lehner
|
3dccd7aa2c
|
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units to consistent MB in init args; Want last element not past last element in MemoryManagerCache.cc
|
2020-06-14 13:26:01 -04:00 |
|
nmeyer-ur
|
a25e4b3d0c
|
pred 32/64 for float/double instead of 8 in VLA patch
|
2020-06-13 14:44:37 +02:00 |
|
nmeyer-ur
|
d1210ca12a
|
switch to double/float instead of float64_t/float32_t in VLA patch
|
2020-06-13 13:59:32 +02:00 |
|
nmeyer-ur
|
36ea0e222a
|
type traits for ComplexF/D in VLA patch; cosmetics in VLS intrinsics
|
2020-06-13 13:42:35 +02:00 |
|
Peter Boyle
|
65e6e7da6f
|
Merge pull request #291 from lehner/feature/gpt-sycl
Feature/gpt sycl
|
2020-06-12 20:42:32 -04:00 |
|
Christoph Lehner
|
b5e87e8d97
|
summit compile fixes
|
2020-06-12 18:16:12 -04:00 |
|
Christoph Lehner
|
5f5807d60a
|
cleanup
|
2020-06-12 14:48:23 -04:00 |
|
nmeyer-ur
|
92281ec22d
|
add 3 op Mult for VLA
|
2020-06-12 18:49:05 +02:00 |
|
nmeyer-ur
|
87266ce099
|
comment out fcmla in vector types: need also MultAddReal
|
2020-06-12 18:37:19 +02:00 |
|
nmeyer-ur
|
2a23f133e8
|
reenable fcmla for VLA
|
2020-06-12 17:30:38 +02:00 |
|
nmeyer-ur
|
8dbf790f62
|
correct tbl2 for sp
|
2020-06-12 17:12:34 +02:00 |
|
nmeyer-ur
|
2402b4940e
|
vec_imm in float
|
2020-06-12 15:17:38 +02:00 |
|
nmeyer-ur
|
2111052fbe
|
apply VLA patch for memcpy reduction suggested by Arm, CAS-162542-D6W7Z7
|
2020-06-12 14:49:19 +02:00 |
|
Christoph Lehner
|
7974acff54
|
merged sycl to feature-gpt
|
2020-06-12 06:49:38 -04:00 |
|
|
f0d17d2b49
|
Added Baryon3pt code
|
2020-06-12 11:35:52 +01:00 |
|
|
244c003a1b
|
Updated Baryon code
|
2020-06-12 11:00:25 +01:00 |
|
|
0174f5f742
|
look for librt when using shm=shmopen
|
2020-06-11 16:50:43 +01:00 |
|
Peter Boyle
|
32b2b59be4
|
Offload
|
2020-06-10 20:36:26 -04:00 |
|
Peter Boyle
|
86bb0cc24b
|
Keep on GPU
|
2020-06-10 20:00:00 -04:00 |
|
Peter Boyle
|
84c19587e7
|
Offload
|
2020-06-10 19:59:31 -04:00 |
|
Peter Boyle
|
237ce92540
|
Offload loops
|
2020-06-10 19:59:11 -04:00 |
|
Peter Boyle
|
a7ffc61e82
|
acceleratorSIMTlane()
|
2020-06-10 19:58:33 -04:00 |
|
Peter Boyle
|
fd97f64612
|
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
|
2020-06-10 12:58:13 -04:00 |
|
Peter Boyle
|
8720aecb80
|
Offload more loops
|
2020-06-10 12:57:55 -04:00 |
|
Peter Boyle
|
cdf0a04fc5
|
Merge branch 'develop' into sycl
|
2020-06-09 04:00:12 -04:00 |
|
Peter Boyle
|
616d3dd737
|
Compile updates
|
2020-06-08 18:57:41 -04:00 |
|
Peter Boyle
|
8b066baca8
|
Implement transient mechanism
|
2020-06-08 18:28:53 -04:00 |
|
Peter Boyle
|
e97f3688db
|
Fix the HMC issue - kernel was launching asynchronously
|
2020-06-08 17:01:15 -04:00 |
|
nmeyer-ur
|
433766ac62
|
revert Add/SubTimesI and prefetching in stencil
This reverts commit 9b2699226c.
|
2020-06-08 12:02:53 +02:00 |
|
nmeyer-ur
|
93a37c8f68
|
test prefetch to L2 in stencil
|
2020-06-08 09:39:50 +02:00 |
|
Peter Boyle
|
89a1e78390
|
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
|
2020-06-05 23:20:37 -04:00 |
|
Peter Boyle
|
ffbb3fc02c
|
Merge pull request #287 from felixerben/baryon-cleaner
slightly cleaner baryon 2pt code
|
2020-06-05 22:54:52 -04:00 |
|