8cfd5d2639
Need lattice view
2020-06-03 09:11:28 -04:00
1c9f20b15e
Views must be closed
2020-06-03 09:10:29 -04:00
32237895bd
Reorg memory manager for O(1) hash table
2020-06-03 09:09:52 -04:00
c5c2dbc0ce
Optional CUDA info
2020-06-02 14:21:49 -04:00
9fcb47ee63
Explicit error message instead of infinite loop in GlobalSharedMemory::GetShmDims
2020-06-02 07:44:38 -04:00
1d252d0922
Accelerator inline
2020-05-28 11:45:25 -04:00
006cc8a8f1
Staggered move to accelerator
2020-05-28 08:33:06 -04:00
cf2938688a
Sycl unhappy fix
2020-05-25 08:36:53 -07:00
ee63721bad
int unhappiness sycl fix
2020-05-25 08:36:24 -07:00
22c5168d70
Sycl happier
2020-05-25 08:35:56 -07:00
949ac3cd24
Must avoid non-trivial copy constructors
2020-05-25 08:35:28 -07:00
7bc0166c1c
Making SYCL happy - must avoid non-trivial copy constructors
2020-05-25 08:34:19 -07:00
cb0d1b3399
hopefully fix build fail
2020-05-24 21:27:00 -04:00
d1f1ccc705
HIP changes
2020-05-24 21:18:49 -04:00
c7519a237a
Assertions fail on HIP for unknown reasons - debugging
2020-05-24 14:02:47 -04:00
32be2b13d3
Updates for HIP
2020-05-24 14:00:55 -04:00
92b342a477
Hip reduction too
2020-05-24 13:50:28 -04:00
556da86ac3
HIP fp16
2020-05-24 13:41:58 -04:00
8285e41574
View location / access mode
2020-05-21 16:14:41 -04:00
f999408e92
View location and access mode
2020-05-21 16:14:20 -04:00
a7abda89e2
View location & access mode
2020-05-21 16:13:59 -04:00
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile time option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
6c6812a5ca
GB/s output
2020-05-20 12:26:57 +01:00
8358ee38c4
pull develop
2020-05-19 08:56:18 -04:00
1f154fe652
some cleanup in BaryonUtils
2020-05-19 13:48:56 +01:00
d708c0258d
some cleanup in BaryonUtils
2020-05-19 13:48:00 +01:00
a7635fd5ba
summit mem
2020-05-18 17:52:26 -04:00
ebb60330c9
Automatic data motion options beginning
2020-05-17 16:34:25 -04:00
5aa60be17d
SerialisableClassName method for serialisable enum, and boolean to test if a serialisable object is an enum
2020-05-15 20:00:34 +01:00
32fbdf4fb1
Merge pull request #5 from paboyle/develop
...
Sync upstream
2020-05-13 09:02:56 +02:00
a9847aa866
Dependence fix
2020-05-12 20:03:37 -04:00
2e652431e5
No compile on summit fix
2020-05-12 18:56:47 -04:00
8b5b55b682
Make tests all compile with current Grid, mostly MdagM removal of norms fixes but a few minor
...
issues fixed too
2020-05-12 17:57:24 -04:00
0e3c49f687
TransposeIndex was broken by Christoph
2020-05-12 17:57:01 -04:00
cb7ee37562
Close expressions in arg to cshift
2020-05-12 17:56:40 -04:00
82f71643a4
Remove the norm in MdagM
2020-05-12 17:55:53 -04:00
d24d8e8398
Use X-direction as more bits meaningful on CUDA.
...
2^31-1 should always be enough for SIMD and thread reduced local volume
e.g. 32*2^31 = 2^36 = (2^9)^4 or 512^4 is big enough.
Where 32 is gpu_threads * Nsimd = 8*4
2020-05-12 10:35:49 -04:00
162e4bb567
no automatic prefetching for now
2020-05-12 07:01:23 -04:00
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
8c31c065b5
Keep the Vector fixed to protect it from realloc
2020-05-11 17:00:30 -04:00
b1c86900b2
Merge pull request #4 from paboyle/develop
...
merge
2020-05-11 20:59:29 +02:00
bbbee5660d
First compile on HIP
2020-05-10 05:28:09 -04:00
ea08f193e7
Allocator cache split into large/small pools
2020-05-10 05:24:26 -04:00
2bb2c68e15
Separate pools for small and large allocations cache
2020-05-09 22:57:21 -04:00
efe5bc6a3c
Split allocator cache into two pools of different sizes
2020-05-09 22:27:56 -04:00
384da487bd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 18:55:11 -04:00
ee1de82a53
Working ITT benchmark again
2020-05-08 18:54:50 -04:00
2b576fc185
Remove commented dead code
2020-05-08 18:54:29 -04:00
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
b01b7f761a
Merge pull request #283 from DanielRichtmann/feature/minor-fixes
...
Some small fixes
2020-05-08 10:52:03 -04:00