Peter Boyle
|
bf973d0d56
|
SHM complete
|
2017-09-05 14:30:29 +01:00 |
|
Peter Boyle
|
837bf8a5be
|
Updating to control the SHM allocation scheme under configure time options
|
2017-09-05 12:51:02 +01:00 |
|
Peter Boyle
|
c05b2199f6
|
Improvements to huge memory
|
2017-09-04 10:41:21 -04:00 |
|
Peter Boyle
|
c3b1263e75
|
Benchmark prep
|
2017-08-25 09:25:54 +01:00 |
|
paboyle
|
b49bec0cec
|
MAP_HUGETLB portability fix
|
2017-08-20 03:08:54 +01:00 |
|
paboyle
|
1cdf999668
|
Moving multicommunicator into mpi3 also for threading
|
2017-08-20 02:39:10 +01:00 |
|
paboyle
|
a446d95c33
|
Trying to pass TeamCity and Travis
|
2017-08-20 01:10:50 +01:00 |
|
Peter Boyle
|
0b0cf62193
|
Fix mpi 3 interface change
|
2017-08-19 13:18:50 -04:00 |
|
Peter Boyle
|
7d88198387
|
Merge branch 'develop' into feature/multi-communicator
|
2017-08-19 13:03:35 -04:00 |
|
azusayamaguchi
|
dc6f078246
|
fixed the header file for mpi3
|
2017-07-11 14:15:08 +01:00 |
|
Peter Boyle
|
40e119c61c
|
NUMA improvements worth preserving from AMD EPYC tests
|
2017-07-08 22:27:11 -04:00 |
|
paboyle
|
54e94360ad
|
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
|
2017-06-24 23:10:24 +01:00 |
|
paboyle
|
3bfd1f13e6
|
I/O improvements
|
2017-06-11 23:14:10 +01:00 |
|
paboyle
|
e30fa9f4b8
|
RankCount; need to clean up ambigious ProcessCount
|
2017-05-30 23:39:16 +01:00 |
|
paboyle
|
5592f7b8c1
|
Creation mode better implementation
|
2017-04-05 02:35:34 +09:00 |
|
paboyle
|
35da4ece0b
|
UID fix
|
2017-04-05 02:18:15 +09:00 |
|
paboyle
|
417ec56cca
|
Release candidate
|
2017-03-29 05:45:33 -04:00 |
|
paboyle
|
35695ba57a
|
Bug fix in MPI3
|
2017-03-29 04:43:55 -04:00 |
|
paboyle
|
fc93f0b2ec
|
Save some code for static huge tlb's. It is ifdef'ed out but an interesting root only experiment.
No gain from it.
|
2017-03-21 22:30:29 -04:00 |
|
paboyle
|
4e7ab3166f
|
Refactoring header layout
|
2017-02-22 18:09:33 +00:00 |
|
paboyle
|
3ae92fa2e6
|
Global changes to parallel_for structure.
Move the comms flags to more sensible names
|
2017-02-21 05:24:27 -05:00 |
|
paboyle
|
37720c4db7
|
Count bytes off node only
|
2017-02-20 17:47:40 -05:00 |
|
paboyle
|
5c0adf7bf2
|
Make clang happy with parenthesis
|
2017-02-16 23:51:33 +00:00 |
|
paboyle
|
73547cca66
|
MPI3 working i think
|
2017-02-07 01:30:02 -05:00 |
|
paboyle
|
791cb050c8
|
Comms improvements
|
2016-11-01 11:35:43 +00:00 |
|
paboyle
|
09f66100d3
|
MPI 3 compile on non-linux
|
2016-10-25 06:01:12 +01:00 |
|
azusayamaguchi
|
b94478fa51
|
mpi, mpi3, shmem all compile.
mpi, mpi3 pass single node multi-rank
|
2016-10-24 23:45:31 +01:00 |
|
azusayamaguchi
|
b6a65059a2
|
Update to use shared memory to contain the stencil comms buffers
Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions
|
2016-10-24 17:30:43 +01:00 |
|
azusayamaguchi
|
c190221fd3
|
Internal SHM comms in non-simd directions working
Need to fix simd directions
|
2016-10-22 18:14:27 +01:00 |
|
azusayamaguchi
|
910b8dd6a1
|
use simd type
|
2016-10-21 22:35:29 +01:00 |
|
azusayamaguchi
|
09fd5c43a7
|
Reasonably fast version
|
2016-10-21 15:17:39 +01:00 |
|
azusayamaguchi
|
f331809c27
|
Use variable type for loop
|
2016-10-21 13:35:37 +01:00 |
|
paboyle
|
306160ad9a
|
bcopy threaded
|
2016-10-21 12:07:28 +01:00 |
|
paboyle
|
a762b1fb71
|
MPI3 working with a bounce through shared memory on my laptop.
Longer term plan: make the "u_comm_buf" in Stencil point to the shared region and avoid the
send between ranks on same node.
|
2016-10-21 09:03:26 +01:00 |
|
paboyle
|
b58adc6a4b
|
commVector
|
2016-10-20 17:00:15 +01:00 |
|