paboyle
d856327250
Merge branch 'release/dirac-ITT' into develop
2017-09-05 14:56:12 +01:00
Peter Boyle
bf973d0d56
SHM complete
2017-09-05 14:30:29 +01:00
Peter Boyle
837bf8a5be
Updating to control the SHM allocation scheme under configure time options
2017-09-05 12:51:02 +01:00
Peter Boyle
c05b2199f6
Improvements to huge memory
2017-09-04 10:41:21 -04:00
Azusa Yamaguchi
a5fe07c077
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-09-04 14:10:15 +01:00
Azusa Yamaguchi
b83b2b1415
Stability improvement to BCG. Force m_rr hermitian beyond rounding.
2017-09-04 14:09:47 +01:00
paboyle
7359df3501
Full reporting for benchmark; save robustness factor
2017-08-31 10:42:35 +01:00
paboyle
4b4c2a715b
fcntl.h needed
2017-08-26 11:38:04 +01:00
paboyle
54a5e6c1d0
Check if we get huge pages on linux. Larry Meadows piece of magic.
2017-08-25 22:36:08 +01:00
paboyle
80c5bce5bb
Merge branch 'develop' into feature/multi-communicator
2017-08-25 20:21:26 +01:00
paboyle
f68b5de9c8
No compile fix on Clang
2017-08-25 19:35:21 +01:00
Peter Boyle
c289699d9a
updated from cambridge mpi3 shakeout
2017-08-25 11:41:01 +01:00
Peter Boyle
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
paboyle
5fa386ddc9
FFT test compile fixed
2017-08-24 10:17:52 +01:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
paboyle
b49bec0cec
MAP_HUGETLB portability fix
2017-08-20 03:08:54 +01:00
paboyle
1cdf999668
Moving multicommunicator into mpi3 also for threading
2017-08-20 02:39:10 +01:00
paboyle
11062fb686
Comms none fail fix
2017-08-20 01:37:07 +01:00
paboyle
a446d95c33
Trying to pass TeamCity and Travis
2017-08-20 01:10:50 +01:00
paboyle
be66e7dd95
Merge branch 'develop' into feature/multi-communicator
2017-08-19 23:12:38 +01:00
Peter Boyle
0b0cf62193
Fix mpi 3 interface change
2017-08-19 13:18:50 -04:00
Peter Boyle
7d88198387
Merge branch 'develop' into feature/multi-communicator
2017-08-19 13:03:35 -04:00
Peter Boyle
2f619482b8
Enable blocking stencil send
2017-08-19 12:53:59 -04:00
Peter Boyle
d6472eda8d
Use mmap
2017-08-19 12:53:18 -04:00
Peter Boyle
bcefdd7c4e
Align both allocator calls to 2MB
2017-08-19 12:49:02 -04:00
Guido Cossu
fd367d8bfd
Debugging the PointerCache
2017-08-16 09:42:57 +01:00
Guido Cossu
8a3fe60a27
Added more asserts at grid creation time
2017-08-08 11:36:20 +01:00
Guido Cossu
44051aecd1
Checking for integer divisions in cartesian full
2017-08-08 10:31:12 +01:00
Guido Cossu
06e6f8de00
Check that the reduced dim is an integer
2017-08-08 10:22:12 +01:00
Guido Cossu
4fe182e5a7
Added high level HMC support for overriding default SIMD lane decomposition
2017-08-06 10:46:19 +01:00
Guido Cossu
175f393f9d
Binary IO error checking
2017-08-04 12:14:10 +01:00
Peter Boyle
14d53e1c9e
Threaded MPI calls patches
2017-07-29 13:08:10 -04:00
Guido Cossu
8bd869da37
Correcting a bug in the IO routines
2017-07-27 15:12:50 +01:00
Guido Cossu
c0485d799d
Explicit parameter declaration in the WilsonGauge test
2017-07-26 16:26:04 +01:00
Guido Cossu
7abc5613bd
Added smearing to the topological charge observable
2017-07-26 16:21:17 +01:00
Guido Cossu
a4b7dddb67
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-07-26 12:07:38 +01:00
Guido Cossu
5696781862
Debug error in Tensor mult
2017-07-26 12:07:34 +01:00
Christopher Kelly
0f214ad427
Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header
2017-07-21 11:13:51 -04:00
azusayamaguchi
659d7d1a40
For test/solver
...
Fixed
2017-07-12 15:01:48 +01:00
azusayamaguchi
dc6f078246
fixed the header file for mpi3
2017-07-11 14:15:08 +01:00
Peter Boyle
40e119c61c
NUMA improvements worth preserving from AMD EPYC tests
2017-07-08 22:27:11 -04:00
Peter Boyle
a0be3f7330
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-30 10:53:50 +01:00
Peter Boyle
b5a6e4f1fd
Best option for Xeon cache blocking set
2017-06-30 10:53:22 +01:00
Peter Boyle
7a788db3dc
Guard first touch
2017-06-30 10:49:08 +01:00
Peter Boyle
f20eceb6cd
First touch once per page in a threaded loop
2017-06-30 10:48:27 +01:00
Peter Boyle
38325ebbc6
Interleave code path; not enabled
2017-06-30 10:23:51 +01:00
Peter Boyle
ac1f1838bc
KNL only
2017-06-30 10:15:32 +01:00
Guido Cossu
8859a151cc
Small corrections to the NEON port
2017-06-29 11:30:29 +01:00
Guido Cossu
688a39cfd9
Merge pull request #114 from nmeyer-ur/feature/arm-neon
...
ARM neon intrinsics support
Guido: checked and approved
2017-06-29 09:57:17 +01:00
Nils Meyer
0933aeefd4
corrected Grid_neon.h
2017-06-28 20:22:22 +02:00