1
0
mirror of https://github.com/paboyle/Grid.git synced 2026-04-18 17:56:12 +01:00
Commit Graph

1978 Commits

Author SHA1 Message Date
paboyle ba27cc6571 Mac os happiness 2017-09-05 15:00:16 +01:00
Peter Boyle bf973d0d56 SHM complete 2017-09-05 14:30:29 +01:00
Peter Boyle 837bf8a5be Updating to control the SHM allocation scheme under configure time options 2017-09-05 12:51:02 +01:00
Peter Boyle c05b2199f6 Improvements to huge memory 2017-09-04 10:41:21 -04:00
paboyle 7359df3501 Full reporting for benchmark; save robustness factor 2017-08-31 10:42:35 +01:00
paboyle 80c5bce5bb Merge branch 'develop' into feature/multi-communicator 2017-08-25 20:21:26 +01:00
paboyle f68b5de9c8 No compile fix on Clang 2017-08-25 19:35:21 +01:00
Peter Boyle c289699d9a updated from cambridge mpi3 shakeout 2017-08-25 11:41:01 +01:00
Peter Boyle c3b1263e75 Benchmark prep 2017-08-25 09:25:54 +01:00
paboyle 5fa386ddc9 FFT test compile fixed 2017-08-24 10:17:52 +01:00
Azusa Yamaguchi d9cd4f0273 Staggered multinode block cg debugged. Missing global sum.
Code stalls and resumes on KNL at cambridge. Curious.

CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
paboyle b49bec0cec MAP_HUGETLB portability fix 2017-08-20 03:08:54 +01:00
paboyle 1cdf999668 Moving multicommunicator into mpi3 also for threading 2017-08-20 02:39:10 +01:00
paboyle 11062fb686 Comms none fail fix 2017-08-20 01:37:07 +01:00
paboyle a446d95c33 Trying to pass TeamCity and Travis 2017-08-20 01:10:50 +01:00
paboyle be66e7dd95 Merge branch 'develop' into feature/multi-communicator 2017-08-19 23:12:38 +01:00
Peter Boyle 0b0cf62193 Fix mpi 3 interface change 2017-08-19 13:18:50 -04:00
Peter Boyle 7d88198387 Merge branch 'develop' into feature/multi-communicator 2017-08-19 13:03:35 -04:00
Peter Boyle 2f619482b8 Enable blocking stencil send 2017-08-19 12:53:59 -04:00
Peter Boyle d6472eda8d Use mmap 2017-08-19 12:53:18 -04:00
Peter Boyle bcefdd7c4e Align both allocator calls to 2MB 2017-08-19 12:49:02 -04:00
Guido Cossu fd367d8bfd Debugging the PointerCache 2017-08-16 09:42:57 +01:00
Guido Cossu 8a3fe60a27 Added more asserts at grid creation time 2017-08-08 11:36:20 +01:00
Guido Cossu 44051aecd1 Checking for integer divisions in cartesian full 2017-08-08 10:31:12 +01:00
Guido Cossu 06e6f8de00 Check that the reduced dim is an integer 2017-08-08 10:22:12 +01:00
Guido Cossu 4fe182e5a7 Added high level HMC support for overriding default SIMD lane decomposition 2017-08-06 10:46:19 +01:00
Guido Cossu 175f393f9d Binary IO error checking 2017-08-04 12:14:10 +01:00
Peter Boyle 14d53e1c9e Threaded MPI calls patches 2017-07-29 13:08:10 -04:00
Guido Cossu 8bd869da37 Correcting a bug in the IO routines 2017-07-27 15:12:50 +01:00
Guido Cossu c0485d799d Explicit parameter declaration in the WilsonGauge test 2017-07-26 16:26:04 +01:00
Guido Cossu 7abc5613bd Added smearing to the topological charge observable 2017-07-26 16:21:17 +01:00
Guido Cossu a4b7dddb67 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2017-07-26 12:07:38 +01:00
Guido Cossu 5696781862 Debug error in Tensor mult 2017-07-26 12:07:34 +01:00
Christopher Kelly 0f214ad427 Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header 2017-07-21 11:13:51 -04:00
azusayamaguchi 659d7d1a40 For test/solver
Fixed
2017-07-12 15:01:48 +01:00
azusayamaguchi dc6f078246 fixed the header file for mpi3 2017-07-11 14:15:08 +01:00
Peter Boyle 40e119c61c NUMA improvements worth preserving from AMD EPYC tests 2017-07-08 22:27:11 -04:00
Peter Boyle a0be3f7330 Merge branch 'develop' of https://github.com/paboyle/Grid into develop 2017-06-30 10:53:50 +01:00
Peter Boyle b5a6e4f1fd Best option for Xeon cache blocking set 2017-06-30 10:53:22 +01:00
Peter Boyle 7a788db3dc Guard first touch 2017-06-30 10:49:08 +01:00
Peter Boyle f20eceb6cd First touch once per page in a threaded loop 2017-06-30 10:48:27 +01:00
Peter Boyle 38325ebbc6 Interleave code path; not enabled 2017-06-30 10:23:51 +01:00
Peter Boyle ac1f1838bc KNL only 2017-06-30 10:15:32 +01:00
Guido Cossu 8859a151cc Small corrections to the NEON port 2017-06-29 11:30:29 +01:00
Guido Cossu 688a39cfd9 Merge pull request #114 from nmeyer-ur/feature/arm-neon
ARM neon intrinsics support
Guido: checked and approved
2017-06-29 09:57:17 +01:00
Nils Meyer 0933aeefd4 corrected Grid_neon.h 2017-06-28 20:22:22 +02:00
portelli 07de925127 minor scalar action fixes 2017-06-28 12:45:44 +01:00
Nils Meyer a9c816a268 moved file to correct folder 2017-06-27 21:39:15 +02:00
Nils Meyer bf729766dd removed collision with QPX implementation 2017-06-27 20:32:24 +02:00
portelli 0b707b861c Merge branch 'develop' into feature/scalar-hmc-update 2017-06-27 14:40:05 +01:00