1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-09-20 17:25:37 +01:00
Commit Graph

175 Commits

Author SHA1 Message Date
Peter Boyle
c0ead94791 Integrated Lebesgue code and been playing with alternate implementations of the wilson dop without
any particular success in increasing the performance.
2015-04-30 16:39:06 +01:00
Peter Boyle
7ac997bd58 Merge pull request #1 from mspraggs/patch-1
Added <map> include to GridNerscIO.h
2015-04-30 09:46:48 +01:00
mspraggs
24fc71b2e9 Added <map> include to GridNerscIO.h
Adding this allows clang to compile Grid to completion.
2015-04-29 23:44:03 +01:00
Peter Boyle
d8ffa09e3b Benchmark wilson dhop now; 14.6GF on one core, not as fast as SU(3)xSU(3) [23GF] but still not too shabby.
Disassembling output shows ugly sequences in the permute sector. Could comparatively benchmark with and without
the if-else structure to see how much I'm losing.

Drops to 9GF as it falls out of cache. Moving to Lebesgue ordering should help there. Substantive progress.
2015-04-29 06:50:18 +01:00
Peter Boyle
dcc23faa4a Fixed the stencil sector and Wilson now agrees between stencil based implementation
and the cshift based implementation. Managed to reduce the volume of code in this
sector a little, but consolidation would be good, perhaps taking common
logic out into simple helper functions
2015-04-29 06:23:56 +01:00
Peter Boyle
b0485894b3 Shaken out stencil to the point where I think wilson dslash is correct.
Need to audit code carefully, consolidate between stencil and cshift,
and then benchmark and optimise.
2015-04-28 08:11:59 +01:00
Peter Boyle
0b7d389258 Reworking CSHIFT and Stencil. Implementing Wilson and discovered rework is required 2015-04-27 13:45:07 +01:00
Peter Boyle
35cfef2129 Big updates with progress towards wilson matrix 2015-04-26 15:51:09 +01:00
Peter Boyle
c678f2d255 Starting the implementation of wilson; incomplete and committing non-functional code which
is not yet included from elsewhere or linked to the build system.
2015-04-25 14:33:02 +01:00
Peter Boyle
d5fd34b6e8 Update to TODO list 2015-04-25 13:04:26 +01:00
Peter Boyle
2d8cf9e456 Added two spinor functionality required to support the Wilson hopping term. 2015-04-25 12:54:06 +01:00
Peter Boyle
dc970c6442 Dirac done ; remove from TODO 2015-04-24 22:56:37 +01:00
Peter Boyle
fc32450360 Improved the gamma quite a bit.
Serial rng's which are set on node zero and broadcaste
2015-04-24 20:21:40 +01:00
Peter Boyle
2a67214f9d static names and enum list 2015-04-24 19:12:14 +01:00
Peter Boyle
71d5927a66 Vectors now too and right multiple of matrix with gamma 2015-04-24 19:08:29 +01:00
Peter Boyle
f2ac20e7ab Removed summation 2015-04-24 18:42:44 +01:00
Peter Boyle
750dd5f5fd Cleared the code out from Grid_summation to lattice/Grid_lattice_transfer.h 2015-04-24 18:41:34 +01:00
Peter Boyle
74432432b6 Moved code from summation into transfer and reduction 2015-04-24 18:40:44 +01:00
Peter Boyle
b8eef54fa7 First implementation of Dirac matrices as a Gamma class. 2015-04-24 18:20:03 +01:00
Peter Boyle
e2e3ea5742 Reorganised the TODO. Really getting somewhere 2015-04-23 20:42:30 +01:00
Peter Boyle
4b4dcc4c13 Rename Grid_QCD 2015-04-23 20:42:09 +01:00
Peter Boyle
afe6c4f64f move 2015-04-23 20:41:22 +01:00
Peter Boyle
62e8d2d127 Slice summation working. May move this into lattice/Grid_lattice_reduction however 2015-04-23 15:13:00 +01:00
Peter Boyle
b7416d79e3 Begginings of slice summation and subblocking 2015-04-23 11:04:59 +01:00
Peter Boyle
2f8431ab03 Consolidate index to coor in a single routine 2015-04-23 11:04:19 +01:00
Peter Boyle
a9e574dd27 Snippets from Guido to optimise Reduce 2015-04-23 08:31:40 +01:00
Peter Boyle
73c0db82d5 Better description of Intel's many ISA targets 2015-04-23 08:02:51 +01:00
Peter Boyle
eb58297a43 Fixing endian on linux I hope 2015-04-23 07:51:15 +01:00
Peter Boyle
1851327d19 Got the NERSC IO working and fixed a bug in cshift. 2015-04-22 22:46:48 +01:00
Peter Boyle
a5b0c492d7 Rework of RNG to use C++11 random. Should work correctly maintaining parallel RNG across
a machine. If a "fixedSeed" is used, randoms should be reproducible across different machine
decomposition since the generators are physically indexed and assigned in lexico ordering.
2015-04-19 14:55:58 +01:00
Peter Boyle
650410cb2f Update to task list 2015-04-19 14:55:16 +01:00
Peter Boyle
f64d39ab57 Split all OMP directives into lattice subdir for easy maintainance of
parallelism and future OMP 4.0 offload.
2015-04-18 22:17:01 +01:00
Peter Boyle
4e1a3aee82 Update 2015-04-18 22:16:31 +01:00
Peter Boyle
1556c2ba3f Finishing the reorg 2015-04-18 21:24:10 +01:00
Peter Boyle
62fec04419 Reorganisation 2015-04-18 21:23:32 +01:00
Peter Boyle
aee6669d0b Build reorg with which I am a bit happier 2015-04-18 21:22:50 +01:00
Peter Boyle
a17ce0695b Clean up 2015-04-18 20:52:40 +01:00
Peter Boyle
e6ec92d0e4 More files, shorter each. 2015-04-18 20:45:00 +01:00
Peter Boyle
d964d01d6a Shrinking and organising the files 2015-04-18 20:44:19 +01:00
Peter Boyle
0fce523792 Split up into multiple files 2015-04-18 18:54:30 +01:00
Peter Boyle
520af214af splitting into smaller, multiple files for readability and easy find. 2015-04-18 18:47:43 +01:00
Peter Boyle
62ee8e1cb3 Cleanup 2015-04-18 18:37:56 +01:00
Peter Boyle
3931ad65c8 Reorg 2015-04-18 18:37:22 +01:00
Peter Boyle
8195d302dc Reorganise to keep files smaller 2015-04-18 18:36:48 +01:00
Peter Boyle
f7d80aac7f Rename 2015-04-18 17:10:45 +01:00
Peter Boyle
08f20da103 Clean up caps. 2015-04-18 17:09:48 +01:00
Peter Boyle
2c9e5aa054 Clean up capitalisation 2015-04-18 17:09:24 +01:00
Peter Boyle
ac181abc95 Rename 2015-04-18 17:07:37 +01:00
Peter Boyle
18a885d195 Renaming 2015-04-18 17:07:09 +01:00
Peter Boyle
1674f899e0 Cleaing up 2015-04-18 16:42:47 +01:00