paboyle
1f26a234f9
CPU loops explicit for peek poke
2018-01-24 13:36:31 +00:00
paboyle
13f0116425
Accelerator loops
2018-01-24 13:35:55 +00:00
paboyle
25f589b064
Accelerator loops
2018-01-24 13:35:36 +00:00
paboyle
210c50a278
Accelerator prep work
2018-01-24 13:35:13 +00:00
paboyle
549a143e78
Accelerator related
2018-01-24 13:34:46 +00:00
paboyle
277301486d
Simple warning elimination
2018-01-24 13:34:15 +00:00
paboyle
c851b39a49
Nicer way of including aggregate
2018-01-24 13:33:34 +00:00
paboyle
15cc12eb6c
Delete the old non ET file
2018-01-24 13:33:07 +00:00
paboyle
ae4f1f8c12
New file, split out two from Lattice_reduction
2018-01-24 13:32:43 +00:00
paboyle
2d13ea1a22
Namespace and indentation emacs choices
2018-01-15 00:05:55 +00:00
paboyle
8c675064bd
Namespace and indentation
2018-01-15 00:04:43 +00:00
paboyle
550b905bb8
Namespace nd indentation
2018-01-15 00:03:49 +00:00
paboyle
edb79dc088
Namespce,and indent
2018-01-15 00:02:33 +00:00
paboyle
88e635c5d1
Namepscae, format
2018-01-15 00:02:01 +00:00
paboyle
ecb4a24de8
Namespace
2018-01-15 00:01:25 +00:00
paboyle
c8c1d36710
Namespace, indent
2018-01-15 00:00:52 +00:00
paboyle
b4bb428d9b
Namespace, indent
2018-01-14 23:59:57 +00:00
paboyle
e9ef7e3852
Namespace, indent
2018-01-14 23:59:23 +00:00
paboyle
31cbbfc07e
Namespace, indent
2018-01-14 23:58:44 +00:00
paboyle
4eb0552d1d
Namespace, indnet
2018-01-14 23:58:03 +00:00
paboyle
08f2a4564f
Namespace, formatting
2018-01-14 23:56:33 +00:00
paboyle
7e00f643f8
Namespace indent
2018-01-14 23:55:44 +00:00
paboyle
c19ccdad7c
Namespace, indent
2018-01-14 23:55:07 +00:00
paboyle
8aed4181e1
Namespace, indent
2018-01-14 23:54:25 +00:00
paboyle
06ab7f5661
Namespace
2018-01-14 23:53:31 +00:00
paboyle
645ec8eba0
Namespace
2018-01-14 23:52:26 +00:00
paboyle
72ffa8a88e
Namespace
2018-01-14 23:51:38 +00:00
paboyle
9579c9c327
Threading improvement
2017-12-05 14:12:22 +00:00
paboyle
c24d4c8d0e
Improved parallel RNG init
2017-12-05 13:01:10 +00:00
paboyle
3e560b9462
Faster RNG init
2017-12-05 11:42:05 +00:00
paboyle
d93c6760ec
Faster code for split unsplit
2017-12-05 11:39:26 +00:00
paboyle
28ceacec45
Split/Unsplit working
2017-11-27 15:13:29 +00:00
paboyle
4987edbd44
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-11-27 12:34:56 +00:00
paboyle
1f04e56038
Believe split/unsplit works, but need to make pretty
2017-11-27 12:33:08 +00:00
azusayamaguchi
e55397bc13
Staggerd cg
2017-11-24 14:18:30 +00:00
paboyle
94b8fb5686
Debug in progress
2017-11-19 01:39:04 +00:00
paboyle
a7f72eb994
SHaking out
2017-10-30 00:22:06 +00:00
paboyle
84b441800f
Merge branch 'develop' into feature/lanczos-reorg
2017-10-27 14:21:38 +01:00
paboyle
1ef424b139
Split grid Y2K bug fix attempt
2017-10-27 14:20:35 +01:00
paboyle
08583afaff
Red black friendly coarsening
2017-10-25 23:51:18 +01:00
paboyle
08ca338875
Split grid communication
2017-10-09 23:19:45 +01:00
paboyle
4f8b6f26b4
Merge branch 'develop' into feature/dwf-multirhs
2017-10-02 11:41:49 +01:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
azusayamaguchi
659d7d1a40
For test/solver
...
Fixed
2017-07-12 15:01:48 +01:00
paboyle
349d75e483
Precision fix
2017-06-23 02:57:59 -07:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
...
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
paboyle
c85024683e
Merge branch 'feature/parallelio' into develop
2017-06-19 01:39:48 +01:00
Peter Boyle
6f687a67cd
As local vols increase, use 64 bits for safety
2017-06-01 17:36:18 -04:00
paboyle
58e8d0a10d
reverse direction lexico mapping
2017-05-30 23:38:30 +01:00