Peter Boyle
ccf147d6c1
Select the compiler that gives better performance on sunspot
2024-05-07 18:45:56 +00:00
Peter Boyle
7aa12b446f
New config command for sunspot
2024-05-07 18:45:40 +00:00
Peter Boyle
c293228102
layout control
2024-05-07 18:45:21 +00:00
Peter Boyle
5c4c9f721a
Remove pbs file and replace with bench1 and bench2 for 1 and 2 nodes
2024-05-07 18:44:49 +00:00
Peter Boyle
057f86c1de
2 queues works ok in performance
2024-05-07 18:42:50 +00:00
Peter Boyle
cd52e3cbc2
Jobs on subspot
2024-05-07 18:38:15 +00:00
Peter Boyle
24602e1259
Accidental synchronise
2024-05-07 17:28:38 +00:00
Peter Boyle
8a098889fc
Update FlightRecorder.cc
2024-04-30 21:15:08 +01:00
Peter Boyle
ff2ea5de18
Update Tensor_traits.h
2024-04-11 14:25:45 -04:00
Peter Boyle
da59379612
Large reg file for double
2024-03-26 17:03:20 +00:00
Peter Boyle
3ef2a41518
ifdef guard ommitted
2024-03-26 14:50:32 +00:00
Peter Boyle
aa96f420c6
Acclerator ware MPI guard on the Unix domain sockets
2024-03-26 14:41:25 +00:00
Peter Boyle
49e9e4ed0e
Fences
2024-03-26 14:14:06 +00:00
Peter Boyle
f7b8163016
Deterministic MPI reduce options
2024-03-26 14:11:40 +00:00
Peter Boyle
93769eacd3
Updated configure for bounce through host
2024-03-26 14:10:24 +00:00
Peter Boyle
59b0cc11df
REduce the time in single
2024-03-26 00:42:40 +00:00
Peter Boyle
f32c275376
Updated config options for MPI not being aware of GPU
2024-03-26 00:42:00 +00:00
Peter Boyle
5404fc66ab
Merge needs a fence on SYCL
2024-03-26 00:38:41 +00:00
Peter Boyle
1f53458af8
Options to bounce through a host buffer if
...
--disable-accelerator-aware-mpi
2024-03-26 00:37:19 +00:00
Peter Boyle
434c3e7f1d
We have a choice of GET or PUT across NVlink
2024-03-25 14:32:44 +00:00
Peter Boyle
500b119f3d
Deterministic MPI
2024-03-22 15:55:23 +00:00
Peter Boyle
4b87259c1b
New config command for sunspot
2024-03-22 15:43:49 +00:00
Peter Boyle
503dec34ef
This appears working now on Sunspot
2024-03-22 15:43:30 +00:00
Peter Boyle
d1e9fe50d2
Xor csum for repro testing
2024-03-22 15:42:57 +00:00
Peter Boyle
d01e5fa838
Improved FlightRecorder
2024-03-22 15:42:32 +00:00
Peter Boyle
a477c25e8c
Sunspot repro tests
2024-03-22 15:42:11 +00:00
Peter Boyle
1bd20cd9e8
FlightRecorder
2024-03-22 15:40:01 +00:00
Peter Boyle
e49e95b037
Upgrade of the Britney test with flight recorder and fast xor checksum
2024-03-22 15:39:27 +00:00
Peter Boyle
6f59fed563
Flight recorder, resurrecting the "world famous" Britney test
2024-03-22 15:32:32 +00:00
Peter Boyle
60b7f6c99d
Flight recorder, resurrecting the "world famous" Britney test
2024-03-22 15:32:26 +00:00
Peter Boyle
b92dfcc8d3
Flight recorder, resurrecting the "world famous" Britney test
2024-03-22 15:30:27 +00:00
Peter Boyle
f6fd6dd053
Flight recorder, resurrecting the "world famous" Britney test
2024-03-22 15:30:01 +00:00
Peter Boyle
79ad567dd5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2024-03-19 15:43:42 +00:00
Peter Boyle
fab1efb48c
More britney logging improvements
2024-03-19 14:36:21 +00:00
Peter Boyle
660eb76d93
FFTW from OneAPI
2024-03-19 14:28:33 +00:00
Peter Boyle
62e7bf024a
Updated flight logging for Britney test
2024-03-12 20:10:04 +00:00
Peter Boyle
95f3d69cf9
Extra hardware test hook
2024-03-12 20:09:37 +00:00
89c0519f83
Repro test
2024-03-12 16:11:33 +00:00
2704b82084
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2024-03-12 15:16:24 +00:00
cf8632bbac
Britney test option
2024-03-12 15:15:35 +00:00
d224297972
PBS scripts
2024-03-12 15:15:16 +00:00
Peter Boyle
a4d11a630f
Merge pull request #458 from paboyle/fix/HOST_NAME_MAX
...
fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined
2024-03-07 07:50:25 -05:00
2b4399f8b1
more HOST_NAME_MAX fix
2024-03-07 15:26:01 +09:00
f17b8de907
fallback to _POSIX_HOST_NAME_MAX if HOST_NAME_MAX is not defined
2024-03-07 15:22:08 +09:00
Peter Boyle
7e5bd46dd3
Booster update
2024-03-06 19:03:45 +01:00
Peter Boyle
228bbb9d81
Benchmark results
2024-03-06 19:03:35 +01:00
b812a7b4c6
Staggered launch script
2024-03-06 01:32:40 +00:00
891a366f73
Repro CG script
2024-03-06 01:22:55 +00:00
10116b3be8
Force device copyable and tell SYCL to shut it.
2024-03-06 01:13:27 +00:00
a46a0f0882
force device copyable and don't take crap from SYCL
2024-03-06 01:12:49 +00:00
a26a8a38f4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2024-03-06 00:05:00 +00:00
7435315d50
More blasted shell variables
2024-03-06 00:03:59 +00:00
9b5f741e85
Reproducing CG can be more useful now
2024-03-06 00:03:16 +00:00
517822fdd2
SPR HBM benchmarking right and also PVC batched GEMM
2024-03-06 00:02:27 +00:00
1b93a9be88
Print out the hostname
2024-03-06 00:01:58 +00:00
783a66b348
Deterministic reduction please
2024-03-06 00:01:37 +00:00
976c3e9b59
Hack for flight logging CG inner products.
...
Can be made to work, but could put in some more serious infrastructure
for repro testing and blame attribution (Britney test) if necessary
2024-03-05 23:59:57 +00:00
f8ca971dae
Use of a bare PRECISION macro is not namespace safe and collides with
...
SYCL
2024-03-05 23:59:13 +00:00
21bc8c24df
OneMKL batched blas starting
2024-03-05 23:58:20 +00:00
30228214f7
SYCL conflict with Eigen
2024-03-05 23:56:10 +00:00
Peter Boyle
2ae980ae43
Update sourceme.sh
2024-03-05 13:39:18 -05:00
Peter Boyle
6153dec2e4
Update setup.sh
2024-03-05 13:38:32 -05:00
Peter Boyle
c805f86343
USQCD benchmark
2024-03-01 00:05:04 -05:00
Peter Boyle
04ca065281
Only one rank opens
2024-02-29 20:09:11 -05:00
Peter Boyle
88d8fa43d7
Benchmark development
2024-02-29 20:01:44 -05:00
Peter Boyle
3c49762875
Propagate in the blas routine
2024-02-29 15:33:06 -05:00
Peter Boyle
436bf1d9d3
Merge pull request #455 from clarkedavida/hisq_fat_links
...
Hisq fat links
2024-02-29 15:29:39 -05:00
david clarke
f70df6e195
changed NO_SHIFT and BACKWARD_CONST from define to enum
2024-02-29 12:29:30 -07:00
Peter Boyle
fce3852dff
Merge pull request #451 from paboyle/feature/eigen-3.4.0-update
...
updating Eigen to 3.4.0
2024-02-28 18:03:37 -05:00
Peter Boyle
ee1b8bbdbd
Merge pull request #454 from edbennett/adjoint-broke
...
fix HMC for non-fundamental representations
2024-02-28 14:05:27 -05:00
Peter Boyle
3f1636637d
Merge pull request #453 from dbollweg/feature/sliceSum_gpu
...
Feature/slice sum gpu
2024-02-28 14:04:43 -05:00
Peter Boyle
2e570f5300
Merge pull request #457 from lehner/feature/gpt
...
Import GPT-related updates
2024-02-28 13:59:04 -05:00
Christoph Lehner
9f89486df5
remove unnecessary code path
2024-02-28 19:56:23 +01:00
Christoph Lehner
22b43b86cb
Make GPT test suite work with SYCL
2024-02-28 12:57:17 +01:00
dbollweg
3c9012676a
CUDA cub refuses to reduce vSpinColourMatrix, breaking up into smaller parts like already done for HIP case.
2024-02-27 12:41:45 -05:00
Dennis Bollweg
b507fe209c
Added SpinColourMatrix case to sliceSum Test
2024-02-27 11:28:32 -05:00
Dennis Bollweg
6cd2d8fcd5
Replace cuda/hip memcpy with Grid functions
2024-02-26 09:55:07 -05:00
david clarke
b02d022993
fixed race condition (thx michael)
2024-02-23 17:14:28 -07:00
david clarke
94581e3c7a
accelerator_for is broken
2024-02-23 15:58:33 -07:00
david clarke
88b52cc045
Merge branch 'develop' into hisq_fat_links
2024-02-23 14:47:15 -07:00
dbollweg
0a816b5509
Merge branch 'feature/sliceSum_gpu' of https://github.com/dbollweg/Grid into feature/sliceSum_gpu
2024-02-22 21:43:06 -05:00
dbollweg
1c8b807c2e
free malloc'd memory
2024-02-22 21:42:44 -05:00
Christoph Lehner
66391f84f2
Merge branch 'feature/gpt' of ../Grid into develop
2024-02-21 19:05:00 +01:00
97f7a9ecb3
fix HMC for non-fundamental representations
2024-02-21 08:27:55 +00:00
Dennis Bollweg
15878f7613
sliceSumReduction_cub_large now also faster than CPU on Frontier
2024-02-16 13:55:21 -05:00
dbollweg
e0d5e3c6c7
Merge branch 'paboyle:develop' into feature/sliceSum_gpu
2024-02-16 13:16:37 -05:00
dbollweg
6f3455900e
Adding sliceSumReduction_cub_small/large since hipcub cannot deal with arb. large vobjs
2024-02-16 13:15:02 -05:00
david clarke
56827d6ad6
accelerator_inline bug
2024-02-14 13:56:57 -07:00
73c0b29535
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2024-02-13 20:19:32 +00:00
303b83cdb8
Scaling benchmarks, verbosity and MPICH aware in acceleratorInit()
...
For some reason Dirichlet benchmark fails on several nodes; need to
debug this.
2024-02-13 19:48:03 +00:00
5ef4da3f29
Silence verbose
2024-02-13 19:47:36 +00:00
1502860004
Benchmark scripts
2024-02-13 19:47:02 +00:00
585efc6f3f
More benchmark scripts
2024-02-13 19:40:49 +00:00
62055e04dd
missing semicolon generates error with some compilers
2024-02-13 18:18:27 +01:00
e4a641b64e
removing old Eigen tensor patch
2024-02-13 10:37:14 +01:00
8849f187f1
updating Eigen to 3.4.0
2024-02-13 10:30:22 +01:00
david clarke
db420525b3
fix Simd::Nsimd typo
2024-02-12 15:03:53 -07:00
dbollweg
b5659d106e
more test cases
2024-02-09 13:37:14 -05:00
dbollweg
4b43307402
Undo include path changes for level zero api header
2024-02-09 13:07:56 -05:00
dbollweg
09af8c25a2
Merge branch 'paboyle:develop' into feature/sliceSum_gpu
2024-02-09 13:02:59 -05:00
dbollweg
9514035b87
refactor slicesum: slicesum uses GPU version by default now
2024-02-09 13:02:28 -05:00
david clarke
2da09ae99b
acceleration compiles and doesn't break scalar mode
2024-02-06 18:40:13 -07:00
david clarke
a38fb0e04a
first effort toward accelerators
2024-02-06 18:24:55 -07:00
7019916294
RNG seed change safer for large volumes; this is a long term solution
2024-02-07 00:56:39 +00:00
dbollweg
1514b4f137
slicesum_sycl passes test
2024-02-06 19:08:44 -05:00
91cf5ee312
Updated bench script
2024-02-06 23:45:10 +00:00
david clarke
0a6e2f42c5
small amount of cleanup
2024-02-06 16:32:07 -07:00
dbollweg
ab2de131bd
work towards sliceSum for sycl backend
2024-02-06 13:24:45 -05:00
5bfa88be85
Aurora MPI standalone benchmake and options that work well
2024-02-06 16:28:40 +00:00
Dennis Bollweg
5af8da76d7
Fix cuda compilation of Lattice_slicesum_gpu.h
2024-02-01 18:02:30 -05:00
Dennis Bollweg
b8b9dc952d
Async memcpy's and cleanup
2024-02-01 17:55:35 -05:00
Dennis Bollweg
79a6ed32d8
Use accelerator_for2d and DeviceSegmentedRecude to avoid kernel launch latencies
2024-02-01 16:41:03 -05:00
dbollweg
caa5f97723
Add sliceSum gpu using cub/hipcub
2024-01-31 16:50:06 -05:00
david clarke
4924b3209e
projectU3 yields a unitary matrix
2024-01-23 14:43:58 -07:00
david clarke
00f24f8765
already found some bugs in projection, still needs testing
2024-01-22 05:50:16 -07:00
david clarke
f5b3d582b0
first attempt at U3 projection
2024-01-22 02:49:40 -07:00
david clarke
981c93d67a
update Test_fatLinks to accept Naik
2024-01-21 21:09:19 -07:00
david clarke
c020b78e02
Merge branch 'develop' into hisq_fat_links
2024-01-21 20:21:08 -07:00
2a0d75bac2
Aurora files
2023-12-21 23:20:17 +00:00
Peter Boyle
f48298ad4e
Bug fix
2023-12-11 20:57:02 -05:00
root
645e47c1ba
Config for Ampere Altra ARM
2023-12-08 16:17:56 -05:00
Peter Boyle
d1d9827263
Integrator logging update
2023-12-08 12:14:00 -05:00
Peter Boyle
14643c0aab
SDCC benchmarking scripts for A100 nodes and IceLake nodes (AVX512)
2023-12-04 15:45:57 -05:00
Peter Boyle
b77a9b8947
SDDC compiles starting
2023-11-30 14:31:51 -05:00
Peter Boyle
7d077fe493
Frontier compiel
2023-11-09 13:58:44 -05:00
david clarke
9cd4128833
fix naik bug
2023-11-03 14:11:38 -06:00
david clarke
c8b17c9526
Naik to CShift
2023-11-02 12:43:22 -06:00
david clarke
2ae2a81e85
attempt to fix Naik
2023-10-31 13:54:55 -06:00
david clarke
69c869d345
fixed stupid typo
2023-10-30 17:41:52 -06:00
david clarke
df9b958c40
naik now returns separately
2023-10-30 17:40:53 -06:00
david clarke
3d3376d1a3
LePage works, trying Naik
2023-10-27 16:26:31 -06:00
Christoph Lehner
f2648e94b9
getHostPointer added to Lattice
2023-10-23 13:47:41 +02:00
david clarke
21ed6ac0f4
added floating-point support
2023-10-20 13:54:26 -06:00
david clarke
7bb8ab7000
improve smearing templating
2023-10-20 08:41:02 -06:00
david clarke
2c824c2641
Merge branch 'develop' into hisq_fat_links
2023-10-17 16:03:59 -06:00
david clarke
391fd9cc6a
try lepage term
2023-10-17 14:57:15 -06:00
Peter Boyle
51051df62c
3GeV run setup
2023-10-16 20:49:52 +03:00
Peter Boyle
33097681b9
FTHMC compiled and merged to develop
2023-10-14 00:42:55 +03:00
Peter Boyle
07e4900218
FTHMC commit
2023-10-13 18:21:57 +03:00
Peter Boyle
36ab567d67
FTHMC 3 Gev
2023-10-13 18:21:57 +03:00
Peter Boyle
e19171523b
FTHMC Status at lattice conference commit
2023-10-13 18:21:56 +03:00
Peter Boyle
9626a2c7c0
Asynch handling
2023-10-13 18:21:56 +03:00
Peter Boyle
e936f5b80b
IfGridTensor shorthand
2023-10-13 18:21:56 +03:00
Peter Boyle
ffc0639cb9
Running in HMC tests
2023-10-13 18:21:56 +03:00
Peter Boyle
c5b43b322c
traceProduct eliminates non-contributing intermediate terms
2023-10-13 18:21:56 +03:00
Peter Boyle
c9c4576237
Improved frontier cshift
2023-10-13 18:21:56 +03:00
david clarke
bf4369f72d
clean up HISQSmear with decltypes
2023-10-12 12:41:06 -06:00
david clarke
36600899e2
working 7-link; Grid_log; generalShift
2023-10-12 11:11:39 -06:00
david clarke
b9c70d156b
Merge branch 'develop' into hisq_fat_links
2023-10-10 22:44:17 -06:00
david clarke
eb89579fe7
Merge remote-tracking branch 'origin/develop' into develop
2023-10-10 22:43:51 -06:00
david clarke
0cfd13d18b
7-link working
2023-10-10 22:41:52 -06:00
Christoph Lehner
e6ed516052
merged
2023-10-08 09:00:37 +02:00
Christoph Lehner
e2a3dae1f2
Option for multiple simultaneous CartesianStencils
2023-10-08 08:58:44 +02:00
Peter Boyle
6d0c2de399
Deprecate teh PVC directory and make a PVC-OEM generic PVC target with
...
no queueing system dependency -- just interactive scripts
2023-10-03 17:04:20 +00:00
Peter Boyle
7786ea9921
Bug fix in script
2023-10-03 09:58:44 -07:00
Peter Boyle
d93eac7b1c
Performance regressed and is OK in icpx 2023.2
2023-10-03 15:53:14 +00:00
Peter Boyle
afc316f501
Rename headers
2023-10-02 16:25:11 -04:00
Peter Boyle
f14bfd5c1b
Relocate sub includes
2023-10-02 16:23:38 -04:00
Peter Boyle
c5f1420dea
Merge remote-tracking branch 'LupoA/develop' into LupoA-develop
2023-10-02 16:22:35 -04:00
Peter Boyle
018e6da872
Merge pull request #440 from giltirn/feature/paddedcellgauge
...
Feature/paddedcellgauge
2023-10-02 10:00:42 -04:00
Peter Boyle
b77bccfac2
Merge pull request #444 from mmphys/feature/docX
...
Update doc complete list of Macports needed to build Grid on a fresh Mac
2023-10-02 09:57:11 -04:00
Peter Boyle
80359e0d49
Bland SYCL compile
2023-09-26 13:20:27 -07:00
Peter Boyle
3d437c5cc4
Making SYCL happy
2023-09-26 13:19:42 -07:00
david clarke
63d9b8e8a3
Merge remote-tracking branch 'origin/develop' into hisq_fat_links
2023-09-16 23:20:31 -06:00
david clarke
d247031c98
try 7-link
2023-09-16 23:18:16 -06:00
Peter Boyle
b8a7004365
Partial fraction test
2023-08-14 15:17:03 -04:00
david clarke
affff3865f
Merge branch 'develop' into hisq_fat_links
2023-08-11 23:08:04 -06:00
david clarke
9c22655b5a
Merge remote-tracking branch 'origin/develop' into develop
2023-08-11 23:06:42 -06:00
david clarke
99d879ea7f
5-link first attempt
2023-08-11 22:56:30 -06:00
Michael Marshall
bd56c95a6f
Update documentation with complete list of Macports needed to build Grid on a fresh Mac
2023-07-14 13:50:06 +01:00
Peter Boyle
994512048e
Merge pull request #439 from felixerben/bugfix/IRL_convergence
...
Bugfix/irl convergence
2023-07-12 16:32:26 -04:00
chillenzer
dbd8bb49dc
Merge pull request #32 from LupoA/sp2n/develop
...
Sp2n/develop
2023-07-04 15:23:43 +00:00
Julian Lenz
3a29af0ce4
Fixed linker error
2023-07-04 16:08:44 +01:00
Julian Lenz
f7b79cdd45
Added test for ProjectSpn
2023-07-03 18:00:32 +01:00
Alessandro Lupo
075b9d22d0
adjoint rep implemented as 2indx symmetric
2023-07-02 13:58:31 +01:00
Alessandro Lupo
b92428f05f
better test
2023-07-02 13:34:03 +01:00
Alessandro Lupo
34b11864b6
prettiest tests
2023-07-02 13:25:57 +01:00
Christopher Kelly
1dfaa08afb
The stencils for the staple and rect-staple padded cell implementations are now created and stored by workspace classes that allow for reuse providing the grids remain consistent
...
The workspaces are now used by the plaq+rectangle gauge action resulting in a further 2x performance improvement as measured on a 16^4 local volume for 2 nodes (16 ranks) of Crusher
2023-06-28 15:11:24 -04:00
david clarke
9d263d9a7d
fix bug in HISQSmearing; move benchmark b/c i don't understand how makefiles work
2023-06-28 10:05:34 -06:00
david clarke
9015c229dc
add benchmark to see whether matrix multiplication is slower than read from object
2023-06-27 21:28:26 -06:00
Christopher Kelly
f44dce390f
Implemented acclerator-optimized versions of localCopyRegion and insertSliceLocal to speed up padding
...
Fixed const correctness on PaddedCell methods
Fixed compile issues on Crusher
Added timing breakdowns for PaddedCell::Expand and the padded implementations of the staples, visible under --log Performance
Optimized kernel for StaplePadded
Test_iwasaki_action_newstaple now repeats the calculation 10 times and reports average timings
2023-06-27 14:58:10 -04:00
Christopher Kelly
bb71e9a96a
Added PaddedCell and GeneralisedLocalStencil header includes to standard base headers
...
Moved versions of the padded-cell implementations of staple and rect-staple from test code to WilsonLoops header
Added StapleAndRectStapleAll which is now called by the plaq+rectangle action class. Under the hood it uses the padded cell implementations with maximal reuse of the padded gauge links
2023-06-27 11:23:30 -04:00
78bae9417c
returning Nstop vectors even if not all meet true convergence criterion
2023-06-27 14:38:19 +01:00
dd170ead01
whitespace
2023-06-27 11:37:01 +01:00
014704856f
do one more iteration if not all vectors converged
2023-06-27 11:33:30 +01:00
david clarke
a7eabaad56
rudimentary appendShift convenience method, which allows the user to append an arbitrary shift in one line
2023-06-26 23:59:28 -06:00
david clarke
eeb4703b84
develop wrappers to make the stencils easier to construct
2023-06-26 17:45:35 -06:00
david clarke
a07421b3d3
Merge branch 'develop' into hisq_fat_links
2023-06-26 13:51:32 -06:00
david clarke
cda53b4068
Merge remote-tracking branch 'origin/develop' into develop
2023-06-26 13:51:06 -06:00
Christopher Kelly
6f6844ccf1
Added new StapleAll and RectStapleAll functions that return the staples for all mu as an array
...
Modified plaq+rectangle gauge actions to use the above
Added a test code to confirm the above changes
2023-06-26 15:48:47 -04:00
Christopher Kelly
4c6613d72c
Modified RectStapleDouble and RectStapleOptimised to use Gauge-BC respecting CshiftLink
...
Added test code tests/debug/Test_optimized_staple_gaugebc demonstrating equivalence of above to RectStapleUnoptimised for cconj gauge BCs
Removed optimized staple only being used for periodic gauge BCs; it is now always used
2023-06-26 10:20:23 -04:00
Peter Boyle
ee92e08edb
Merge pull request #435 from fjosw/fix/warnings_in_WilsonKernelsImplementation
...
Unused variable in WilsonKernelsImplementation
2023-06-23 11:47:19 -04:00
Peter Boyle
c1dcee9328
Merge pull request #437 from fjosw/fix/stencil_debug
...
Added GridLogDebug to BuildSurfaceList debug message
2023-06-23 11:47:00 -04:00
Alessandro Lupo
559257bbe9
better documentation and filelist names
2023-06-23 16:16:48 +01:00
Peter Boyle
6b150961fe
Better script
2023-06-23 18:09:25 +03:00
Alessandro Lupo
cff1f8d3b8
rm unused variables and formatting
2023-06-23 16:04:18 +01:00
Alessandro Lupo
f27d2083cd
adjustments in SUn and Sp2n impl
2023-06-23 15:34:08 +01:00
Christopher Kelly
36cc9c524f
Threaded the constructor of GeneralLocalStencil
2023-06-23 09:57:38 -04:00
Alessandro Lupo
2822487450
rm unncessary line
2023-06-23 14:55:23 +01:00
Alessandro Lupo
e07fafe46a
minor adjustments to twoindex
2023-06-23 12:18:04 +01:00
Alessandro Lupo
063d290bd8
missing function
2023-06-23 11:11:20 +01:00
Alessandro Lupo
4e6194d92a
Avoid code duplication in ProjectSUn
2023-06-23 11:03:50 +01:00
Alessandro Lupo
de30c4e22a
minor improvements
2023-06-23 10:49:41 +01:00
david clarke
df99f227c1
include missing staple orientations; invert path direction, which was backwards
2023-06-22 14:57:10 -06:00
Peter Boyle
5bafcaedfa
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-06-22 19:59:45 +03:00
Peter Boyle
bfeceae708
FTHMC
2023-06-22 12:58:18 -04:00
Peter Boyle
eacb66591f
Config command
2023-06-22 19:56:40 +03:00
Peter Boyle
fadaa85626
Update
2023-06-22 19:56:27 +03:00
Peter Boyle
02a5b0d786
Updating run during testing
2023-06-22 19:52:46 +03:00
Peter Boyle
0e2141442a
Dennis says broken
2023-06-22 19:19:51 +03:00
Peter Boyle
769eb0eecb
Precision coverage
2023-06-22 19:19:20 +03:00
Christopher Kelly
4241c7d4a3
Imported coalescedReadGeneralPermute GPU implementation from Christoph
...
Fixed bug in padded staple code where extract was being called on the result before the GPU view was closed
Fixed compile issue with pointer cast in padded staple code
Added timing summaries of padded staple code and timing breakdown of staple implementation to Test_padded_cell_staple
2023-06-21 16:01:01 -04:00
david clarke
d536c67b9d
add HISQSmearing to Smearing.h
2023-06-20 16:04:48 -06:00
david clarke
f44f005dad
rename _lvl1 --> _linkTreatment
2023-06-20 15:48:27 -06:00
david clarke
26b2caf570
add template parameter to Smear_HISQ_fat for MILC interfacing
2023-06-20 15:37:54 -06:00
Christopher Kelly
7b11075102
The user can now specify the implementation of Cshift used by the PaddedCell class through a virtual base class API. Implementations for default (regular Cshift) and for gauge links (which respects the gauge BCs)
...
Fixed const-correctness for PaddedCell and ConjugateGimpl::setDirections
Modified test code for padded-cell implementation of staple, rect-staple to use cconj BCs
2023-06-20 17:09:56 -04:00
Christopher Kelly
abc658dca5
Added coalescedReadGeneralPermute CPU implementation based on Christoph's GPT code
...
In a test code, implemented a padded-cell version of the staple and rectangular-staple calculation
2023-06-20 16:14:25 -04:00
david clarke
8bb078db25
Merge branch 'develop' into hisq_fat_links
2023-06-20 13:05:00 -06:00
david clarke
b61ba40023
Merge remote-tracking branch 'origin/develop' into develop
2023-06-20 13:04:53 -06:00
Christoph Lehner
452bf2e907
Accelerator basisRotate also on HIP
2023-06-20 20:36:24 +03:00
Alessandro Lupo
2372275b2c
Merge pull request #36 from LupoA/sp2n/gpu-bugfix
...
Sp2n/gpu bugfix [close #30 ]
2023-06-20 13:46:00 +01:00
chillenzer
ef736e8aa4
Merge pull request #35 from LupoA/sp2n/enableSp
...
consistent enable sp config flag
2023-06-20 10:41:09 +00:00
Julian Lenz
5e539e2d54
Forgot some follow-ups on changed signature
2023-06-18 12:37:51 +01:00
Julian Lenz
96773f5254
Apparently forgot to remove one Lattice version
2023-06-18 12:21:39 +01:00
Alessandro Lupo
d80df09f3b
consistent enable sp config flag
2023-06-16 19:16:46 +01:00
Julian Lenz
621e612c30
Fix non-zero ret on device bug
2023-06-16 16:27:49 +01:00
Julian Lenz
8c3792721b
ClangFormat
2023-06-16 15:58:23 +01:00
Julian Lenz
c95bbd3948
Remove accelerated lattice version
2023-06-16 15:50:26 +01:00
Julian Lenz
e28ab7a732
Re-included instantiations for symmetric 2Index AS Sp
2023-06-16 14:20:37 +01:00
Alessandro Lupo
c797cbe737
deal with post-merge trauma
2023-06-16 14:20:37 +01:00
Alessandro Lupo
e09dfbf1c2
definetely the right merge upstream/develop
2023-06-16 14:19:46 +01:00
85e35c4da1
fix: added GridLogDebug to BuildSurfaceList debug message.
2023-06-16 10:31:16 +01:00
Peter Boyle
d72e914cf0
Profiling temporary code until optimised
2023-06-15 10:43:04 -04:00
Peter Boyle
3b5254e2d5
Optional checkpoint smeared configs for FTHMC
2023-06-15 10:43:04 -04:00
Peter Boyle
f1c358b596
Additional tests
2023-06-15 10:43:04 -04:00
Peter Boyle
c0ef210265
Hot start should be properly Hot
2023-06-15 10:43:04 -04:00
Peter Boyle
e3e1cc1962
Ta project
2023-06-15 10:43:04 -04:00
Peter Boyle
723eadbb5c
Keep methods virtual
2023-06-15 10:43:04 -04:00
Peter Boyle
e24637ec1e
Clean up
2023-06-15 10:43:04 -04:00
Peter Boyle
8b01ff4ce7
Integrator over to smeared force structure
2023-06-15 10:43:04 -04:00
Peter Boyle
588197c487
Smeared action virtual class
2023-06-15 10:43:04 -04:00
Julian Lenz
116d90b0ee
First attempt on #30
2023-06-15 15:09:37 +01:00
Julian Lenz
b0646ca187
Remove some unused variables
2023-06-15 15:09:09 +01:00
Peter Boyle
1352bad2e4
Sunspot compile
2023-06-15 11:22:46 +00:00
david clarke
14d352ea4f
added smearParams struct
2023-06-12 16:55:44 -06:00
david clarke
1cf9ec1cce
now compiles
2023-06-09 16:27:45 -06:00
chillenzer
4895ff260e
Merge pull request #28 from LupoA/sp2n/config
...
compile sp2n fermion impl only if declared at config time
2023-06-09 13:07:48 +00:00
david clarke
4b994a1bc7
trouble with compilation
2023-06-08 17:37:25 -06:00
david clarke
e506d6d369
Merge branch 'develop' into hisq_fat_links
2023-06-07 21:16:20 -06:00
david clarke
ab56ad8d7a
fix 3-link stencil
2023-06-07 21:14:58 -06:00
Alessandro Lupo
470d93006a
compile sp2n fermion impl only if declared at config time
2023-06-07 12:53:33 +01:00
chillenzer
2f3d03f188
Merge pull request #27 from LupoA/sp2n/documentation
...
documentation for gaugegroup and sp2n
2023-06-01 16:42:27 +00:00
Alessandro Lupo
8db7c23bee
improve documentation
2023-06-01 17:39:10 +01:00
chillenzer
69dc5172dc
Merge pull request #26 from LupoA/sp2n/irreps
...
Sp2n/irreps
2023-06-01 16:28:15 +00:00
Julian Lenz
fd72eb6546
Merge branch 'sp2n/algorithm' into sp2n/irreps
2023-06-01 17:24:01 +01:00
Peter Boyle
ffd7301649
Updated masked / fthmc smeared config container
2023-06-01 06:23:02 -04:00
Peter Boyle
d2a8494044
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-06-01 06:22:33 -04:00
Peter Boyle
0982e0d19b
Jacobian action wrapper for FTHMC
2023-06-01 06:15:08 -04:00
Peter Boyle
3badbfc3c1
Refactor the Action and Smeared gauge configuration containers. Add first pass at FTHMC action
2023-06-01 06:14:28 -04:00
Peter Boyle
5465961e30
New test for FTHMC portion
2023-06-01 06:14:04 -04:00
477b794bc5
fix: unused variable removed.
2023-05-29 14:08:53 +01:00
Christoph Lehner
e8c29e2fe5
Merge pull request #31 from paboyle/develop
...
Sync
2023-05-28 16:13:12 +02:00
Peter Boyle
4835fd1a87
HIP stream synch
2023-05-27 17:58:22 +03:00
Peter Boyle
6533c25814
Lumi
2023-05-27 16:13:32 +03:00
Alessandro Lupo
b405767569
make private methods private
2023-05-26 17:02:16 +01:00
Alessandro Lupo
fe88a0c12f
cleaner twoindex class, cleaner tests
2023-05-26 16:55:30 +01:00
Alessandro Lupo
e61a9ed2b4
partial revert
2023-05-26 13:54:26 +01:00
Alessandro Lupo
de8daa3824
group is SUn by default
2023-05-26 13:44:41 +01:00
Alessandro Lupo
3a50fb29cb
directly call sp helper
2023-05-26 13:28:47 +01:00
Alessandro Lupo
6647d2656f
rm unnecessary specialisation
2023-05-26 12:27:22 +01:00
Alessandro Lupo
a6f4dbeb6d
remove redundant template parameter
2023-05-26 12:13:40 +01:00
Alessandro Lupo
92a282f2d8
Merge pull request #24 from LupoA/sp2n/fix_static_assert_symmetric
...
Move static_assert inside of function
2023-05-26 11:13:50 +01:00
Alessandro Lupo
ca2fd9fc7b
documentation for gaugegroup and sp2n
2023-05-25 18:40:54 +01:00
david clarke
3825329f8e
Merge branch 'develop' into hisq_fat_links
2023-05-24 15:37:25 -06:00
Alessandro Lupo
be1a4f5860
implement TwoIndexSymm for sp2n
2023-05-22 17:21:03 +01:00
Peter Boyle
1b2914ec09
FT-HMC smearing, derivative chain rule, log det and force first pass.
2023-05-22 10:21:37 -04:00
Peter Boyle
519f795066
Header not liked by gcc on mac? puzzling
2023-05-22 10:21:12 -04:00
Alessandro Lupo
5897b93dd4
debug tests, fix dimension
2023-05-22 13:42:21 +01:00
Alessandro Lupo
af091e0881
DimensionHelper for 2index irreps
2023-05-21 16:56:06 +01:00
Alessandro Lupo
3c1e5e9517
Merge pull request #25 from LupoA/sp2n/unify_representations
...
Sp2n/unify representations [close #3 ]
2023-05-21 14:55:27 +01:00
Alessandro Lupo
85b2cb7a8a
changing some hardcoded SUn lines
2023-05-21 14:50:28 +01:00
david clarke
c7bdf2c0e4
3-link test at least gives an answer
2023-05-21 04:33:20 -06:00
Peter Boyle
4240ad5ca8
Preparing for FTHMC
2023-05-19 21:21:55 -04:00
Peter Boyle
d418347d86
public for convenience to see rho params
2023-05-19 21:21:05 -04:00
Peter Boyle
29a4bfe5e5
Clean up
2023-05-19 21:20:45 -04:00
Peter Boyle
9955bf9daf
Regresses to Qlat
2023-05-19 17:32:13 -04:00
Christoph Lehner
da9cbfc7cc
Suppress BuildSurfaceList verbosity in Stencil.h
2023-05-19 20:22:20 +02:00
Christoph Lehner
6b9f07c1ed
Merge pull request #30 from paboyle/develop
...
Merge upstream
2023-05-19 20:20:58 +02:00
Julian Lenz
b8bdc2eefb
Unified two index representations
2023-05-18 18:36:29 +01:00
Julian Lenz
0078826ff1
Move static_assert inside of function
2023-05-18 18:14:53 +01:00
Julian Lenz
e855c41772
Unified spfundamental.h with fundamental.h
2023-05-18 18:11:20 +01:00
chillenzer
d169c275b6
Merge pull request #22 from LupoA/sp2n/unify_twoindex
...
Unify TwoIndex
2023-05-18 14:55:02 +00:00
Julian Lenz
a5125e23f4
Typo
2023-05-18 15:41:35 +01:00
Julian Lenz
7b83c80757
Merge branch 'sp2n/unify_twoindex' of github.com:LupoA/Grid into sp2n/unify_twoindex
2023-05-18 15:36:14 +01:00
Julian Lenz
e41821e206
Disable two index symmetric
2023-05-18 15:29:55 +01:00
david clarke
bf91778550
verbose plaquette example; fat link test frame
2023-05-17 15:15:54 -06:00
Alessandro Lupo
5a75ab15a2
typo in 2S dim
2023-05-17 20:47:57 +01:00
Alessandro Lupo
932c783fbf
2AS for every Nc!
2023-05-17 20:22:05 +01:00
Julian Lenz
55f9cce577
Revert "Added automated HMC test for Nc=4"
...
This reverts commit eee27b8b30 .
2023-05-17 09:17:48 +01:00
Alessandro Lupo
b3533ca847
correct tests (failing)
2023-05-16 17:43:52 +01:00
Alessandro Lupo
fd2a637010
test 2index
2023-05-16 14:10:39 +01:00
Julian Lenz
eee27b8b30
Added automated HMC test for Nc=4
2023-05-15 18:37:33 +01:00
Julian Lenz
8522352aa3
ClangFormat
2023-05-15 18:36:05 +01:00
Alessandro Lupo
3beb8f4091
fixing typo, getting pre-changes physics
2023-05-15 16:00:15 +01:00
Alessandro Lupo
12a706e9b1
de-hardcode the number of generators
2023-05-15 15:48:21 +01:00
Alessandro Lupo
170aa7df01
fix (dimension to be improved)
2023-05-15 15:20:18 +01:00
Julian Lenz
e8ad1fef53
Unify TwoIndex
2023-05-12 14:35:50 +01:00
Peter Boyle
876c8f4478
Nodes on padded cell
2023-05-11 12:35:49 -04:00
Peter Boyle
9c8750f261
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-05-11 12:29:09 -04:00
Peter Boyle
91efd08179
Option for Qlat generator basis
2023-05-11 12:27:45 -04:00
Peter Boyle
9953511b65
Mac compile
2023-05-11 12:27:29 -04:00
Peter Boyle
025fa9991a
For FTHMC
2023-05-11 12:26:14 -04:00
Peter Boyle
e8c60c355b
Padded cell code
2023-05-11 12:25:50 -04:00
Peter Boyle
6c9c7f9d85
Permute fix
2023-05-11 12:24:21 -04:00
Peter Boyle
f534523ede
Debug
2023-05-11 12:23:11 -04:00
Peter Boyle
1b8a834beb
Debug
2023-05-11 12:22:24 -04:00
Alessandro Lupo
aa9df63a05
rename group projections based on determinants
2023-05-10 14:50:52 +01:00
chillenzer
3953312a93
Merge pull request #20 from LupoA/sp2n/unify_gaugeimpltypes
...
Sp2n/unify gaugeimpltypes
2023-05-03 15:17:10 +00:00
Julian Lenz
6e62f4f616
ClangFormat
2023-05-03 16:15:12 +01:00
Julian Lenz
6a7bdca53b
Take over additional algebra tests from Alessandro
2023-05-03 16:02:02 +01:00
Julian Lenz
c7fba9aace
Take over additional group tests from Alessandro
2023-05-03 16:01:48 +01:00
Julian Lenz
ac6c7cb8d6
Merge in Alessandro's changes [test fails]
2023-05-03 02:53:03 +01:00
Julian Lenz
c5924833a1
ClangFormat
2023-05-03 02:39:36 +01:00
Julian Lenz
ac0a74be0d
Taken care of algebra tests
2023-05-03 02:32:42 +01:00
Julian Lenz
42b0e1125d
Naming and argument types
2023-05-03 01:51:46 +01:00
Julian Lenz
339c4fda79
Extracted is_element_of Sp2n
2023-05-02 15:44:34 +01:00
Alessandro Lupo
9b85bf9402
better projection test
2023-05-02 15:42:20 +01:00
Alessandro Lupo
86b02c3cd8
cleaning up requested by Julian
2023-05-02 13:31:17 +01:00
Alessandro Lupo
7b3b7093fa
cleaning up requested by Ed
2023-05-02 12:50:57 +01:00
Alessandro Lupo
881b08a465
Correct implementation of SpTa
2023-04-27 18:17:06 +01:00
Julian Lenz
3ee5444c69
Remove commented out stuff
2023-04-21 08:08:18 +01:00
Julian Lenz
5e28fe56d2
Remove code duplication: Iterating through vectors
2023-04-21 08:08:06 +01:00
Peter Boyle
3aa43e6065
Debug info
2023-04-20 14:21:13 -04:00
Peter Boyle
78ac4044ff
HMC
2023-04-20 13:28:07 -04:00
Peter Boyle
119c3db47f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-04-18 15:13:16 -04:00
Peter Boyle
21bbdb8fc2
Crusher
2023-04-18 15:11:16 -04:00
Alessandro Lupo
5aabe074fe
Rename Sympl* to Sp*
2023-04-18 11:50:20 +01:00
Peter Boyle
739bd7572c
Example code
2023-04-17 21:51:55 +00:00
Peter Boyle
074627a5bd
Pass file descriptors through AF_UNIX for level_zero
2023-04-17 21:50:52 +00:00
Peter Boyle
6a23b2c599
Drop UVM
2023-04-17 21:49:58 +00:00
Alessandro Lupo
dace904c10
fix typo
2023-04-14 18:06:18 +01:00
Alessandro Lupo
be98d26610
small change I missed in previous commit
2023-04-13 17:48:43 +01:00
Peter Boyle
bd891fb3f5
tests to compile
2023-04-12 18:32:44 -04:00
Peter Boyle
3984265851
Merge pull request #432 from paboyle/hotfix/nvcc-warnings
...
Unused statements generating warnings removed
2023-04-12 16:59:02 -04:00
Peter Boyle
45361d188f
Merge pull request #427 from fjosw/feat/bug_report_issue_template
...
Feat/bug report issue template
2023-04-12 16:58:41 -04:00
Peter Boyle
80c9d77e02
Merge pull request #433 from paboyle/hotfix/virtual-dtor
...
Virtual destructor for LinearOperator
2023-04-12 16:56:18 -04:00
Peter Boyle
3aff64dddb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-04-11 12:19:15 -07:00
Peter Boyle
b4f2ca81ff
Copy queue and compute queue same as better concurrency
2023-04-11 12:18:21 -07:00
Peter Boyle
d1dea5f840
New driver
2023-04-11 12:16:52 -07:00
Peter Boyle
54f8b84d16
Fence
2023-04-11 12:16:08 -07:00
Peter Boyle
da503fef0e
Name change on barrier routine
2023-04-11 12:14:04 -07:00
Peter Boyle
4a6802098a
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-04-07 15:43:28 -04:00
Peter Boyle
f9b41a84d2
Trajectory runs to completion on Crusher within wall clock time
2023-04-07 15:42:45 -04:00
5d7e0d18b9
virtual destructor for LinearOperator
2023-04-07 14:30:38 +01:00
9e64387933
mores unused statements removed
2023-04-07 14:27:18 +01:00
983b681d46
unused statement cleaning
2023-04-07 14:12:02 +01:00
4072408b6f
Update README.md
2023-04-07 11:45:28 +01:00
bd76b47fbf
Update CI badge in README
2023-04-07 11:44:48 +01:00
Christoph Lehner
5f75735dab
Add M and Mdag to WilsonTMFermion
2023-04-06 18:25:05 +02:00
Alessandro Lupo
178376f24b
minor stylistic changes
2023-04-06 12:08:17 +01:00
18ce23aa75
Fix NEON SIMD
2023-04-06 11:30:48 +01:00
chillenzer
6a0eb466ee
Merge pull request #19 from LupoA/refactoring_sp2n
...
refactoring sp2n
2023-04-05 10:50:58 +00:00
Peter Boyle
ffa7fe0cc2
Merge branch 'feature/dirichlet' into develop
2023-04-04 23:13:52 -04:00
Peter Boyle
6b979f0a69
Dirichlet improvements that I failed to commit
2023-04-04 23:13:17 -04:00
Alessandro Lupo
4ea29b8f0f
Template group into GaugeImplTypes. Closing #2
2023-04-04 17:49:28 +01:00
Alessandro Lupo
778291230a
expand ProjecOnGaugeGroup, change ProjectOnSp2nAlgebra into SpTa, fixing some of its issues
2023-04-04 17:48:13 +01:00
Peter Boyle
86dac5ff4f
Better printing
2023-04-04 07:42:19 -07:00
Peter Boyle
4a382fad3f
Use distinct SYCL queue for copies
2023-04-04 07:41:41 -07:00
Peter Boyle
cc753670d9
Barrier elimination, surface list build
2023-04-04 07:39:14 -07:00
Peter Boyle
cc9d88ea1c
Fence changes and EXT kernel loop cout reduction
2023-04-04 07:37:23 -07:00
Peter Boyle
b281b0166e
Put the barrier in the subroutine
2023-04-04 07:36:03 -07:00
Peter Boyle
6a21f694ff
Apply barrier in Gather kernel sequence.
...
Could place before comms, or in Gather, but decided to insist Gather means Gather is done
2023-04-04 07:33:24 -07:00
Peter Boyle
fc4db5e963
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-04-03 18:26:11 -04:00
Peter Boyle
6252ffaf76
No unified
2023-04-03 18:25:22 -04:00
Alessandro Lupo
026e736dfa
Projection on algebra can now be templated. Fix #12
2023-04-03 16:31:19 +01:00
Alessandro Lupo
4275b3f431
Fix typo and remove unnecessary lines
2023-04-03 12:01:52 +01:00
Peter Boyle
af64c1c6b6
Had managed to drop the accelerator_barrier() in the Wilson Compressor gather
2023-03-30 17:34:44 -04:00
Peter Boyle
866f48391a
Temporary fix for develop incorrect results
2023-03-30 17:10:13 -04:00
Peter Boyle
a4df527d74
Merge pull request #428 from mmphys/bugfix/comm_none
...
Fixes for --enable-comms=none
2023-03-30 08:38:14 -04:00
Michael Marshall
5764d21161
Fixes for --enable-comms=none
2023-03-30 10:15:28 +01:00
Peter Boyle
496d04cd85
Weaken the Fence
2023-03-29 18:58:51 -04:00
Peter Boyle
10e6d7c6ce
Merge branch 'feature/dirichlet' into develop
2023-03-29 16:26:47 -04:00
Peter Boyle
c42e25e5b8
Dirichlet remove
2023-03-29 16:25:52 -04:00
Peter Boyle
a00ae981e0
Fence propagation from SYCL
2023-03-29 15:00:40 -04:00
Peter Boyle
58e020b62a
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-03-29 14:37:40 -04:00
Peter Boyle
a7e1aceeca
Compile fix on Nvidia
2023-03-29 14:36:50 -04:00
Peter Boyle
7212432f43
More careful fencing
2023-03-28 20:10:22 -07:00
Peter Boyle
4a261fab30
Changes premerge to develop
2023-03-28 20:04:21 -07:00
Peter Boyle
6af97069b9
Preparing for close of feature/dirichlet
...
Initial code change review complete
2023-03-28 13:39:44 -07:00
Peter Boyle
5068413cdb
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-03-28 08:35:38 -07:00
Peter Boyle
71c6960eea
Commet
2023-03-28 08:34:24 -07:00
Peter Boyle
ddf6d5c9e3
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-03-28 11:33:05 -04:00
39214702f6
feat: indentation fixed.
2023-03-28 16:30:34 +02:00
3e4614c63a
feat: draft for bug-report issue template added.
2023-03-28 16:24:35 +02:00
Peter Boyle
900e01f49b
Temporary
2023-03-27 21:35:06 -07:00
Peter Boyle
2376156fbc
Merge branch 'develop' into feature/dirichlet
2023-03-27 21:33:50 -07:00
Peter Boyle
3f2fd49db4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2023-03-27 17:29:54 -07:00
Peter Boyle
0efa107cb6
Script update
2023-03-27 17:29:43 -07:00
Peter Boyle
8feedb4f6f
Include files moved
2023-03-27 17:29:21 -07:00
Peter Boyle
05e562e3d7
Move the copy synch out to stencil and do one per call instead of one per packet
2023-03-27 17:28:38 -07:00
Peter Boyle
dd3bbb8fa2
MOve the synchronise out to the stencil so one call instead of one call per packet
2023-03-27 17:27:45 -07:00
Peter Boyle
2fbcf13c46
SYCL fix
2023-03-27 14:25:14 -07:00
Peter Boyle
4ea48ef0c4
Merge pull request #419 from lehner/feature/gpt
...
Separate rankSum from sum
2023-03-24 15:42:16 -04:00
Peter Boyle
5c85774ee3
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-03-24 15:40:57 -04:00
Peter Boyle
d8a9a745d8
stream synchronise
2023-03-24 15:40:30 -04:00
Peter Boyle
dcf172da3b
Merge pull request #415 from paboyle/feature/block_lanczos22
...
Feature/block lanczos22
2023-03-24 12:08:16 -04:00
Peter Boyle
d57ed25071
Merge branch 'feature/dirichlet' into feature/block_lanczos22
2023-03-24 12:08:09 -04:00
Peter Boyle
546be724e7
Merge pull request #421 from UniOfLeicester/feature/accel_Copy_plane
...
Populate the Cshift_table in the GPU
2023-03-24 12:04:06 -04:00
Peter Boyle
8a1b9073f9
Mshift update
2023-03-23 15:39:30 -04:00
Peter Boyle
1a7114d4b9
Temporary algorithm while sorting out mixed prec
2023-03-23 15:38:35 -04:00
Peter Boyle
3f385f717c
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
...
Conflicts:
systems/PVC/benchmarks/run-2tile-mpi.sh
systems/PVC/config-command
2023-03-23 14:52:53 -04:00
Peter Boyle
481bbaf1fc
Interface to query memory use
2023-03-23 12:55:31 -04:00
Peter Boyle
281488611a
WriteDiscard on construct
2023-03-23 10:28:50 -04:00
Peter Boyle
c180a52518
Merge branch 'feature/dirichlet' of https://www.github.com/paboyle/Grid into feature/dirichlet
2023-03-23 10:28:01 -04:00
Peter Boyle
90130e25e9
TODO list
2023-03-23 10:27:02 -04:00
Peter Boyle
23298acb81
Merge pull request #424 from giltirn/feature/dirichlet-precchange
...
Precision change implementation
2023-03-22 23:04:52 -04:00
Peter Boyle
52384e34cf
Discard on construct
2023-03-22 19:40:32 -04:00
Peter Boyle
d0bb033ea2
Device resident GPU block buffer instead of UVM as hit likely UVM
...
bug. Code worked on CUDA 11.4 but fails on later drivers (certainly 530.30.02, but need to
find the perlmutter driver version).
2023-03-22 19:07:32 -04:00
Peter Boyle
c6621806ca
Compiling on laptop and running
2023-03-21 17:27:09 -04:00
Peter Boyle
0b6f0f6d2f
Merge branch 'feature/dirichlet' of https://www.github.com/paboyle/Grid into feature/dirichlet
2023-03-21 16:06:55 -04:00
Peter Boyle
b5b759df73
Merge branch 'develop' into feature/dirichlet
2023-03-21 16:05:46 -04:00
Peter Boyle
7db8dd7a95
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2023-03-21 16:04:27 -04:00
Peter Boyle
8b43be39c0
Config command
2023-03-21 16:00:52 -04:00
Peter Boyle
f17f879206
Test update
2023-03-21 15:59:29 -04:00
Peter Boyle
68428fceab
Integrator update
2023-03-21 15:58:49 -04:00
Peter Boyle
4135f2dcd1
Compressor
2023-03-21 15:41:41 -04:00
Peter Boyle
c5bdf61215
AUdit fix
2023-03-21 15:38:39 -04:00
Peter Boyle
88e218e8ee
Stencil updates
2023-03-21 15:37:58 -04:00
Peter Boyle
0f2b786436
Vector -> vector
2023-03-21 15:36:11 -04:00
Peter Boyle
e1c326558a
COmms improvements
2023-03-21 08:53:56 -07:00
Peter Boyle
bae0f8ea99
Merge pull request #425 from rrhodgson/feature/CacheLogging
...
Huge Cache
2023-03-21 08:59:08 -04:00
Peter Boyle
bbbcd36ae5
Merge pull request #426 from rrhodgson/feature/LCDeflation
...
Batched Local Coherence Tools
2023-03-21 08:58:40 -04:00
Peter Boyle
39c0815d9e
WriteDiscard
2023-03-21 08:57:29 -04:00
Alessandro Lupo
1b8176e2c0
fix code duplication
2023-03-17 14:58:00 +00:00
Alessandro Lupo
cbc053c3db
Revert "projection on Sp2n algebra, to be used instead of Ta"
...
This reverts commit ba7f9d7b70 .
2023-03-17 11:36:58 +00:00
Alessandro Lupo
cdf3f6ef6e
Merge branch 'refactoring_sp2n' of https://github.com/LupoA/Grid into refactoring_sp2n
2023-03-15 15:59:50 +00:00
Alessandro Lupo
ba7f9d7b70
projection on Sp2n algebra, to be used instead of Ta
2023-03-15 15:55:12 +00:00
Peter Boyle
a997d24743
Remove nofma
2023-03-14 12:10:31 -07:00
Peter Boyle
861e5d7f4c
SYCL version update. Why do they keep making incompatible changes
2023-03-14 12:10:02 -07:00
Peter Boyle
14cc142a14
Warning remove
2023-03-14 12:09:26 -07:00
Peter Boyle
f36b87deb5
syscall fix
2023-03-14 12:09:00 -07:00
Peter Boyle
eeb6e0a6e3
Renable cache blocking and efficient UPI type SHM comms
2023-03-14 09:10:27 -07:00
Peter Boyle
cad5b187dd
Cleanup
2023-03-14 09:08:16 -07:00
Peter Boyle
87697eb07e
SHared compile
2023-03-14 09:07:36 -07:00
Alessandro Lupo
371fd123fb
consequence of iSUnMatrix being no longer a member of the SU class
2023-03-14 10:47:07 +00:00
Alessandro Lupo
d6ff644aab
Towards the day all tests compile
2023-03-14 10:43:25 +00:00
Julian Lenz
29586f6b5e
Deactivate some tests for Nc!=3
2023-03-13 08:17:14 +00:00
Alessandro Lupo
fd057c838f
add ProjectOnGaugeGroup and ProjectGn to allow future templating in GaugeImplTypes
2023-03-10 12:10:46 +00:00
Alessandro Lupo
f51222086c
Move functions from GaugeGroup to group specific implementations
2023-03-09 16:22:20 +00:00
a3e935c902
Batched block project/promote size checks
2023-02-27 11:38:16 +00:00
7731c7db8e
Add huge cache type and allow Ncache==0
2023-02-26 14:15:28 +00:00
ff97340324
Expose cached bytes
2023-02-26 12:22:45 +00:00
Christopher Kelly
83d86943db
Fixed compile bug in MemoryManagerShared caused by Audit function not being passed a string
2023-02-23 13:09:45 -05:00
Christopher Kelly
e82cf1d311
Further prec-change improvements
...
Mixed prec CG algorithm has been modified to precompute precision change workspaces
As the original Test_dwf_mixedcg_prec has been coopted to do a performance stability and reproducibility test, requiring the single-prec CG to be run 200 times, I have created a new version of Test_dwf_mixedcg_prec in the solver subdirectory that just does the mixed vs double CG test
2023-02-23 09:45:29 -05:00
Christopher Kelly
1db58a8acc
Precision change improvements
...
Added a new, much faster implementation of precision change that uses (optionally) a precomputed workspace containing pointer offsets that is device resident, such that all lattice copying occurs only on the device and no host<->device transfer is required, other than the pointer table. It also avoids the need to unpack and repack the fields using explicit lane copying. When this new precisionChange is called without a workspace, one will be computed on-the-fly; however it is still considerably faster than the original implementation.
In the special case of using double2 and when the Grids are the same, calls to the new precisionChange will automatically use precisionChangeFast, such that there is a single API call for all precision changes.
Reliable update and mixed-prec multishift have been modified to precompute precision change workspaces
Renamed the original precisionChange as precisionChangeOrig
Fixed incorrect pointer offset bug in copyLane
Added a test and a benchmark for precisionChange
Added a test for reliable update CG
2023-02-21 10:52:42 -05:00
920a51438d
Added batched Mixed precision CG
2023-02-14 17:04:13 +00:00
be528b6d27
Add batched block project/promote functions
2023-02-14 14:37:10 +00:00
Alessandro Lupo
f73691ec47
Merge pull request #18 from nickforce989/sp2n/newbranch
...
Sp2n/newbranch
2023-02-13 10:22:27 +01:00
Peter Boyle
ccd21f96ff
Plaquette agreeing and moving to final form (slowly) need to optimise
2023-02-01 22:57:44 -05:00
Peter Boyle
4b90cb8888
First cut passes combining padded cell with general stencil towards fast plaquette and staggered force
2023-02-01 22:14:10 -05:00
Niccolo Forzano
7ebda3e9ec
Merge commit 'b10e1b7bc8bec809f874e9e48a3ccc7b2619c9d1' into sp2n/newbranch
2023-01-19 12:10:18 +00:00
Niccolo Forzano
b10e1b7bc8
Fixed files giving zero force computation on GPU, issue #8
2023-01-18 18:04:47 +00:00
Peter Boyle
796abfad80
Merge pull request #422 from fjosw/fix/NVCC_DIAG_PRAGMA_SUPPORT
...
Disable diagnostic pragma warnings for CUDA 12+
2023-01-17 09:34:49 -05:00
ad0270ac8c
fix: diagnostic pragma warnings fixed for CUDA 12+
2023-01-12 12:36:30 +00:00
Makis Kappas
7d62f1d6d2
Populate the Cshift_table in the GPU
...
Cshift is allocated in Unified memory and used
in the LambdaApply kernels but also populated
from the host. This creates a lot of Unified HtoD
and DtoH mem operations and has a negative effect
in performance. With this commit we populate the
Cshift table in the device with the
populate_Cshift_table() kernel.
2023-01-11 21:26:25 +00:00
Christoph Lehner
458c943987
merged upstream
2022-12-31 11:16:21 +02:00
Christoph Lehner
88015b0858
Split sum in rankSum and GlobalSum
2022-12-26 10:01:32 +01:00
Peter Boyle
4ca1bf7cca
Added gauge invariance test
2022-12-21 07:23:16 -05:00
Peter Boyle
2ff868f7a5
CPU open doesn't need to free space
2022-12-20 05:10:23 -05:00
Peter Boyle
ede02b6883
Memory manager debug Felix case
2022-12-20 05:10:23 -05:00
Peter Boyle
1822ced302
Bug fix
2022-12-20 05:10:23 -05:00
Peter Boyle
37ba32776f
More logging
2022-12-20 05:10:23 -05:00
Peter Boyle
99b3697b03
More loggin
2022-12-20 05:10:23 -05:00
Peter Boyle
43a45ec97b
SSC_START
2022-12-20 05:10:23 -05:00
Peter Boyle
b00a4142e5
A=A fix
2022-12-20 05:10:23 -05:00
Peter Boyle
3791bc527b
Logging pulled in from dirichlet branch
2022-12-20 05:10:23 -05:00
Alessandro Lupo
d7dea44ce7
Merge pull request #17 from chillenzer/unify_gauge_groups
...
Fix compilation error in nvcc (closes #15 )
2022-12-19 16:24:03 +00:00
Peter Boyle
d8c29f5fcf
Updated FFT test for PETSc
2022-12-18 12:05:00 -05:00
Julian Lenz
37b6b82869
Fix file extensions
2022-12-18 16:12:56 +00:00
Julian Lenz
92ad5b8f74
Compiler error fix: NVCC requires names for templ. par.
2022-12-18 15:50:19 +00:00
Peter Boyle
281f8101fe
Matt FFT test
2022-12-17 20:35:33 -05:00
Peter Boyle
472ed2dd5c
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-12-17 20:17:09 -05:00
Peter Boyle
4f85672674
Simpler test for PETSc
2022-12-17 20:16:11 -05:00
Peter Boyle
dc747c54be
Merge branch 'develop' into feature/dirichlet
...
Conflicts:
Grid/qcd/action/fermion/WilsonCompressor.h
Grid/stencil/Stencil.h
2022-12-13 08:24:58 -05:00
Peter Boyle
140684d706
Head to head vs HMC
2022-12-13 08:15:38 -05:00
Peter Boyle
5bb7ba92fa
Test for DDHMC force term
2022-12-13 08:15:11 -05:00
Peter Boyle
b54d0f3c73
Smaller deltaH down to 7000s on t=0.5 trajectory
2022-12-13 08:14:27 -05:00
Peter Boyle
ff6777a98d
Variable depth experiments
2022-12-13 08:13:51 -05:00
Peter Boyle
07acfe89f2
Merge pull request #417 from rrhodgson/feature/fermtoprop
...
Feature/fermtoprop
2022-12-06 12:45:03 -05:00
40234f531f
FermToProp accelerator_for -> thread_for
2022-12-06 17:34:51 +00:00
d49694f38f
PropToFerm fix
2022-12-06 15:48:54 +00:00
Alessandro Lupo
8c80f1c168
Merge pull request #14 from chillenzer/unify_gauge_groups
...
Unify gauge groups (closes #5 )
2022-12-01 17:35:46 +00:00
Chulwoo Jung
dc6a38f177
Minor cleanup
2022-11-30 17:13:12 -05:00
Chulwoo Jung
82c1ecf60f
Block lanczos added
2022-11-30 16:08:40 -05:00
Peter Boyle
67f569354e
Partial dirichlet changes
2022-11-30 15:51:13 -05:00
Peter Boyle
97a098636d
FermToProp
2022-11-30 15:36:35 -05:00
Peter Boyle
e13930c8b2
Faster fermtoprop case
2022-11-30 15:11:29 -05:00
Julian Lenz
0af7d5a793
Rename Grid/qcd/utils/<Group>_impl.h -> Grid/qcd/utils/<Group>.h
2022-11-30 17:12:00 +00:00
Julian Lenz
505fa49983
Renamed SUn.h -> GaugeGroup.h
2022-11-30 17:09:48 +00:00
Julian Lenz
7bcf33def9
Removed Sp2n.h
2022-11-30 16:59:46 +00:00
Julian Lenz
a13820656a
Removed iSUnMatrix, etc.
2022-11-30 15:09:03 +00:00
Julian Lenz
fa71b46a41
Hide nsp
2022-11-30 14:44:23 +00:00
Julian Lenz
b8b3ae6ac1
Make helper functions private
2022-11-30 13:29:14 +00:00
Julian Lenz
55c008da21
Removed forward declaration
2022-11-30 13:12:21 +00:00
Julian Lenz
2507606bd0
With function overloading (still dirty).
2022-11-30 12:54:36 +00:00
Julian Lenz
7c2ad4f8c8
Attempt with SFINAE (failed)
2022-11-30 11:57:39 +00:00
Julian Lenz
54c8025aad
Remove unnecessary pwd in scripts/filelist
2022-11-28 17:50:38 +00:00
Julian Lenz
921e23e83c
Separated out everything SU specific
2022-11-28 17:47:50 +00:00
Julian Lenz
6e750ecb0e
Remove apparently forgotten file
2022-11-28 16:33:46 +00:00
Julian Lenz
b8f1f5d2a3
Introduce GaugeGroup
2022-11-25 17:45:32 +00:00
Julian Lenz
9273f2937c
Autoformat google style
2022-11-25 17:44:08 +00:00
Julian Lenz
1aa28b47ae
Add existing test to check
2022-11-25 17:40:40 +00:00
Julian Lenz
629cb2987a
Fix typo in Makefile.am
2022-11-25 17:40:21 +00:00
Julian Lenz
03235d6368
Fixed type in configure.ac
2022-11-25 16:57:40 +00:00
Alessandro Lupo
22064c7e4c
Fixing #11
2022-11-25 13:10:29 +00:00
Peter Boyle
5fa573dfd3
partial send fix
2022-11-25 00:51:04 -05:00
Peter Boyle
f6402cb6c4
AUDIT removal
2022-11-25 00:50:33 -05:00
Peter Boyle
bae6c263dc
Audit
2022-11-25 00:47:01 -05:00
Peter Boyle
d71672dca9
Bug fix
2022-11-25 00:46:35 -05:00
Peter Boyle
121c9e2ceb
Tracing
2022-11-25 00:45:21 -05:00
Peter Boyle
63a30ae34f
Tracing
2022-11-25 00:45:05 -05:00
Peter Boyle
7d8231ba32
Tracing
2022-11-25 00:44:57 -05:00
Peter Boyle
b690b1cbe9
Audit
2022-11-25 00:43:57 -05:00
Peter Boyle
c0fb20fc03
Audit check for wrongly locked data
2022-11-25 00:43:12 -05:00
Peter Boyle
bc9579dac6
Old code path removed
2022-11-25 00:40:45 -05:00
Peter Boyle
a5c77f8b95
Tracing moved in order
2022-11-25 00:40:27 -05:00
Alessandro Lupo
2de03e5172
Revert "Revert "Fixing issue #11 : consistent use of ncolour and nsp""
...
This reverts commit 3af4929dda .
2022-11-23 19:40:28 +00:00
Alessandro Lupo
3af4929dda
Revert "Fixing issue #11 : consistent use of ncolour and nsp"
...
This reverts commit 1ba429345b .
2022-11-23 19:34:59 +00:00
Alessandro Lupo
1ba429345b
Fixing issue #11 : consistent use of ncolour and nsp
2022-11-23 18:45:01 +00:00
Peter Boyle
3dbfce5223
Tests clean build on HIP
2022-11-16 20:15:51 -05:00
Peter Boyle
e51eaedc56
Making tests compile
2022-11-15 22:58:30 -05:00
Peter Boyle
e2a938e7f7
GPU happy for compile...?
2022-11-15 17:48:18 -05:00
Peter Boyle
ddad25211b
Extra instantiations
2022-11-15 17:47:52 -05:00
Peter Boyle
6209120de9
Fix to GPU compile attempt
2022-11-15 17:25:58 -05:00
Peter Boyle
fe6e8f5ac6
Benchmark_comms fix
2022-11-15 17:00:49 -05:00
Peter Boyle
ee84dcb400
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-11-15 16:41:55 -05:00
Peter Boyle
0ae0e5f436
Partial Dirichlet test
2022-11-15 16:40:38 -05:00
Peter Boyle
e047616571
Multilevel integrator test
2022-11-15 16:39:39 -05:00
Peter Boyle
1af7572c61
Some test HMCs for DDHMC
2022-11-15 16:38:51 -05:00
Peter Boyle
653039695b
Partial dirichlet changes
2022-11-15 16:37:15 -05:00
Peter Boyle
ca62abd203
Record some perturbative free field calculation
2022-11-15 16:36:46 -05:00
Peter Boyle
e74666a09c
Double length vector type for fast precision change
2022-11-15 16:34:21 -05:00
Peter Boyle
45a001e078
Debug compile
2022-11-15 16:27:20 -05:00
Peter Boyle
0352da34f0
Several deleted files
2022-11-15 16:26:49 -05:00
Peter Boyle
7d302a525d
Natural place for this routine is here
2022-11-15 16:24:55 -05:00
Peter Boyle
e2e269e03b
Partial dirichlet BCs
2022-11-15 16:24:26 -05:00
Peter Boyle
0db4f1803f
Partial dirichlet support
2022-11-15 16:23:41 -05:00
Peter Boyle
5fe480d81c
Generic patch
2022-11-15 16:21:45 -05:00
Peter Boyle
0566fc6267
Partial Dirichlet
2022-11-15 16:21:24 -05:00
Peter Boyle
a11c12e2e7
Modifications for partial dirichlet BCs
2022-11-15 16:20:01 -05:00
Peter Boyle
0655dab466
Open MP on host enabled
2022-11-08 13:38:54 -08:00
Peter Boyle
7f097bcc28
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2022-11-08 13:23:40 -08:00
Peter Boyle
5c75aa5008
Device mem
2022-11-08 13:22:57 -08:00
Peter Boyle
1873101362
PVC
2022-11-08 13:22:45 -08:00
Peter Boyle
63fd1dfa62
Config on PVC
2022-11-08 13:22:09 -08:00
Peter Boyle
bd68861b28
SYCL sum
2022-11-08 12:49:26 -08:00
Peter Boyle
82e959f66c
SYCL reduction
2022-11-08 12:45:25 -08:00
Peter Boyle
006268f556
DWF Slow version
2022-11-02 20:24:51 -04:00
Peter Boyle
78acae9b50
Simple DWF for easy check
2022-11-02 20:24:17 -04:00
Peter Boyle
a3927a8a27
Dirichlet
2022-11-02 20:22:27 -04:00
Peter Boyle
d9dd9a5b5f
LLVM update
2022-11-02 19:51:50 -04:00
Peter Boyle
eae1c02111
Bounds check
2022-11-02 19:50:32 -04:00
Peter Boyle
132d841b05
Compile fix
2022-11-02 19:33:22 -04:00
Peter Boyle
62e52de06d
Merge pull request #414 from fjosw/feat/eCloverGPU
...
Compact Exponential Cloverterm on GPU
2022-11-01 09:15:44 -04:00
184adeedb8
feat: renamed open_boundaries to fixedBoundaries
2022-10-26 12:53:46 +01:00
5fa6a8b96d
docs: CompactClover debug info generalized.
2022-10-26 12:41:14 +01:00
a2a879b668
docs: CompactClover Debug Info improved.
2022-10-25 17:20:42 +01:00
9317d893b2
docs: details about inversion of CompactClover term added.
2022-10-25 17:10:06 +01:00
86075fdd45
feat: MassTerm and ExponentiateClover merged into InstantiateClover
2022-10-25 17:05:34 +01:00
b36442e263
feat: CloverHelpers::InvertClover implemented which handles the
...
inversion of the Clover term depending on clover type and the boundary
conditions.
2022-10-25 16:57:01 +01:00
513d797ea6
fix: signature of CompactWilsonCloverHelpers::Exponentiate fixed.
2022-10-25 16:17:22 +01:00
9e4835a3e3
feat: changed CompactWilsonExpClover exponentiation to Taylor expansion
...
with Horner scheme.
2022-10-25 15:19:43 +01:00
Peter Boyle
2e8c3b0ddb
Slow implementation of Shamir DWF
2022-10-18 18:10:01 -04:00
Peter Boyle
991667ba5e
Revert
2022-10-13 18:50:35 -04:00
Peter Boyle
8a07b52009
Dirichlet
2022-10-13 18:44:47 -04:00
Peter Boyle
2bcff94b52
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-10-13 18:42:04 -04:00
Peter Boyle
d089739e2f
Hack for lattice sites
2022-10-13 17:55:50 -04:00
Peter Boyle
204c283e16
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-10-11 14:59:07 -04:00
Peter Boyle
551a5f8dc8
RRII gpu option
2022-10-11 14:44:55 -04:00
Peter Boyle
c82b164f6b
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-10-04 17:41:48 -04:00
Peter Boyle
584a3ee45c
Merge pull request #412 from giltirn/patch/adaptive-wflow
...
Patch/adaptive wflow
2022-10-04 17:23:19 -04:00
Peter Boyle
eec0c9eb7d
Merge pull request #411 from giltirn/patch/dirichlet-fixes
...
Various fixes / changes
2022-10-04 17:22:01 -04:00
Peter Boyle
477ebf24f4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2022-10-04 11:19:43 -07:00
Peter Boyle
0d5639f707
Run script update
2022-10-04 11:13:41 -07:00
Peter Boyle
413312f9a9
Benchmark the halo construction.
...
THe bye counts are out and should be doubled for SIMD directions
2022-10-04 11:12:59 -07:00
Peter Boyle
03508448f8
Remove verbose
2022-10-04 11:12:15 -07:00
Peter Boyle
e1e5c75023
Stencil gather improvements - SVM was running slow and used for a pointer array that wasn't needed to be in SVM
2022-10-04 11:11:10 -07:00
Peter Boyle
9296299b61
Better commenting
2022-10-04 11:10:34 -07:00
Christopher Kelly
66d001ec9e
Refactored Wilson flow class; previously the class implemented both iterative and adaptive smearing, but only the iterative method was accessible through the Smearing base class. The implementation of Smearing also forced a clunky need to pass iterative smearing parameters through the constructor but adaptive smearing parameters through the function call. Now there is a WilsonFlowBase class that implements common functionality, and separate WilsonFlow (iterative) and WilsonFlowAdaptive (adaptive) classes, both of which implement Smearing virtual functions.
...
Modified the Wilson flow adaptive smearing step size update to implement the original Ramos definition of the distance, where previously it used the norm of a difference which scales with the volume and so would choose too coarse or too fine steps depending on the volume. This is based on Chulwoo's code.
Added a test comparing adaptive (with tuneable tolerance) to iterative Wilson flow smearing on a random gauge configuration.
2022-10-03 10:59:38 -04:00
Peter Boyle
fad2f969d9
Summit up to date
2022-09-27 10:58:43 -04:00
Peter Boyle
48165c1dc1
Ticked off a few items
2022-09-27 10:58:00 -04:00
Peter Boyle
25df2d2c3b
Various precision options
2022-09-27 10:57:12 -04:00
Peter Boyle
af9ecb8b41
Current tests compiling
2022-09-27 10:56:55 -04:00
Peter Boyle
234324599e
Double2
2022-09-27 10:56:10 -04:00
Peter Boyle
97448a93dc
Double2 compiles and dslash runs
2022-09-27 10:55:25 -04:00
Peter Boyle
70c83ec3be
More instantiations
2022-09-27 10:54:23 -04:00
Peter Boyle
8f4e2ee545
Double2
2022-09-27 10:53:46 -04:00
Peter Boyle
e8bfbf2f7c
D2 operators
2022-09-27 10:37:45 -04:00
Peter Boyle
9e81b42981
D2 fields
2022-09-27 10:37:19 -04:00
Peter Boyle
6c9eef9726
D2 fields
2022-09-27 10:36:54 -04:00
Peter Boyle
7ffbc3e98e
Double2 improved. REally don't like 'convertType' - localise to a GPT
...
header
2022-09-27 10:35:31 -04:00
Peter Boyle
68e4d833dd
Run through wrapper script
2022-09-23 16:49:29 -04:00
Peter Boyle
a2cefaa53a
Faster
2022-09-23 16:49:14 -04:00
Peter Boyle
a0d682687e
Better logging of Fdt for force gradient
2022-09-23 16:22:53 -04:00
Peter Boyle
eb552c3ecd
dt info
2022-09-23 16:22:28 -04:00
Peter Boyle
97cce103d7
Tolerances control
2022-09-23 16:21:49 -04:00
Peter Boyle
87ac7104f8
Prettier
2022-09-23 16:20:46 -04:00
Peter Boyle
e4c117aabf
Compile fix, multishift mixed prec support
2022-09-23 16:19:27 -04:00
Peter Boyle
5b128a6f9f
MixedPrec Multishift with better precision scheme for GPU
2022-09-23 16:18:47 -04:00
Christopher Kelly
19da647e3c
Added support for non-periodic gauge field implementations in the random gauge shift performed at the start of the HMC trajectory
...
(The above required exposing the gauge implementation to the HMC class through the Integrator class)
Made the random shift optional (default on) through a parameter in HMCparameters
Modified ConjugateBC::CshiftLink such that it supports any shift in -L < shift < L rather than just +-1
Added a tester for the BC-respecting Cshift
Fixed a missing system header include in SSE4 intrinsics wrapper
Fixed sumD_cpu for single-prec types performing an incorrect conversion to a single-prec data type at the end, that fails to compile on some systems
2022-09-09 12:47:09 -04:00
Peter Boyle
1713de35c0
Improved config flags
2022-09-05 21:50:02 -04:00
Peter Boyle
1177b8f661
Merge branch 'develop' into feature/dirichlet
2022-08-31 19:05:57 -04:00
Peter Boyle
442bfb3d42
Merge branch 'develop' into feature/dirichlet
2022-08-31 19:04:19 -04:00
Peter Boyle
e7d9b75fdd
Warning fixes
2022-08-31 19:01:14 -04:00
Peter Boyle
3d0e3ec363
Tracing
2022-08-31 18:31:46 -04:00
Peter Boyle
3c1c51f9aa
Merge branch 'feature/dirichlet-gparity' into feature/dirichlet
2022-08-31 18:25:34 -04:00
Peter Boyle
8cc3c522c3
Merge pull request #409 from giltirn/feature/dirichlet-gparity-stage
...
Import round 5
2022-08-31 18:22:50 -04:00
Peter Boyle
913fbca74a
Merge pull request #410 from gkanwar/photon_and_sha_patches
...
Photon.h and SHA256 patches
2022-08-31 18:01:45 -04:00
Peter Boyle
5c87342108
Used in g-2 sign off
2022-08-31 17:35:32 -04:00
Peter Boyle
66177bfbe2
Used in g-2 sign off
2022-08-31 17:35:07 -04:00
Peter Boyle
5205e68963
RocTX, NVTX, text based self profiling
2022-08-31 17:34:09 -04:00
Peter Boyle
cd5cf6d614
Tracing replaces self timing hooks
2022-08-31 17:33:41 -04:00
Peter Boyle
5abb19eab0
Remove self timing
2022-08-31 17:32:49 -04:00
Peter Boyle
06d7b88c78
Force reporting improved
2022-08-31 17:32:21 -04:00
Peter Boyle
cf72799735
Better action naming
2022-08-31 17:24:11 -04:00
Peter Boyle
cdb8fcc269
Width=4 support. This is too broad; hit it on physical point run.
...
Need to change strategy, I think.
2022-08-31 17:21:33 -04:00
Peter Boyle
b4f4130901
Defer SMP node links until after interior. Allows for DMA overlapping
...
compute
2022-08-31 17:20:21 -04:00
Peter Boyle
bb049847d5
Tracing replaces self timing
2022-08-31 17:19:02 -04:00
Peter Boyle
fd33c835dd
Feynman rule fix and tracing replaces self timing
2022-08-31 17:18:17 -04:00
Peter Boyle
21371a7e5b
Tracing replaces self timing
2022-08-31 17:16:05 -04:00
Peter Boyle
abfaa00d3e
Tracing replaces self timing
2022-08-31 17:15:24 -04:00
Peter Boyle
efee33c55d
Tracing replaces self timing
2022-08-31 17:14:57 -04:00
Peter Boyle
db0fe6ddbb
Tracing replaces self timinng
2022-08-31 17:14:14 -04:00
Peter Boyle
8a9e647120
Tracing replaces self timing
2022-08-31 17:13:44 -04:00
Peter Boyle
e6dcb821ad
Tracing replaces self timing
2022-08-31 17:12:31 -04:00
Peter Boyle
9bff188f02
Tracing replaces self timing
2022-08-31 17:12:05 -04:00
Peter Boyle
111b30ca1d
Tracing replaces self timing
2022-08-31 17:11:48 -04:00
Peter Boyle
24182ca8bf
HIP allows conserved currents.
...
Tracing replaces self timeing
2022-08-31 17:11:18 -04:00
Peter Boyle
ee2d7369b3
Tracing replaces self timing
2022-08-31 17:10:45 -04:00
Peter Boyle
7c686d29c9
Tracing replaces self timing
2022-08-31 17:10:17 -04:00
Peter Boyle
e8a0a1e75d
Tracing replaces self timing hooks
2022-08-31 17:09:47 -04:00
Peter Boyle
730be89abf
Remove timing hooks as tracing replaces
2022-08-31 17:08:44 -04:00
Peter Boyle
f991ad7d5c
Remove timing hooks as tracing replaces
2022-08-31 17:08:18 -04:00
Peter Boyle
b3f33f82f7
Decrease self timing hooks, use nvtx / roctx type tracing hooks instead
2022-08-31 17:06:47 -04:00
Peter Boyle
a34a6e059f
Logging improvement. Sinitial will be used to improve RHMC terms
2022-08-31 17:06:08 -04:00
Peter Boyle
1333319941
Tracing
2022-08-31 17:00:25 -04:00
Peter Boyle
9295ed8d20
Print full memory range
2022-08-31 16:59:51 -04:00
Peter Boyle
19cc7653fb
Tracing
2022-08-31 16:57:51 -04:00
Peter Boyle
5752538661
Tracing
2022-08-31 16:57:32 -04:00
Peter Boyle
ca40a1b00b
Tracing
2022-08-31 16:54:55 -04:00
Peter Boyle
659fac9dfb
Tracing hook
2022-08-31 16:54:25 -04:00
Peter Boyle
4dc3d6fce0
Buy into Nvidia/Rocm etc... tracing.
2022-08-31 16:53:19 -04:00
Gurtej Kanwar
60dfb49afa
Remove FP16 tests when FP16 is disabled
2022-08-21 17:29:55 +02:00
Gurtej Kanwar
554c238359
Update OpenSSL digest to use high-level methods
...
This avoids deprecation warnings when compiling against OpenSSL 3.0
but should still be backwards compatible. It is the recommended way
to use the digest API going forward.
2022-08-21 17:28:57 +02:00
Gurtej Kanwar
f922adf05e
Fix Photon ComplexField type
2022-08-21 16:16:18 +02:00
Peter Boyle
95b640cb6b
10TF/s on 32^3 x 64 on single node
2022-08-04 15:43:52 -04:00
Peter Boyle
2cb5bedc15
Copy stream HIP improvements
2022-08-04 15:24:03 -04:00
Peter Boyle
806b02bddf
Simplify dead code
2022-08-04 15:23:13 -04:00
Peter Boyle
de40395773
More timing. Think I should start to use nvtx and rocmtx ??
2022-08-04 13:37:16 -04:00
Peter Boyle
7ba4788715
Fix
2022-08-04 13:36:44 -04:00
Peter Boyle
06d9ce1a02
Synch ranks on node here for GPU - GPU memcopy
2022-08-04 13:35:56 -04:00
Peter Boyle
75bb6b2b40
Move barrier into the StencilSend begin routine
2022-08-04 13:35:26 -04:00
Peter Boyle
74f10c2dc0
Move barrier into Stencil Send
2022-08-04 13:34:11 -04:00
Peter Boyle
188d2c7a4d
PVC default, ignore ATS
2022-08-02 08:38:53 -07:00
Peter Boyle
17d7177105
Files for SYCL
2022-08-02 08:33:39 -07:00
Peter Boyle
bb0a0da47a
inon blocking caution due to SYCL
2022-08-02 08:09:43 -07:00
Peter Boyle
84110166e4
Fix the fence
2022-08-02 08:00:43 -07:00
Peter Boyle
d32b923b6c
Fencing on a stream in SYCL is needed. Didn't know that ... gulp
2022-08-02 07:58:04 -07:00
Peter Boyle
a93d5459d4
Better mpi request completion
2022-07-28 12:18:35 -04:00
Peter Boyle
9c21add0c6
High res timer replaces getttimeofday
2022-07-28 12:14:03 -04:00
Peter Boyle
639aab6563
High res timer instead of gettimeofday
2022-07-28 12:13:35 -04:00
Peter Boyle
8137cc7049
Allways concurrent comms
2022-07-28 12:01:51 -04:00
Peter Boyle
60e63dca1d
Add memory logging channel
2022-07-28 11:39:15 -04:00
Peter Boyle
486409574e
Expanded cach to avoid any allocs in HMC
2022-07-28 11:38:34 -04:00
Peter Boyle
a913b8be12
Dslash self timing. Might want to not have this
2022-07-28 11:37:55 -04:00
Peter Boyle
2239751850
Better logging
2022-07-28 11:37:36 -04:00
Peter Boyle
9b20f1449c
Better timing
2022-07-28 11:37:12 -04:00
Peter Boyle
b99453083d
Updated timing
2022-07-28 11:37:02 -04:00
Peter Boyle
2ab1af5754
Ensure no synchronize and not optoin dependent
2022-07-19 09:51:06 -07:00
Peter Boyle
5f8892bf03
Mistake pointed out by Camilo
2022-07-19 09:31:51 -07:00
Peter Boyle
f14e7e51e7
Grid accelerator
2022-07-12 10:56:22 -07:00
Peter Boyle
943fbb914d
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-07-11 13:48:42 -04:00
Peter Boyle
ca4603580d
Verbose
2022-07-11 13:48:35 -04:00
Peter Boyle
f73db8f1f3
Synch clocks
2022-07-11 13:47:39 -04:00
Peter Boyle
f7217d12d2
World barrier for clock synch
2022-07-11 13:45:31 -04:00
Peter Boyle
fab50c57d9
More loggin
2022-07-11 18:42:27 +01:00
Peter Boyle
3440534fbf
MixedPrec support
2022-07-10 21:35:18 +01:00
Peter Boyle
177b1a7ec6
Mixed prec
2022-07-10 21:34:10 +01:00
Peter Boyle
58182fe345
Different approach to default dirichlet params
2022-07-10 21:32:58 +01:00
Peter Boyle
1f907d330d
Different default params for dirichlet
2022-07-10 21:31:48 +01:00
Peter Boyle
b0fe664e9d
Better force log info
2022-07-10 21:31:25 +01:00
Peter Boyle
c0f8482402
Remove SSC marks
2022-07-07 17:49:36 +01:00
Peter Boyle
3544965f54
Stream doesn't work
2022-07-07 17:49:20 +01:00
Christopher Kelly
33e4a0caee
Imported changes from feature/gparity_HMC branch:
...
Rework of WilsonFlow class
Fixed logic error in smear method where the step index was initialized to 1 rather than 0, resulting in the logged output value of tau being too large by epsilon
Previously smear_adaptive would maintain the current value of tau as a class member variable whereas smear would compute it separately; now both methods maintain the current value internally and it is updated by the evolve_step routines. Both evolve methods are now const.
smear_adaptive now also maintains the current value of epsilon internally, allowing it to be a const method and also allowing the same class instance to be reused without needing to be reset
Replaced the fixed evaluation of the plaquette energy density and plaquette topological charge during the smearing with a highly flexible general strategy where the user can add arbitrary measurements as functional objects that are evaluated at an arbitrary frequency
By default the same plaquette-based measurements are performed, but additional example functions are provided where the smearing is performed with different choices of measurement that are returned as an array for further processing
Added a method to compute the energy density using the Cloverleaf approach which has smaller discretization errors
Added a new tensor utility operation, copyLane, which allows for the copying of a single SIMD lane between two instances of the same tensor type but potentially different precisions
To LocalCoherenceLanczos, added the option to compute the high/low eval of the fine operator on every restart to aid in tuning the Chebyshev
Added Test_field_array_io which demonstrates and tests a single-file write of an arbitrary array of fields
Added Test_evec_compression which generates evecs using Lanczos and attempts to compress them using the local coherence technique
Added Test_compressed_lanczos_gparity which demonstrates the local coherence Lanczos for G-parity BCs
Added HMC main programs for the 40ID and 48ID G-parity lattices
2022-07-01 14:12:12 -04:00
Peter Boyle
1f903d9296
Merge branch 'feature/dirichlet' into feature/dirichlet-gparity
2022-07-01 12:12:50 -04:00
Peter Boyle
4df1e0987f
Merge branch 'feature/dirichlet-gparity' of https://github.com/paboyle/Grid into feature/dirichlet-gparity
2022-07-01 09:55:43 -04:00
Peter Boyle
588c2f3cb1
Faster axpy_norm and innerProduct
2022-07-01 09:44:58 -04:00
Peter Boyle
bd99fd608c
Introduce a non-default stream for compute operatoins
2022-07-01 09:42:53 -04:00
Peter Boyle
57b442d0de
Log memory operations
2022-07-01 09:42:17 -04:00
Peter Boyle
751a4562d7
Timing improvement
2022-07-01 09:41:43 -04:00
Peter Boyle
ca66301dee
Remove debug
2022-06-30 14:53:12 -04:00
Peter Boyle
808bb59206
Mixed prec DD-RHMC
2022-06-30 13:50:09 -04:00
Peter Boyle
4b7f51d19d
Create a new RNG file
2022-06-30 13:49:50 -04:00
Peter Boyle
d03152fac4
New file under debug
2022-06-30 13:49:35 -04:00
Peter Boyle
137f190258
Dirichlet implementation
2022-06-30 13:45:07 -04:00
Peter Boyle
53d01312b3
Rough flop counting, need to add M5D, M5Ddag, MooeeInv flops
2022-06-30 13:44:09 -04:00
Peter Boyle
220050822a
Speed up M5D and M5Ddag
2022-06-30 13:43:27 -04:00
Peter Boyle
87ad76d81b
Initialise timeval
2022-06-30 13:42:46 -04:00
Peter Boyle
042ab1a052
Update GridStd.h
2022-06-27 13:21:39 -04:00
Peter Boyle
4ac1094856
Updated config commands
2022-06-27 12:16:24 -04:00
Peter Boyle
d44a57b0af
Allow frequency=0 to disable
2022-06-27 12:15:55 -04:00
Peter Boyle
dc000d10ee
Spelling correction
2022-06-27 12:14:57 -04:00
Peter Boyle
3685f391cf
More verbose CG
2022-06-27 12:11:08 -04:00
Peter Boyle
efd7338a00
Allow dirichlet at round the world link
2022-06-27 12:10:27 -04:00
Peter Boyle
e1e7b1e224
RNG fix
2022-06-27 12:09:52 -04:00
Peter Boyle
7319d4e1ad
Merge pull request #407 from giltirn/feature/dirichlet-gparity-stage
...
Import round 4
2022-06-22 15:23:36 -04:00
Christopher Kelly
fd933420c6
Imported changes from feature/gparity_HMC branch:
...
Added a bounds-check function for the RHMC with arbitrary power
Added a pseudofermion action for the rational ratio with an arbitrary power and a mixed-precision variant of the same. The existing one-flavor rational ratio class now uses the general class under the hood
To support testing of the two-flavor even-odd ratio pseudofermion, separated the functionality of generating the random field and performing the heatbath step, and added a method to obtain the pseudofermion field
Added a new HMC runner start type: CheckpointStartReseed, which reseeds the RNG from scratch, allowing for the creation of new evolution streams from an existing checkpoint. Added log output of seeds used when the RNG is seeded.
EOFA changes:
To support mixed-precision inversion, generalized the class to maintain a separate solver for the L and R operators in the heatbath (separate solvers are already implemented for the other stages)
To support mixed-precision, the action of setting the operator shift coefficients is now maintained in a virtual function. A derived class for mixed-precision solvers ensures the coefficients are applied to both the double and single-prec operators
The ||^2 of the random source is now stored by the heatbath and compared to the initial action when it is computed. These should be equal but may differ if the rational bounds are not chosen correctly, hence serving as a useful and free test
Fixed calculation of M_eofa (previously incomplete and #if'd out)
Added functionality to compute M_eofa^-1 to complement the calculation of M_eofa (both are equally expensive!)
To support testing, separated the functionality of generating the random field and performing the heatbath step, and added a method to obtain the pseudofermion field
Added a test program which computes the G-parity force using the 1 and 2 flavor implementations and compares the result. Test supports DWF, EOFA and DSDR actions, chosen by a command line option.
The Mobius EOFA force test now also checks the rational approximation used for the heatbath
Added a test program for the mixed precision EOFA compared to the double-prec implementation,
G-parity HMC test now applied GPBC in the y direction and not the t direction (GPBC in t are no longer supported) and checkpoints after every configuration
Added a test program which computes the two-flavor G-parity action (via RHMC) with both the 1 and 2 flavor implementations and checks they agree
Added a test program to check the implementation of M_eofa^{-1}
2022-06-22 10:27:48 -04:00
Peter Boyle
8208a6214f
Merge branch 'feature/dirichlet-gparity' into feature/dirichlet
2022-06-15 19:23:48 -04:00
Peter Boyle
3d8146b596
Merge branch 'feature/dirichlet-gparity' of https://github.com/paboyle/Grid into feature/dirichlet-gparity
2022-06-15 19:20:27 -04:00
Peter Boyle
31efa5c4da
Script updates for current summit
2022-06-15 19:19:44 -04:00
Peter Boyle
d10d30dda8
Script update
2022-06-15 19:18:58 -04:00
Peter Boyle
0e9666bc92
Test update
2022-06-15 19:18:42 -04:00
Peter Boyle
6efd80f104
Printing
2022-06-15 18:23:46 -04:00
Peter Boyle
fdef7a1a8c
Dirichlet fix
2022-06-15 00:05:20 -04:00
Peter Boyle
501bb117bf
Const correct
2022-06-15 00:04:09 -04:00
Peter Boyle
05ca7dc252
Const correctness
2022-06-14 23:41:05 -04:00
Peter Boyle
e9648a1635
Useful periodic print. CG convergence bound is remarkably accurate on
...
low eigenvalue in numerical tests
2022-06-14 23:40:04 -04:00
Peter Boyle
2df98a99bc
Merge pull request #406 from giordano/patch-1
...
Update default value of gen-simd-width in README
2022-06-14 17:46:25 -04:00
Mosè Giordano
315ea18be2
Update default value of gen-simd-width in README
2022-06-14 22:41:05 +01:00
Peter Boyle
9a9f4a111f
Merge pull request #405 from giltirn/feature/dirichlet-gparity-stage
...
Import round 3
2022-06-06 18:45:37 -04:00
Christopher Kelly
1ad54d049d
To PeriodicBC and ConjugateBC, added a new function "CshiftLink" which performs a boundary-aware C-shift of links or products of links. For the latter, the links crossing the global boundary are complex-conjugated.
...
To the gauge implementations, added CshiftLink functions calling into the appropriate operation for the BC in a given direction.
GaugeTransform, FourierAcceleratedGaugeFixer and WilsonLoops::FieldStrength no longer implicitly assume periodic boundary conditions; instead the shifted link is obtained using CshiftLink and is aware of the gauge implementation.
Added an assert-check to ensure that the gauge fixing converges within the specified number of steps.
Added functionality to compute the timeslice averaged plaquette
Added functionality to compute the 5LI topological charge and timeslice topological charge
Added a check of the properties of the charge conjugation matrix C=-gamma_2 gamma_4 to Test_gamma
Fixed const correctness for Replicate
Modified Test_fft_gfix to support either conjugate or periodic BCs, optionally disabling Fourier-accelerated gauge fixing, and tuning of alpha using cmdline options
2022-06-02 15:30:41 -04:00
Peter Boyle
57bd0a0a22
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-06-01 19:29:38 -04:00
Peter Boyle
b49db84b08
Slurm updates
2022-06-01 19:27:42 -04:00
Peter Boyle
583f7c52f3
SSC mark
2022-06-01 19:27:29 -04:00
Peter Boyle
58a86c9164
SSC mark removal
2022-06-01 19:27:06 -04:00
Peter Boyle
a25b32847f
Crusher patch
2022-06-01 19:26:37 -04:00
Peter Boyle
6f1a2e132b
SSC mark causing problems
2022-06-01 19:26:06 -04:00
Peter Boyle
b1ede7b46d
Faster RNG init
2022-06-01 19:25:42 -04:00
Peter Boyle
e762c940c2
Reduce the loop over exterior for GPU to indirection table
2022-06-01 14:29:25 -07:00
Peter Boyle
6a1a198144
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-05-29 11:08:09 -04:00
Peter Boyle
34faa39f4f
Clean up Dirichlet. Big oops fix
2022-05-28 17:18:08 -07:00
Peter Boyle
5ddea3829d
Extra easier signature for peek
2022-05-28 15:52:39 -07:00
Peter Boyle
7eb29cf529
MPI fix
2022-05-28 15:51:34 -07:00
Peter Boyle
f729b9b889
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-05-25 14:16:09 -04:00
Peter Boyle
4f997c5f04
Remove extra face kernels in Dirichlet
2022-05-25 11:15:25 -07:00
Peter Boyle
a9c2e1df03
Merge pull request #404 from rrhodgson/feature/json_nvcc
...
Feature/json nvcc
2022-05-25 13:30:11 -04:00
Peter Boyle
d3496d2fe0
Merge pull request #397 from giltirn/feature/dirichlet-gparity-stage
...
Gparity HMC import round 2
2022-05-25 13:29:45 -04:00
Peter Boyle
60f4cb0ffd
Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet
2022-05-25 12:38:10 -04:00
Peter Boyle
136d843ce7
Crusher updates
2022-05-25 12:36:09 -04:00
Peter Boyle
18028f4309
Merge branch 'develop' into feature/dirichlet
2022-05-24 18:26:18 -07:00
Peter Boyle
5164016740
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2022-05-24 18:25:57 -07:00
Peter Boyle
d83beaa890
Update perlmutter
2022-05-24 18:25:00 -07:00
Peter Boyle
f9f05e995b
Update perlmutter
2022-05-24 18:24:38 -07:00
Peter Boyle
e651b9e7ab
Clean up stencil with better intranode Dirichlet / DDHMC support.
...
14TF/s on a Perlmutter node
2022-05-24 18:23:39 -07:00
Peter Boyle
47b4e91473
Verbose change
2022-05-24 18:19:18 -07:00
Peter Boyle
3f31afa4fc
Clean up verbose
2022-05-24 18:18:51 -07:00
da4daea57a
Updated json to latest release 3.10.5
2022-05-24 16:16:06 +01:00
Peter Boyle
af3b065add
Merge pull request #403 from fjosw/fix/cuda_11_5_warnings
...
Fixed nvcc 11.5+ warnings
2022-05-24 11:10:02 -04:00
e346154c5d
Updated json CUDA compile guards
2022-05-24 15:48:01 +01:00
7937ac2bab
fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT standard in pugixml/pugixml.cc
2022-05-24 15:31:03 +01:00
e909aeedf0
fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT standard in Grid_Eigen_Dense.h
2022-05-24 15:29:42 +01:00
bab8aa8eb0
fix: conditional pragmas according to new NVCC_DIAG_PRAGMA_SUPPORT
...
standard in DisableWarnings.h
2022-05-24 15:27:40 +01:00
Peter Boyle
38b22f05be
Merge pull request #402 from fjosw/fix/clover_warnings
...
fixed clover warnings
2022-05-24 10:05:27 -04:00
3ca0de1c40
Fix json write for vector<string>
2022-05-24 14:37:33 +01:00
c7205d2a73
Removed nvcc guards for json
2022-05-24 14:30:26 +01:00
617c5362c1
fix: fixed warning: missing return statement at end of non-void function
...
in CloverHelpers
2022-05-24 11:37:33 +01:00
Peter Boyle
083b58e66d
Merge pull request #401 from JPRichings/LocalCoheranceDeflation
...
Local coherance batch deflation
2022-05-20 11:44:22 -04:00
Peter Boyle
633427a2df
Merge pull request #400 from JPRichings/wilson_sweep
...
bench wilson sweep fix
2022-05-20 11:43:40 -04:00
2031d6910a
Merge branch 'paboyle:develop' into wilson_sweep
2022-05-20 16:20:23 +01:00
Peter Boyle
f82ce67624
Dirichlet improved
2022-05-19 19:17:11 -07:00
Peter Boyle
b52e8ef65a
Dirichlet changes
2022-05-19 16:45:41 -07:00
Peter Boyle
2594e3c230
Dirichlet option
2022-05-19 16:45:19 -07:00
Peter Boyle
8cedb45af2
Dirichlet BCs
2022-05-19 16:45:02 -07:00
Peter Boyle
aa008cbe99
Updated for new Dirichlet interface
2022-05-19 16:44:39 -07:00
79e34b3eb4
Local Coherence batch deflation
2022-05-19 14:53:17 +01:00
4f3d581ab4
Merge branch 'paboyle:develop' into LocalCoheranceDeflation
2022-05-19 14:46:17 +01:00
Peter Boyle
6fb6ca5b6b
Merge branch 'develop' into feature/dirichlet
2022-05-17 09:09:00 -07:00
Peter Boyle
b8ee19691c
Updated config for PM
2022-05-17 09:08:12 -07:00
Peter Boyle
d16427b837
Merge pull request #399 from fjosw/fix/Nc_neq_3
...
fix: assert for dimensions of compact Wilson clover moved to constructor
2022-05-17 09:03:42 -04:00
4b1997e2f3
wilson sweep test
2022-05-16 15:58:33 +01:00
8939d5dc73
bugfix: eo operator called in correct location
2022-05-16 00:28:28 +01:00
b051e00de0
Additional Local Coherance Deflation operator()
2022-05-16 00:25:13 +01:00
8aa75b492f
Merge branch 'develop' into fix/Nc_neq_3
2022-05-10 14:22:03 +01:00
Peter Boyle
0274f40686
Merge pull request #389 from mbruno46/mbruno-eclover
...
Feature/expClover
2022-05-10 09:18:19 -04:00
Peter Boyle
77aa147ce5
Merge branch 'develop' into mbruno-eclover
2022-05-10 09:16:53 -04:00
32facbd02a
fix: assert for dimensions of compact Wilson clover moved to
...
constructor.
2022-05-10 10:53:22 +01:00
Christopher Kelly
6121397587
Imported changes from feature/gparity_HMC branch:
...
Added storage of final true residual in mixed-prec CG and enhanced log output
Fixed const correctness of multi-shift constructor
Added a mixed precision variant of the multi-shift algorithm that uses a single precision operator and applies periodic reliable update to the residual
Added tests/solver/Test_dwf_multishift_mixedprec to test the above
Fixed local coherence lanczos using the (large!) max approx to the chebyshev eval as the scale from which to judge the quality of convergence, resulting a test that always passes
Added a method to local coherence lanczos class that returns the fine eval/evec pair
Added iterative log output to power method
Added optional disabling of the plaquette check in Nerscio to support loading old G-parity configs which have a factor of 2 error in the plaquette
G-parity Dirac op no longer allows GPBC in the time direction; instead we toggle between periodic and antiperiodic
Replaced thread_for G-parity 5D force insertion implementation with accelerator_for version capable of running on GPUs
Generalized tests/lanczos/Test_dwf_lanczos to support regular DWF as well as Gparity, with the action chosen by a command line option
Modified tests/forces/Test_dwf_gpforce,Test_gpdwf_force,Test_gpwilson_force to use GPBC a spatial direction rather than the t-direction, and antiperiodic BCs for time direction
tests/core/Test_gparity now supports using APBC in time direction using command line toggle
2022-05-09 16:27:57 -04:00
Peter Boyle
4de50ab146
Merge pull request #396 from fjosw/fix/readd_config.h
...
fix: readded Config.h and Version.h to HFILEs in Grid/Makefile.am
2022-05-09 08:26:48 -04:00
8b12a61097
fix: readded Config.h and Version.h to HFILEs in Grid/Makefile.am
2022-05-09 11:53:22 +01:00
Peter Boyle
79ea027c0b
Merge pull request #377 from RJHudspith/develop
...
NERSC and ILDG for non-SU(3) configuration checkpoints
2022-05-03 08:55:48 -04:00
Peter Boyle
62339d437f
Merge pull request #387 from lehner/feature/gpt
...
Parity mass terms for domain wall fermions to enable 4d eofa
2022-05-03 08:52:18 -04:00
Peter Boyle
698e745276
Merge pull request #390 from fjosw/feature/conserved_current_wilson
...
Conserved current for wilson fermions
2022-05-03 08:51:10 -04:00
Peter Boyle
0417b96896
Merge pull request #391 from giltirn/feature/dirichlet-gparity-stage
...
First stage of import
2022-05-03 08:50:18 -04:00
Peter Boyle
9a6e2c315d
Merge pull request #394 from fjosw/fix/gauge_fix_ErrorOnNoConverge
...
SteepestDescentGaugeFix now exits when the algorithm does not converge.
2022-05-03 08:49:26 -04:00
e61fed87db
SteepestDescentGaugeFix now exits when the algorithm does not converge.
...
This behaviour can be altered by setting err_on_no_converge to false.
2022-04-20 15:41:55 +01:00
Christopher Kelly
81fe4c937e
Hopefully fix link errors on Intel compilers due to having no function body for MomentumFilterBase::apply_phase
2022-04-12 09:51:59 -04:00
Christopher Kelly
f77f3a6598
Imported G-parity flavor algebra + tester from feature/gparity_HMC branch
2022-04-06 10:21:04 -04:00
Peter Boyle
239afb18fb
Merge branch 'feature/dirichlet' into feature/dirichlet-gparity
2022-04-05 16:49:32 -04:00
Peter Boyle
ef820a26cd
Bcopy on crusher compile
2022-04-05 16:49:02 -04:00
Peter Boyle
65abe4d0d3
Merge branch 'feature/dirichlet' into feature/dirichlet-gparity
2022-04-05 16:26:54 -04:00
Peter Boyle
5012adfebf
Merge branch 'develop' into feature/dirichlet
2022-04-05 16:26:19 -04:00
Peter Boyle
b808d48fa1
Tone down printing in integrator
2022-04-05 16:25:22 -04:00
Peter Boyle
83f818a99d
Updates for DDHMC
2022-04-05 16:24:34 -04:00
b8bc560b51
Test_wilson_conserved_current implemented, all 5d references removed.
2022-04-05 17:33:45 +01:00
6bc2483d57
Merge branch 'feature/eclover' into feature/conserved_current_wilson
2022-04-05 15:26:49 +01:00
82aecbf4cf
Test_wilson_conserved_current added
2022-04-05 15:26:39 +01:00
Mattia Bruno
ee23a76aa0
Merge pull request #2 from fjosw/feature/eclover
...
Feature/eclover
2022-04-05 13:30:13 +02:00
d7191e5a02
SeqConservedCurrent implemented for Wilson fermions
2022-04-05 11:48:56 +01:00
c8a824425b
Error message added if another conserved current than vector is requested for
...
Wilson type fermions.
2022-04-05 10:58:22 +01:00
f23626a6b8
End scope by additional block in CloverHelpers.h
2022-04-02 16:08:15 +01:00
6577a03d16
Explcitly closed views in Exponentiate_Clover
2022-04-01 18:39:12 +01:00
427c8695fe
Change signs and prefactors for conserved current to mimic the 5d
...
version.
2022-04-01 16:20:21 +01:00
9e82c468ab
Multiplication of diagonal mass in exponentiate fixed for gpus
2022-04-01 15:54:43 +01:00
603fd96747
Missing link multiplication added.
2022-04-01 10:58:56 +01:00
fe993c0836
/=2 replaced by *=0.5
2022-03-31 17:08:17 +01:00
cdf31d52c1
GaugeGrid and typo fixed
2022-03-31 17:04:35 +01:00
0542eaf1da
First version of conserved current contraction for Wilson type quarks
2022-03-31 17:02:09 +01:00
Christoph Lehner
317bdcf158
nerscio parametrization
2022-03-24 13:10:47 +01:00
Peter Boyle
387397374a
Current run options
2022-03-23 16:35:11 -04:00
Mattia Bruno
9ca2c98882
Merge branch 'develop' of https://github.com/paboyle/Grid into mbruno-eclover
2022-03-22 15:31:37 +01:00
Peter Boyle
605cf401e1
Merge branch 'feature/sumd-npr' into develop
2022-03-16 22:43:12 +00:00
Peter Boyle
f99c3660d2
Merge branch 'feature/cpu-threaded-smp' into develop
2022-03-16 22:07:54 +00:00
Peter Boyle
92a83a9eb3
Performance improve for Tesseract
2022-03-16 17:14:36 +00:00
Mattia Bruno
53ae01a34a
Merge pull request #1 from fjosw/feature/eclover
...
Feature/eclover
2022-03-15 15:23:35 +01:00
Peter Boyle
b615fa0f35
Merge pull request #388 from fjosw/feature/sumd-npr
...
Feature/sumd npr
2022-03-15 09:05:57 -04:00
Christoph Lehner
76c294a7ba
open bc fix
2022-03-08 13:55:16 +01:00
0c0c2b1e20
Unnecessary arguments of CloverHelpers::Exponentiate_Clover removed.
2022-03-08 09:44:51 +00:00
Christoph Lehner
e2fc3a0f04
Merge pull request #28 from paboyle/develop
...
Sync with Upstream
2022-03-08 09:58:51 +01:00
451e7972fd
Reintroduced explicit inversion of the Clover term in case of the
...
CompactExpClover because of the open boundary O(a) improvement. Changed
the timing output to GridLogDebug
2022-03-07 17:43:33 +00:00
56c089d347
Removed leftover comments
2022-03-07 16:40:20 +00:00
acf740e44d
Merge pull request #1 from FelixPGZiegler/feature/eclover
...
Feature/eclover
2022-03-07 16:25:11 +00:00
182f513404
Merge remote-tracking branch 'fjosw/feature/eclover' into feature/eclover
2022-03-07 15:22:04 +00:00
d5b2323a57
included Cayley-Hamilton exponentiation for the compact Wilson exp clover, bug fix for inverse of exp clover
2022-03-07 14:44:24 +00:00
FelixPGZiegler
bad18d4417
Merge branch 'paboyle:develop' into feature/eclover
2022-03-07 13:54:10 +00:00
Peter Boyle
bb5c16b97f
New scripts
2022-03-03 17:00:37 -05:00
Peter Boyle
0d80eeb545
small DDHMC update
2022-03-03 16:56:02 -05:00
d1decee4cc
Cleaned up unused variables in Lattice_reduction_gpu.h
2022-03-02 16:54:23 +00:00
d4ae71b880
sum_gpu_large and sum_gpu templates added.
2022-03-02 15:40:18 +00:00
Peter Boyle
b0f4eee78b
New files
2022-03-01 19:09:13 -05:00
Peter Boyle
5340e50427
HMC running with new formulation
2022-03-01 17:10:25 -05:00
Peter Boyle
e16fc5b2e4
Threaded intranode comms transfer - ideally between NUMA domains
2022-03-01 11:17:24 -05:00
Peter Boyle
694306f202
Configure for mac arm
2022-03-01 10:53:44 -05:00
Peter Boyle
9aac1e6d64
Merge branch 'develop' into feature/sumd-npr
2022-03-01 10:51:38 -05:00
Peter Boyle
3e882f555d
Large / small sumD options
2022-03-01 08:54:45 -05:00
438caab25f
generate_instantiations.sh now correctly produces instantiations for CompactClover variant, redundant instantiations removed.
2022-02-27 18:27:18 +00:00
239e2c1ee6
tests: wilson clover cg tests now include compact variant as well as
...
exponential wilson clover operators
2022-02-27 18:26:34 +00:00
013dc2ef33
tests: core tests for wilson clover and wilson exp clover including
...
compact version extended/added
2022-02-27 18:13:47 +00:00
Christoph Lehner
9616811c3d
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt
2022-02-24 22:03:05 +01:00
Christoph Lehner
8a3002c03b
separate left and right masses for CayleyFermion5D
2022-02-24 22:02:56 +01:00
Peter Boyle
0f1c5b08a1
Dirichlet filters running on AMD and now integrated in Fermion op
2022-02-23 19:29:28 -05:00
Peter Boyle
70988e43d2
Passes multinode dirichlet test with boundaries at
...
node boundary or at the single rank boundary
2022-02-23 01:42:14 -05:00
Mattia Bruno
71034f828e
attempt to fix broken WilsonExpClover; Compact version still broken will be replaced by F.Joswig
2022-02-23 01:02:27 +01:00
Peter Boyle
aab3bcb46f
Dirichlet first cut - wrong answers on dagger multiply.
...
Struggling to get a compute node so changing systems
2022-02-22 19:58:33 +00:00
Mattia Bruno
11437930c5
cleaned up definitions of wilsonclover fermions
2022-02-22 10:45:16 +01:00
Mattia Bruno
3d44aa9cb9
cleaned up cloverhelpers; fixed test compact_clover which runs
2022-02-22 01:10:19 +01:00
Mattia Bruno
2851870d70
expClover support via helpers template class
2022-02-22 00:05:43 +01:00
Peter Boyle
da06d15f73
Merge branch 'feature/feature/staggered-comms' into develop
2022-02-17 04:58:50 +00:00
Peter Boyle
e8b1251b8c
Staggered fix finished
2022-02-17 04:51:13 +00:00
Peter Boyle
63dbaeefaa
Extra barrier prior to finalize just in case it fixes an issue on Tursa
2022-02-16 14:01:43 +00:00
Peter Boyle
e8c187b323
SyCL happier?
2022-02-15 11:24:38 -05:00
Peter Boyle
fad5a74a4b
Bug fix to detection case
2022-02-15 10:27:39 -05:00
Peter Boyle
e83f6a6ae9
Merge branch 'develop' into feature/feature/staggered-comms
2022-02-15 08:52:39 -05:00
Peter Boyle
0c1618197f
Faster intranode MPI works now
2022-02-15 08:52:07 -05:00
Peter Boyle
f49d5c2d22
Updated scripts for crusher
2022-02-14 17:55:16 -05:00
Peter Boyle
a3b022d469
Crusher compile
2022-02-14 15:09:08 -05:00
Peter Boyle
48772f0976
Merge pull request #384 from jdmaia/hip_launchbounds
...
Changing thread block order and adding launch_bounds
2022-02-14 11:08:28 -05:00
Peter Boyle
c322420580
Dont instantiate an Nc=3 and non-GP hardwired code for other implementations
2022-02-14 16:04:08 +00:00
Azusa Yamaguchi
6283d11d50
Add the comment line to tell the existance of copied data/buffer
2022-02-08 15:22:06 +00:00
Julio Maia
86f4e17928
Changing thread block order and adding launch_bounds
2022-02-07 11:29:37 -06:00
Peter Boyle
6616d5d090
Commit
2022-02-02 16:38:24 -05:00
Peter Boyle
215df671be
Merge pull request #382 from DanielRichtmann/feature/compact-clover
...
Compact Clover Fermions
2022-02-01 21:45:38 -05:00
Daniel Richtmann
1b6b12589f
Get splitting up into implementation and instantiation files correct
2022-02-02 00:51:11 +01:00
Daniel Richtmann
3082ab8252
Check in compact version of wilson clover fermions
2022-02-02 00:50:05 +01:00
Daniel Richtmann
add86cd7f4
Abandon ET for clover application, use construct similar to multLink
2022-02-01 23:09:06 +01:00
Daniel Richtmann
0b6fd20c54
Enable memory coalescing in clover term generation
2022-02-01 23:09:06 +01:00
Daniel Richtmann
e83423fee6
Refactor clover to align with other files and prepare for upcoming changes
2022-02-01 23:09:06 +01:00
Daniel Richtmann
b4f8e87982
Have Grid's cli interface understand floats
2022-02-01 23:09:06 +01:00
Peter Boyle
135808dcfa
Less verbose
2021-12-07 16:24:24 -05:00
Peter Boyle
7f7d06d963
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-12-07 09:06:42 -08:00
Peter Boyle
2bf3b4d576
Update to reduce memory footpring in benchmark test
2021-12-07 09:02:02 -08:00
RJHudspith
0bd83cdbda
Fixes for Nc!=3 Nersc IO, Gauge and Gauge_NCxNC compatible with GLU. Trace normalisation changed in places removing explicit threes. Guards against non-su3 tests and tests failing when LIME is not compiled.
2021-11-28 21:51:03 +01:00
Peter Boyle
f34d34bd17
2 nodes
2021-11-22 22:27:16 -05:00
Peter Boyle
e32d5141b4
Updated to make MPI reliable still gives good perf, but MPI will be slow
...
intranode
2021-11-22 21:46:31 -05:00
Peter Boyle
6d5277f2d7
Update to Spock
2021-11-22 20:58:02 -05:00
Peter Boyle
14d82777e0
Best modules for spock
2021-11-22 20:47:16 -05:00
Peter Boyle
2a4e739513
Enable XGMI copy (need to rename nvlink to cover NVLINK/XGMI/XeLink)
2021-11-22 20:46:09 -05:00
Peter Boyle
8079dc2a14
Cray MPI not working right yet
2021-11-22 20:45:44 -05:00
Peter Boyle
6ceb556684
Intranode asynch hipMemCopy
2021-11-22 20:45:12 -05:00
Peter Boyle
76cde73705
HIP improvements on messaging and intranode hipMemCopyAsynch
2021-11-22 20:44:39 -05:00
Peter Boyle
cc094366a9
Merge pull request #375 from JPRichings/develop
...
Lattice object ACCcache probe
2021-11-09 18:19:32 -05:00
41a575ff9b
Format edit
2021-11-09 21:56:23 +00:00
12ef413065
fix to deflation.h
2021-11-09 21:20:36 +00:00
829a328451
remove deflation timing
2021-11-09 20:46:57 +00:00
402523c62e
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-11-09 12:57:40 +00:00
d7bef70b5c
Helper functions to allow probe of cache state of lattice objects.
2021-11-09 12:57:09 +00:00
2ad1811642
Added timing to deflation code.
2021-11-09 12:33:25 +00:00
Alessandro Lupo
88bdd4344b
2indx antisymm representation of sp2n
2021-11-04 18:27:35 +00:00
a65a497bae
Merge branch 'develop' of github.com:paboyle/Grid into develop
2021-10-29 13:01:34 +01:00
b27b12828e
reverse previous "fix", missing statement was probably intentional, added a comment to that effect
2021-10-29 13:01:31 +01:00
Peter Boyle
42d56ea6b6
Verbosity
2021-10-29 02:23:08 +01:00
Peter Boyle
0b905a72dd
Better reduction for GPUs
2021-10-29 02:22:22 +01:00
Peter Boyle
fe9edf8526
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-10-29 02:03:27 +01:00
Peter Boyle
44204c7e06
Extra code
2021-10-29 02:02:56 +01:00
Peter Boyle
33b3789598
Merge pull request #364 from AndrewYongZhenNing/develop
...
CayleyFermion5D Conserved current fix
2021-10-27 20:27:20 -04:00
Peter Boyle
195ab2888d
Merge branch 'develop' into develop
2021-10-27 20:26:57 -04:00
Peter Boyle
85f750d753
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-10-27 00:28:05 +01:00
Peter Boyle
a4ce6e42c7
Warning free compile on make all and make tests under nvcc
2021-10-27 00:27:03 +01:00
Peter Boyle
5398b7e7e3
Max 128 size
2021-10-26 09:16:29 -07:00
fd13a3f2be
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-10-26 10:45:46 +01:00
c144b32368
deflation timers
2021-10-26 10:37:24 +01:00
Alessandro Lupo
4044536eea
add projection on sp2n algebra
2021-10-26 10:20:44 +01:00
Alessandro Lupo
4d8ae6221c
fix projection
2021-10-22 10:44:54 +01:00
Peter Boyle
ba7e371b90
Warning free compile on Tursa.
...
Hopefully got all reqd virtual dtors
2021-10-21 19:56:52 +01:00
Peter Boyle
99e7a5d18a
Merge pull request #371 from edbennett/hmc-documentation-update
...
update documentation for GenericHMCRunner - thanks
2021-10-18 14:36:43 -04:00
f824d99059
update documentation for GenericHMCRunner
2021-10-18 09:50:16 +01:00
Peter Boyle
749b8022a4
Linear operator and SparseMatrix virtual destructors
2021-10-15 20:47:18 +01:00
Peter Boyle
7e0057d2c4
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-10-15 20:46:51 +01:00
Peter Boyle
cfe9e870d3
Stream
2021-10-15 20:46:44 +01:00
Peter Boyle
e9c4f06cbf
Merge pull request #370 from fjosw/bugfix/gpu_sum_shm
...
Error Handling sum_Dgpu large objects
2021-10-14 09:12:47 -04:00
1f9688417a
Error message added when attempting to sum object which is too large for
...
the shared memory
2021-10-13 20:45:46 +01:00
Alessandro Lupo
4e31e4e094
Better tests
2021-10-13 15:07:23 +01:00
Alessandro Lupo
0d6674e489
hot start for sp2n
2021-10-12 18:53:54 +01:00
Alessandro Lupo
b145fd4f5b
necessary to merge
2021-10-12 17:08:46 +01:00
Alessandro Lupo
8a5b794f25
necessary change to merge with upstrm
2021-10-12 16:04:03 +01:00
Alessandro Lupo
291e80f88a
sp2n as config option
2021-10-12 16:00:32 +01:00
Alessandro Lupo
1ace5850ae
first hmc
2021-10-12 16:00:32 +01:00
Alessandro Lupo
283f14b7c1
fix sp2n projection
2021-10-12 16:00:32 +01:00
Alessandro Lupo
1d6e708083
tests!
2021-10-12 16:00:32 +01:00
Alessandro Lupo
89457e25e3
sp fermion instantiation
2021-10-12 16:00:32 +01:00
Alessandro Lupo
7e3b298d3d
project on sp2n
2021-10-12 16:00:32 +01:00
Alessandro Lupo
7ff3e5eed4
gauge and fermion implementation for sp2n
2021-10-12 16:00:32 +01:00
Alessandro Lupo
19eb51cf41
sp2n generators
2021-10-12 15:53:33 +01:00
Alessandro Lupo
470d4dcc6d
sp2n as config option
2021-10-12 15:47:56 +01:00
Alessandro Lupo
ed03bfd555
first hmc
2021-10-12 12:16:47 +01:00
Alessandro Lupo
8c0fbcccae
fix sp2n projection
2021-10-12 12:12:16 +01:00
Alessandro Lupo
d4866157fe
tests!
2021-10-12 09:06:15 +01:00
Peter Boyle
16c2a99965
Overlap cudamemcpy - didn't set up stream right
2021-10-11 13:31:26 -07:00
Alessandro Lupo
b6496b6cb5
sp fermion instantiation
2021-10-11 16:32:10 +01:00
Alessandro Lupo
4f5fe57920
project on sp2n
2021-10-11 16:28:15 +01:00
Alessandro Lupo
11fb943b1e
gauge and fermion implementation for sp2n
2021-10-11 16:21:25 +01:00
Peter Boyle
cda915a345
Better options
2021-10-07 20:29:09 +01:00
Peter Boyle
7c16189e16
Merge pull request #368 from Heinrich-BR/develop
...
Accelerated Pick-Set Checkerboard functions
2021-10-07 15:13:09 -04:00
Peter Boyle
ecbfccea43
Merge pull request #369 from paboyle/gauge-group-covariance
...
expose gauge group in GImpl and generic Nc fix
2021-10-07 15:11:12 -04:00
Peter Boyle
a8eda8f6da
Summit scripts
2021-10-05 21:22:10 -04:00
Peter Boyle
9b1a0653cf
Summit results
2021-10-05 21:22:01 -04:00
Peter Boyle
7cb1ff7395
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-10-05 20:13:42 -04:00
Peter Boyle
ab6ea29913
Print removal
2021-10-05 20:13:25 -04:00
b5c81a02b6
Merge branch 'develop' of github.com:paboyle/Grid into develop
2021-10-05 21:13:01 +01:00
d899ee80fc
skip record fixed to include norm metadata
2021-10-05 21:12:47 +01:00
Peter Boyle
4016e705fc
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-10-05 14:56:57 -04:00
Peter Boyle
2f4e85e5d6
Summit set up
2021-10-05 14:56:17 -04:00
Peter Boyle
8ed0b57b09
Memory verbose and tracking, shrink default cache
...
Print PCI device IDs on node 0
2021-10-05 11:41:03 -04:00
Alessandro Lupo
046a23121e
sp2n generators
2021-10-05 15:51:22 +01:00
a976fa6746
expose gauge group in GImpl and generic Nc fix
2021-10-05 14:19:47 +01:00
6c66b8d997
deflated guesser can optionally be used with less vectors than provided
2021-09-30 19:25:12 +01:00
9523ad3d73
vector version of Schur solver use vector guesser
2021-09-28 12:45:47 +01:00
73a95fa96f
LinearFunction loops over vectors by default, can be overloaded
2021-09-28 12:44:26 +01:00
7e130076d6
Fixed line left behind
2021-09-24 17:26:31 +01:00
6efdad6f21
Removed Halo benchmark
2021-09-24 17:18:04 +01:00
a822c48565
Added accelerated pick-set checkerboard functions
2021-09-24 17:13:25 +01:00
014fb76e88
Merge branch 'develop' of https://github.com/Heinrich-BR/Grid into develop
2021-09-24 16:45:25 +01:00
30e5311b43
Update from the gods upstream
2021-09-24 16:39:56 +01:00
Peter Boyle
67e08aa952
New file not run yet
2021-09-23 23:39:55 +02:00
Peter Boyle
ed1f20f3a1
Merge pull request #367 from mmphys/bugfix/H5NS
...
Hdf5 namespace
2021-09-23 12:36:11 -04:00
Peter Boyle
cffc736bb3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-22 06:03:06 -07:00
Peter Boyle
c0d56a1c04
Perlmutter tune up
2021-09-22 06:02:34 -07:00
Peter Boyle
3206f69478
SYCL happy
2021-09-21 18:01:35 -07:00
Peter Boyle
b2ccaad761
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-21 12:18:05 -07:00
Peter Boyle
8eb1232683
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-21 09:25:07 -07:00
Peter Boyle
c6ce3ad03b
Some properties
2021-09-21 09:20:21 -07:00
Peter Boyle
b3b033d343
Clean
2021-09-21 09:18:54 -07:00
Peter Boyle
ca9816bfbb
Typo
2021-09-21 04:12:04 +02:00
Peter Boyle
814d5abc7e
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-21 04:05:51 +02:00
Peter Boyle
a29122e2bf
Rebench
2021-09-21 04:05:04 +02:00
Peter Boyle
e188c0512e
Udpdate
2021-09-21 01:04:30 +02:00
Peter Boyle
1fb6aaf150
Device 2 Device with cudaMemcpy
2021-09-21 01:03:07 +02:00
Peter Boyle
894654f7ef
Simplificatoin, always gather faces
2021-09-21 01:02:34 +02:00
Peter Boyle
109507888b
Option to force use of MPI over Nvlink
2021-09-21 00:53:25 +02:00
Peter Boyle
68650b61fe
Options controlling behaviour
2021-09-21 00:51:01 +02:00
Michael Marshall
7ee66bf453
Make sure H5NS has empty definition if HDF5 built without C++ namespace. Add comment in Hdf5IO.cc indicating likely source of error using H5NS, i.e. lack of --enable-cxx in hdf5 configure.
2021-09-19 19:45:20 +01:00
Peter Boyle
8bd70ad8b5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-16 10:22:38 -07:00
Peter Boyle
af98525766
Merge pull request #359 from paboyle/feature/serialisation-update
...
Feature/serialisation update
2021-09-16 10:24:52 -04:00
Peter Boyle
1c2f218519
Merge pull request #360 from pjgeorg/ld-nvcc-openmp
...
nvcc: Add -fopenmp to LDFLAGS
2021-09-16 10:24:30 -04:00
Peter Boyle
c9aa1f507c
Merge pull request #363 from felixerben/feature/testMesonField
...
Feature/test meson field
2021-09-16 10:23:58 -04:00
Peter Boyle
ea7126496d
Merge pull request #361 from edbennett/fix-setdevice-message
...
make message about setdevice consistent with configure script
2021-09-16 10:23:37 -04:00
Peter Boyle
f660dc67e4
Merge pull request #366 from lehner/feature/gpt
...
Avx512 mixed prec
2021-09-15 20:27:13 -04:00
Christoph Lehner
ede8faea74
Merge branch 'paboyle:develop' into feature/gpt
2021-09-16 02:23:15 +02:00
Christoph Lehner
1b750761c2
Merge pull request #26 from waterret/feature/gpt
...
AVX512 drop mixed precision as well
2021-09-16 02:22:52 +02:00
Peter Boyle
145acf2919
Perf results
2021-09-16 01:06:28 +01:00
Peter Boyle
cc4a27b9e6
Scripts and performance
2021-09-16 00:15:35 +01:00
Peter Boyle
b4690e6091
Adding build basics for different systems
2021-09-16 00:00:38 +01:00
Luchang Jin
4b24800132
AVX512 drop mixed precision as well
2021-09-15 16:29:47 -04:00
Peter Boyle
9d2238148c
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-09-15 19:25:57 +01:00
Peter Boyle
c15493218d
Two extra routines to break out SchurRedBlack on many RHS into stages to allow efficient deflation & split grid
...
Split grid solver still to do.
2021-09-15 19:24:39 +01:00
Peter Boyle
001a556a34
Merge pull request #365 from lehner/feature/gpt
...
Sync
2021-09-15 13:34:02 -04:00
Christoph Lehner
3d0f88e702
A64FX drop mixed precision as well
2021-09-15 18:38:32 +02:00
Christoph Lehner
dd091d0960
consistent pointer offloading instead of views
2021-09-15 16:58:05 +02:00
Christoph Lehner
e2abbf9520
Merge pull request #25 from paboyle/develop
...
Sync
2021-09-15 10:02:43 +02:00
Peter Boyle
c7baeb5bae
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-09-14 08:31:11 -07:00
Peter Boyle
402d80e197
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-09-14 16:16:06 +01:00
Peter Boyle
86e33c8ab2
Significant GPU perf speed up finished
2021-09-14 16:14:23 +01:00
Peter Boyle
5dae6a6dac
Deprecate half prec comms
2021-09-14 15:06:59 +01:00
Peter Boyle
361bb8a101
Remove half prec comms
2021-09-14 15:06:29 +01:00
Peter Boyle
7efdb3cd2b
Remove half prec comms
2021-09-14 15:06:06 +01:00
Peter Boyle
65ef4ec29f
Move tables to device memory
2021-09-14 15:05:01 +01:00
Peter Boyle
d5835c0222
Switch to coalesced stencil face gather
2021-09-14 15:04:14 +01:00
Peter Boyle
a7b943b33e
Remove half prec comms
2021-09-14 05:05:33 +01:00
Peter Boyle
7440cde92f
No half prec comms; coalesced access on GPU
2021-09-14 05:04:56 +01:00
Peter Boyle
0fc662bb24
Dirac cuda 11.4 happy ; force host for functions accessing mult table
...
ET runs these on host BEFORE lodging result in AST for kernel
2021-09-14 05:00:44 +01:00
Peter Boyle
8195890640
Force MPI over NVLINK
2021-09-14 05:00:17 +01:00
Peter Boyle
4c88104a73
Fix compile warns
2021-09-11 23:08:05 +01:00
Peter Boyle
73b944c152
Drop half prec comms for now.
2021-09-11 23:07:18 +01:00
Peter Boyle
d1b0b7f5c6
Half prec comms dropping
2021-09-11 23:05:40 +01:00
Peter Boyle
381d8797d0
Drop half prec comms for now
2021-09-11 23:05:02 +01:00
11ee8a1061
Merge remote-tracking branch 'upstream/develop' into develop
2021-09-02 16:57:42 +01:00
Peter Boyle
b06526bc1e
Comment update
2021-08-30 21:15:39 -04:00
Peter Boyle
3044419111
Some sample code
2021-08-30 20:32:11 -04:00
Peter Boyle
bcfa9cf068
Improvement of output
2021-08-28 08:08:15 -07:00
Peter Boyle
114920b8de
Some example clean up
2021-08-25 12:24:17 +01:00
Peter Boyle
0d588b95f4
Bug fix to Example_Laplacian test
2021-08-23 23:14:26 +01:00
Peter Boyle
5b3c530aa7
Return value
2021-08-23 15:30:45 +01:00
Peter Boyle
c6a5499c8b
Fail on non-apple
2021-08-22 18:40:55 +01:00
Peter Boyle
ec9c3fe77a
Remove the file
2021-08-22 18:28:39 +01:00
Peter Boyle
6135ad530e
Extra examples / solutions
2021-08-22 18:25:07 +01:00
Peter Boyle
40098424c7
Examples
2021-08-22 14:17:12 +01:00
Peter Boyle
7163b31a26
Examples
2021-08-20 01:15:23 +01:00
Peter Boyle
ffbdd91e0e
Apple happiness
2021-08-20 01:15:00 +01:00
Peter Boyle
5d29e175d8
Typo fix
2021-08-10 18:25:43 +01:00
Peter Boyle
417dbfa257
Fix
2021-08-10 08:55:35 -07:00
peterx.a.boyle
1eda4d8e0b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-08-10 05:41:18 -07:00
peterx.a.boyle
50181f16e5
Level 0 IPC set up
2021-08-10 05:35:15 -07:00
Peter Boyle
75030637cc
Improved comms benchmark, same as benchmark_comms_host_device
2021-08-10 05:16:30 -07:00
Peter Boyle
fe5aaf7677
Make comms benchmark same as Benchmark_comms_host_device
2021-08-09 04:06:30 -07:00
Peter Boyle
80ac2a73ca
Check is wrong (HtoD / DtoH)
2021-08-05 18:33:20 -04:00
Andrew Yong
770680669d
Whitespace removal.
2021-08-04 09:21:59 +01:00
Andrew Yong
0cdfc5cf22
Merge remote-tracking branch 'upstream/develop' into develop
2021-07-30 14:40:55 +01:00
d75a66a3e6
test done
2021-07-06 11:42:36 +01:00
fcc4374d7b
i/o done
2021-07-05 14:52:00 +01:00
67c3c16fe5
working test
2021-07-05 14:41:52 +01:00
25e9be50b5
created test file
2021-07-02 15:51:19 +01:00
428b8ba907
Updated from upstream and added halo benchmark
2021-06-29 01:05:12 +01:00
323cf6c038
make message consistent with configure script
2021-06-23 17:00:43 +01:00
Peter Boyle
29a22ae603
Simpler SYCL setup
2021-06-22 17:57:20 +00:00
Peter Boyle
403bff1a47
Force reqd subgroup size fo SYCL
2021-06-22 17:56:10 +00:00
Christoph Lehner
c50f27e68b
Make FFT play nice with split grid
2021-06-20 11:34:38 +02:00
Peter Georg
80afacec5b
nvcc: Add -fopenmp to LDFLAGS
2021-06-17 13:05:13 +02:00
Peter Boyle
6cd9224dd7
SYCL comms buffer allocate
2021-06-16 17:10:55 +00:00
Peter Boyle
4bf8196ff1
Merge branch 'develop' of https://www.github.com/paboyle/Grid into develop
2021-06-15 21:45:36 +00:00
Peter Boyle
4c5440fb06
const happy for sycl
2021-06-15 21:45:07 +00:00
a269a3d919
Merge pull request #358 from mmphys/feature/serialisation-test
...
Add a ragged std::vector to the serialisation test
2021-06-09 10:16:25 +01:00
Michael Marshall
0c4f585496
Test nested std::vector<grid tensor>
2021-06-08 00:05:35 +01:00
Michael Marshall
33d2df46a0
Merge branch 'develop' into feature/serialisation-test
...
* develop:
Update README.md
removing Travis CI constantly failing due to overtime (no way we can compile Grid on free time anymore)
2021-06-07 23:25:38 +01:00
Michael Marshall
2df308f649
Add a ragged vector to the serialisation tests. NB: Already had nested (regular) std::vector<std::vector<...>>
2021-06-07 23:25:07 +01:00
Peter Boyle
92def28bd3
Update README.md
2021-06-06 04:52:05 -04:00
ca10bfa1c7
removing Travis CI constantly failing due to overtime (no way we can compile Grid on free time anymore)
2021-06-04 11:12:22 +01:00
298a6ec51e
Merge pull request #357 from mmphys/bugfix/ragged
...
Bugfix/ragged Multi-dimensional ragged vectors
2021-06-04 10:34:46 +01:00
Michael Marshall
e5dbe488a6
Merge branch 'develop' into bugfix/ragged
...
* develop:
Remove synch
2021-06-03 08:25:56 +01:00
Peter Boyle
0e27e3847d
Remove synch
2021-06-03 04:24:19 +00:00
Michael Marshall
393727b93b
Documentation update (briefly) covering serialisation changes. For review
2021-06-01 15:49:37 +01:00
Michael Marshall
2b1fcd78c3
Fixes post review with Peter: a) Correct bug in isRegularShape - detect 3d matrix where 1st slice is 2x2 and second slice is 2x1; b) Synchronisation of EigenResizeCounter done by checking we're the OMP primary thread; c) Move definition of EigenResizeCounter to new file, BaseIO.cc
2021-05-31 22:24:54 +01:00
Michael Marshall
0a4e0b49a0
BaseIO: Added "EigenResizeCounter" to keep track of any allocations/deallocations to Eigen tensors during readback. On read, if the tensor is resized, EigenResizeCounter += delta memory (in bytes)
2021-05-31 12:49:56 +01:00
Michael Marshall
76af169f05
Add global namespace to Writer<T> and Reader<T> inside GRID_SERIALIZABLE_CLASS_MEMBERS (so that "using Grid" not necessary).
...
Fix issue with output of Grid::iMatrix so that M<3>{{148,149,150,} {151,152,153,} {154155156}} becomes M<3>{{148,149,150} {151,152,153} {154,155,156}}
2021-05-31 08:43:02 +01:00
Michael Marshall
7b89232251
Extended HDF5 serialisation of std::vector<T> where T now also includes Grid scalar/vector/matrix
...
Changed VectorUtils element traits to is_flattenable, because: a) contract changed on what it does; and b) no other Grid dependencies on element. Needs review.
Initial tests work ... needs proper regression testing.
2021-05-30 20:27:53 +01:00
Peter Boyle
b5aeae526f
Make Cshift fields static to avoid repeated reallocaate overhead
2021-05-28 16:33:08 +02:00
Michael Marshall
ef0ddd5d04
std::vector serialisation in hdf5 uses a different format if the vector is ragged. When reading back std::vector we need to check which format we're reading (since we don't know a priori) and this involves looking for attributes that may not exist. The c++ API: a) throws; and b) prints voluminous logging. Switched to non-throwing, non-logging, C version of the API after code review.
2021-05-24 18:43:55 +01:00
Michael Marshall
9b73dacf50
First row might still be ragged if multi dimensional. attrExists() doesn't throw, but easier to wrap in try ... catch than to explain in comment.
2021-05-22 04:34:32 +01:00
Michael Marshall
244b4aa07f
Serialise std::vector of numeric types as multidimensional object if size is regular ... or individually if ragged
2021-05-21 20:08:56 +01:00
u61464
8cfc7342cd
staggered hand unroll read coalesce
2021-05-05 14:17:18 -07:00
u61464
15ae317858
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-05-04 08:40:38 -07:00
u61464
834f536b5f
Fastest option on SyCL is now std::complex
2021-05-04 08:40:18 -07:00
Peter Boyle
c332d9f08b
Merge pull request #356 from felixerben/bugfix/stoutSmearing
...
Jamie's fix
2021-04-27 14:10:49 -04:00
cf2923d5dd
Jamie's fix
2021-04-27 16:53:37 +01:00
Peter Boyle
0e4413ddde
Merge pull request #355 from felixerben/bugfix/stoutSmearing
...
bugfix 3D stout smearing
2021-04-27 08:01:55 -04:00
009ccd581e
bugfix 3D stout smearing
2021-04-26 10:36:33 +01:00
Peter Boyle
8cd4263974
Tests compile
2021-04-25 22:20:37 -04:00
Peter Boyle
d45c868656
Change interface
2021-04-25 10:53:34 -04:00
Peter Boyle
955a8113de
Expose label only to reduce number of parameters
2021-04-25 10:36:38 -04:00
Peter Boyle
dbe210dd53
Open the ens_id
2021-04-25 10:25:59 -04:00
54c6b1376d
Quick fix of conserved current implementation in CayleyFermion5D. Now function treats current insertion with appropriate periodic boundary conditions in the mu=3 direction.
2021-04-21 16:56:46 +01:00
Peter Boyle
86e11743ca
set twists
2021-04-20 10:19:11 -04:00
f3f11b586f
Tadpole sign now in front of forward hopping term to be consistent with previous implementation and analytic form.
2021-04-17 12:44:27 +01:00
8083e3f7e8
Sign factor for tadpole implementation corrected.
2021-04-15 11:14:31 +01:00
Peter Boyle
980e721f6e
Update MetaData.h
2021-04-13 09:33:01 -04:00
364793154b
Reverted checkerboard changes
2021-04-09 15:47:17 +01:00
3e2ae1e9af
Added profiling messages to pick and set checkerboard functions
2021-04-08 16:58:47 +01:00
Henrique Rocha
d38ae2fd18
Merge branch 'develop' of https://github.com/Heinrich-BR/Grid into develop
2021-04-06 17:18:39 +01:00
Henrique Rocha
030e7754e4
Merge remote-tracking branch 'upstream/develop' into develop
2021-04-06 17:16:13 +01:00
Peter Boyle
e2a0142d87
Merge pull request #348 from AndrewYongZhenNing/develop
...
Conserved Tadpole Implementation for Shamir Action Only
2021-04-06 10:49:00 -04:00
895244ecc3
Merge with upstream; implemented conserved tadpole for Shamir action.
2021-04-06 13:46:33 +01:00
addeb621a7
Implemented tadpole operator for Shamir action.
2021-04-06 13:45:37 +01:00
3b7fce1e76
Reverted checkerboard changes
2021-04-02 14:38:41 +01:00
4d15417f93
Merge remote-tracking branch 'upstream/develop' into develop
2021-04-01 18:28:15 +01:00
ab3c855f65
Merge branch 'develop' of https://github.com/Heinrich-BR/Grid into develop
2021-04-01 18:22:05 +01:00
92e2c517d8
Changed pick- and setCheckerboard to use accelerator_for
2021-04-01 18:21:19 +01:00
Peter Boyle
a7fb25adf6
Make Cshift fields static to avoid repeated reallocaate overhead
2021-03-29 21:44:14 +02:00
Peter Boyle
e947992957
Improved force terms
2021-03-29 20:04:06 +02:00
Peter Boyle
bb89a82a07
Staggered coalseced read
2021-03-29 20:01:15 +02:00
Christoph Lehner
2bb374daea
hip-friendly
2021-03-19 11:33:23 +01:00
Peter Boyle
8bdadbadac
Cold start
2021-03-18 15:41:14 -04:00
Peter Boyle
15c50a7442
Explicit instantiate the template function
2021-03-18 15:40:42 -04:00
Peter Boyle
49b0af2c95
Update of tests to compile with the sRNG addition.
...
Audited the code conventions (again) with the CPS momentum denominator
and added anti periodic in time to the Test_mobius_force.cc and
tested the Test_dwf_gpforce.
Promoted thesee to test full HMC hamiltonian, tr P^2/2 + phidag MdagM phi
with the same pdot and Udot as audited in the Integrator.h etc...
With full comments and sources for factors.
2021-03-18 09:10:02 -04:00
Peter Boyle
9c2b37218a
sRNG parameter added
2021-03-18 06:24:11 -04:00
Peter Boyle
3c67d626ba
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-12 15:36:55 +01:00
Peter Boyle
51f506553c
Read out the local ID once, and store
2021-03-12 15:33:04 +01:00
Peter Boyle
226be84937
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-12 09:31:50 -05:00
Peter Boyle
001814b442
updated to do list. Start adding DDHMC work items
2021-03-12 09:31:17 -05:00
Peter Boyle
db3ac67506
Update thread issue
2021-03-12 14:55:07 +01:00
Peter Boyle
da91a884ef
NVCC versions found buggy added as guard
2021-03-11 23:54:53 +01:00
Peter Boyle
a71e6755e3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-11 22:43:06 +01:00
Peter Boyle
cd5891eecd
Test that fails on Cuda 11.0
2021-03-11 22:34:28 +01:00
Peter Boyle
5bb7336f27
Merge pull request #347 from pjgeorg/fix-autotools-avx512
...
Fix inconsistent SIMD option AVX512
Thanks
2021-03-11 16:29:07 -05:00
Peter Boyle
ce1fc1f48a
Possible fallback plan for Fionn's compiler bbug in nvcc
2021-03-11 22:20:53 +01:00
Peter Georg
82402c6a7c
Add simd option SKL for ICC
2021-03-11 13:08:40 +01:00
Peter Georg
d9c4afe5b7
Fix inconsistent configure option AVX512
...
Before this change AVX512 enabled different instruction sets depending
on the compiler:
For Intel C++ Compiler Classic (ICC):
AVX512F, AVX512CD, AVX512DQ, AVX512BW, AVX512VL
i.e. Intel Xeon Skylake and newer
For Intel ICX, gcc, clang:
AVX512F, AVX512CD, AVX512ER, AVX512PF
i.e. Intel Xeon Phi x200/x205 (KNL/KNM)
With this commit AVX512 now only enables the common instruction sets
supported by all CPUs supporting any AVX-512 instructions set:
AVX512F and AVX512CD (called COMMON-AVX512 by icc)
2021-03-11 12:58:49 +01:00
Peter Boyle
f786ff8d69
Extend test from Fionn, fails on A100 apparently
2021-03-10 14:32:06 -05:00
u61464
a651caed5f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-10 06:23:51 -08:00
u61464
0e21adb3f6
Gives 200GF/s on SyCL/DG1 8^4, doesn't uglify develop for other platforms too badly.
...
Easy to revert to clean more C++ stylistic code. Theres a SYCL_HACK macro I will clean up later once dpcpp
evolves a central nervous systems.
2021-03-10 05:40:51 -08:00
Peter Boyle
58bf9b9e6d
Clean up test
2021-03-10 02:45:22 +01:00
Peter Boyle
2146eebb65
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-09 04:31:46 +01:00
Peter Boyle
6a429ee6d3
2d loop hits Nvidia 16bit limit on large local vols
2021-03-09 04:31:10 +01:00
Peter Boyle
4d1ea15c79
More verbosity. The 16bit limit on Grid.y, Grid.z is annoying
2021-03-09 04:29:37 +01:00
Peter Boyle
a76cb005e0
Update Tensor_exp.h
2021-03-08 13:37:57 -05:00
Christoph Lehner
49ecbc81d4
Merge pull request #24 from ThomasWurm/feature/gpt
...
Put GlobalSum outside the slice loop in sliceSum
2021-03-08 16:01:47 +01:00
Thomas Wurm
9e5fb52eb9
Put GlobalSum outside the slice loop
2021-03-08 13:53:34 +01:00
Peter Boyle
a9604367c1
Merge pull request #336 from lehner/feature/gpt
...
Make ShmDims configurable; adjust GRID_MAX_SIMD to allow for 128 byte width on GPUs
2021-03-05 13:17:19 -05:00
Peter Boyle
d7065023cc
Merge pull request #332 from mmphys/feature/mres_schur
...
Optional changes to Test_cayley_mres e.g. Schur solver
2021-03-05 12:47:07 -05:00
Peter Boyle
89d299ceec
Merge pull request #333 from mmphys/bugfix/LatTransfer
...
Fix convertType for GPU in Lattice_transfer.h
2021-03-05 12:46:33 -05:00
Peter Boyle
e34eda66df
Merge pull request #344 from felixerben/feature/XiToSigma
...
Feature/xi to sigma
2021-03-05 12:45:44 -05:00
Christoph Lehner
b24181aa4f
Update Coordinate.h
...
Revert GRID_MAX_SIMD change
2021-03-05 16:56:58 +01:00
Peter Boyle
aa173e2998
Update README.md
2021-03-05 10:25:33 -05:00
7a19432e0b
whitespace
2021-03-05 10:57:09 +00:00
9b15704290
tested and consitent
2021-03-05 10:42:32 +00:00
Michael Marshall
017f955b2d
Merge branch 'develop' into feature/mres_schur
...
* develop:
Pass serial RNG around
Sycl happier
2021-03-04 20:42:02 +00:00
Michael Marshall
f252d69eef
Merge branch 'develop' into bugfix/LatTransfer
...
* develop:
Pass serial RNG around
Sycl happier
2021-03-04 20:41:30 +00:00
3b06e4655e
Merge branch 'develop' into feature/XiToSigma
2021-03-04 20:06:16 +00:00
d4b4de8f42
changes
2021-03-04 20:01:24 +00:00
Peter Boyle
c90beee774
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-03-03 23:50:29 +01:00
Peter Boyle
1eea9d73b9
Pass serial RNG around
2021-03-03 23:50:01 +01:00
u61464
679d1d22f7
Sycl happier
2021-03-03 11:21:43 -08:00
Michael Marshall
b2b5e0b98c
Merge branch 'develop' into feature/mres_schur
...
* develop:
Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case. Other cases to do. This now includes comms code path.
Better SIMD usage/coalescence
2021-03-03 16:15:12 +00:00
Michael Marshall
03e54722c1
Merge branch 'develop' into bugfix/LatTransfer
...
* develop:
Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case. Other cases to do. This now includes comms code path.
2021-03-03 16:13:23 +00:00
Peter Boyle
442336bd96
Hand unrolled to use optimised code paths on GPU for coalesced reads in Wilson case.
...
Other cases to do. This now includes comms code path.
2021-03-02 14:50:51 +01:00
Christoph Lehner
9c9566b9c9
Merge pull request #23 from paboyle/develop
...
Sync
2021-03-01 12:33:51 +01:00
Michael Marshall
1059a81a3c
Merge branch 'develop' into bugfix/LatTransfer
...
* develop:
Better SIMD usage/coalescence
2021-02-27 00:21:36 +00:00
Peter Boyle
2e61556389
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-02-26 17:52:20 +01:00
Peter Boyle
f9b1f240f6
Better SIMD usage/coalescence
2021-02-26 17:51:41 +01:00
Michael Marshall
69f41469dd
Merge branch 'develop' into bugfix/LatTransfer
...
* develop: (26 commits)
Added the ability to apply a custom "filter" to the conjugate momentum in the Integrator classes, applied both after refresh and after applying the forces Added a conjugate momentum "filter" that applies a phase to each site. With sites set to 1.0 or 0.0 this acts as a mask and enables, for example, the freezing of inactive gauge links in DDHMC Added tests/forces/Test_momentum_filter demonstrating the use of the filter to freeze boundary links
Correct misleading ac help string
Enable performance counting in WilsonFermion like in others
changed back A2AUtils warning
changed if and accelerator_for - no runtime errors any more
Mac OS (Darwin) sed -i flag for in-place editing differs from posix / gnu
Seems the intention with AutoConf produced Grid/Config.h was to use sed to translate standard PACKAGE_ #defines into GRID_ however due to missing '' after -i this hasn't been working. Perhaps it is too late to fix this, since we don't know who/what is relying on this downstream? ... but if they are, and AutoConf is being used, then likely these #defines have just been redefined anyway. Seems reasonable to redefine PACKAGE and VERSION as well, as none of these macros are used throughout Grid or Hadrons.
Fixed compile issues with maxLocalNorm2 for non-scalar lattices maxLocalNorm2 test now reuses the random field
MADWF 5d source option for hadrons - look at Grid of source Abort on GPU error
maxLocalNorm2()
change back benchmark_ITT
prettify
Flop cout matches DiRAC-ITT-2020
revert changes
merge develop
fixes
weird bug in 2pt function...
revert changes
final version, tested on CPU and GPU
bugfix
...
2021-02-25 09:19:17 +00:00
Michael Marshall
d620b303ff
Merge branch 'develop' into feature/mres_schur
...
* develop: (26 commits)
Added the ability to apply a custom "filter" to the conjugate momentum in the Integrator classes, applied both after refresh and after applying the forces Added a conjugate momentum "filter" that applies a phase to each site. With sites set to 1.0 or 0.0 this acts as a mask and enables, for example, the freezing of inactive gauge links in DDHMC Added tests/forces/Test_momentum_filter demonstrating the use of the filter to freeze boundary links
Correct misleading ac help string
Enable performance counting in WilsonFermion like in others
changed back A2AUtils warning
changed if and accelerator_for - no runtime errors any more
Mac OS (Darwin) sed -i flag for in-place editing differs from posix / gnu
Seems the intention with AutoConf produced Grid/Config.h was to use sed to translate standard PACKAGE_ #defines into GRID_ however due to missing '' after -i this hasn't been working. Perhaps it is too late to fix this, since we don't know who/what is relying on this downstream? ... but if they are, and AutoConf is being used, then likely these #defines have just been redefined anyway. Seems reasonable to redefine PACKAGE and VERSION as well, as none of these macros are used throughout Grid or Hadrons.
Fixed compile issues with maxLocalNorm2 for non-scalar lattices maxLocalNorm2 test now reuses the random field
MADWF 5d source option for hadrons - look at Grid of source Abort on GPU error
maxLocalNorm2()
change back benchmark_ITT
prettify
Flop cout matches DiRAC-ITT-2020
revert changes
merge develop
fixes
weird bug in 2pt function...
revert changes
final version, tested on CPU and GPU
bugfix
...
2021-02-24 18:07:27 +00:00
Peter Boyle
157fd1428d
Merge pull request #342 from paboyle/feature/link-update-mask
...
Feature/link update mask
2021-02-24 11:29:52 -05:00
Christopher Kelly
c791cb2214
Merge branch 'develop' into feature/link-update-mask
2021-02-23 11:51:54 -05:00
Christopher Kelly
d5ab571a89
Added the ability to apply a custom "filter" to the conjugate momentum in the Integrator classes, applied both after refresh and after applying the forces
...
Added a conjugate momentum "filter" that applies a phase to each site. With sites set to 1.0 or 0.0 this acts as a mask and enables, for example, the freezing of inactive gauge links in DDHMC
Added tests/forces/Test_momentum_filter demonstrating the use of the filter to freeze boundary links
2021-02-23 11:49:56 -05:00
0ed800f6e4
merge develop
2021-02-23 14:54:46 +00:00
Peter Boyle
0a32183825
Merge pull request #335 from felixerben/gpu/baryons
...
Gpu/baryons
2021-02-23 09:30:16 -05:00
Peter Boyle
2cacfbde2a
Merge pull request #341 from DanielRichtmann/fix/minor-things
...
Minor fixes
2021-02-22 09:28:50 -05:00
Daniel Richtmann
c073e62e0b
Correct misleading ac help string
2021-02-22 15:25:44 +01:00
Daniel Richtmann
e3d019bc2f
Enable performance counting in WilsonFermion like in others
2021-02-22 15:25:40 +01:00
7ae030f585
changed back A2AUtils warning
2021-02-18 13:24:50 +00:00
86b58d5aff
changed if and accelerator_for - no runtime errors any more
2021-02-18 12:04:32 +00:00
Peter Boyle
26e8b9f4a5
Merge pull request #340 from mmphys/bugfix/config
...
Mac OS (Darwin) sed -i flag for in-place editing differs from posix / gnu
2021-02-17 11:56:21 -05:00
Michael Marshall
35114c9e62
Mac OS (Darwin) sed -i flag for in-place editing differs from posix / gnu
2021-02-17 13:24:15 +00:00
Peter Boyle
dfd28a85c9
Merge pull request #339 from mmphys/bugfix/config
...
Optional rename PACKAGE_ to GRID_ in Grid/Config.h
2021-02-15 13:53:26 -05:00
Michael Marshall
a503332924
Seems the intention with AutoConf produced Grid/Config.h was to use sed to translate standard PACKAGE_ #defines into GRID_ however due to missing '' after -i this hasn't been working.
...
Perhaps it is too late to fix this, since we don't know who/what is relying on this downstream? ... but if they are, and AutoConf is being used, then likely these #defines have just been redefined anyway. Seems reasonable to redefine PACKAGE and VERSION as well, as none of these macros are used throughout Grid or Hadrons.
2021-02-14 21:27:54 +00:00
Peter Boyle
1ac13ec3a7
Merge pull request #338 from paboyle/bugfix/maxnorm2
...
Fixed compile issues with maxLocalNorm2 for non-scalar lattices
2021-02-08 12:08:11 -05:00
Christopher Kelly
55de69a569
Fixed compile issues with maxLocalNorm2 for non-scalar lattices
...
maxLocalNorm2 test now reuses the random field
2021-02-08 12:03:16 -05:00
Peter Boyle
eda9ab487b
MADWF 5d source option for hadrons - look at Grid of source
...
Abort on GPU error
2021-02-08 10:47:22 -05:00
Peter Boyle
cd99edcc5f
maxLocalNorm2()
2021-02-04 18:25:49 -05:00
Christoph Lehner
4705aa541d
Allow user to configure ShmDims via environment variables
2021-02-04 14:25:55 +01:00
Michael Marshall
3215d88a91
Simplify syntax with Grid::EnableIf post code review. Updated EnableIf so that ReturnType defaults to void in same way as std::enable_if see https://en.cppreference.com/w/cpp/types/enable_if
2021-02-03 15:17:03 +00:00
9b9a53f870
...
2021-02-02 13:06:43 +00:00
Christoph Lehner
019ffe17d4
Allow for GPU vector width beyond 64
2021-02-02 11:32:23 +01:00
bc496dd844
change back benchmark_ITT
2021-01-28 14:29:56 +00:00
a673b6a54d
prettify
2021-01-28 14:15:09 +00:00
1bf2e4d187
Merge branch 'develop' into gpu/baryons
2021-01-27 21:17:37 +00:00
Peter Boyle
96dd7a8fbd
Flop cout matches DiRAC-ITT-2020
2021-01-27 21:14:52 +00:00
7905afa9f5
revert changes
2021-01-27 21:14:52 +00:00
712bb40650
merge develop
2021-01-27 21:14:52 +00:00
81d88d9f4d
fixes
2021-01-27 21:09:51 +00:00
Michael Marshall
77063418da
Fix issue for GPU by ensuring accelerator_inline version of convertType is available for Grid::complex<T>. This removes many warnings in Hadrons
...
Simplify the SFINAE syntax and correct convertType for iScalar
2021-01-25 15:09:36 +00:00
Michael Marshall
2983b6fdf6
Optional (superficial) changes to make comparison with Hadrons WardIdentity module easier: use Schur solver; example of Hadrons random gauge init; logging updates; only solve reverse propagator if provided
2021-01-23 12:41:48 +00:00
Peter Boyle
69f1f04f74
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2021-01-21 21:39:59 -05:00
Peter Boyle
11a5fd09d6
Hot config
2021-01-21 21:39:41 -05:00
Peter Boyle
ff1fa98808
Fix for GPU conserveed current
2021-01-21 21:38:23 -05:00
df16202865
weird bug in 2pt function...
2021-01-19 19:25:27 +00:00
3ff7c2c02a
Merge branch 'develop' into gpu/baryons
2021-01-19 12:34:13 +00:00
fc6d07897f
revert changes
2021-01-19 12:32:48 +00:00
f9c8e5c8ef
Merge branch 'develop' of github.com:paboyle/Grid into develop
2021-01-19 12:30:29 +00:00
8bfa0e74f8
final version, tested on CPU and GPU
2021-01-19 12:27:57 +00:00
9b73a937e7
bugfix
2021-01-18 18:57:05 +00:00
Peter Boyle
b0339bc5a4
Merge branch 'feature/conjugate-bc-dirs' into develop
2021-01-15 09:28:39 -05:00
Peter Boyle
3c23a947cc
Fixed test for very much non-unit det
2021-01-15 09:16:02 -05:00
Peter Boyle
56111bb823
Merge branch 'develop' into feature/conjugate-bc-dirs
2021-01-14 21:01:22 -05:00
Peter Boyle
99445673f6
Gparity fix, and plaquette IO
2021-01-14 21:00:36 -05:00
Peter Boyle
97a59643f7
Red black coarse space
2021-01-14 20:49:13 -05:00
Peter Boyle
579595f547
Red black on coarse space
2021-01-14 20:48:35 -05:00
Peter Boyle
281ac5fc12
Red black support on coars
2021-01-14 20:48:08 -05:00
Peter Boyle
d8fa903b02
G5 on coarse spaces
2021-01-14 20:47:28 -05:00
Peter Boyle
eaff0f3aeb
Gamma5 on coaree spaces
2021-01-14 20:46:58 -05:00
Peter Boyle
e8e20c01b2
Coarsened vector test
2021-01-14 20:46:21 -05:00
Peter Boyle
a4afc3ea2a
Red black coarse space
2021-01-14 20:44:16 -05:00
fa12b9a329
bugfix
2021-01-13 10:04:17 +00:00
45fc7ded3a
test for sum
2021-01-12 09:10:37 +00:00
74de2d9742
whitespace changes
2021-01-08 18:28:36 +00:00
e759367d42
tested and working
2021-01-08 18:04:50 +00:00
Christoph Lehner
299d0de066
Merge pull request #21 from paboyle/develop
...
Sync
2020-12-22 20:59:15 +01:00
Peter Boyle
3fe75bc7cb
Merge pull request #329 from nmeyer-ur/feature/a64fx-3
...
Revised dslash/dwf kernels for A64FX
2020-12-20 08:17:15 -05:00
Nils Meyer
45d49d8648
clean up
2020-12-19 03:35:18 +01:00
Nils Meyer
6013183361
removed Asm impls
2020-12-19 03:25:01 +01:00
Nils Meyer
4b882e8056
fixed lost bracket
2020-12-19 03:09:20 +01:00
Nils Meyer
3f9ae6e7e7
Merge branch 'develop' into feature/a64fx-3
2020-12-19 02:37:11 +01:00
Nils Meyer
909acd55cd
vnum variant for prefetches
2020-12-19 02:00:22 +01:00
Nils Meyer
4dd9e39e0d
up to +36% performance gain for dslash/dwf on QPACE 4 using GCC 10.1.1
2020-12-19 00:54:31 +01:00
Christoph Lehner
b4c1317ab4
Merge pull request #22 from DanielRichtmann/feature/clover-access-specifier
...
Clover access specifier
2020-12-18 16:20:19 +01:00
f36d6f3923
compiles on GPU. 3pt still wrong!!!!
2020-12-17 17:04:08 +00:00
Peter Boyle
7adb253e25
Merge pull request #328 from mmphys/feature/mrespatch
...
Enable existing conserved current code for CUDA
2020-12-17 11:10:29 -05:00
808f1e0e8c
merge develop
2020-12-15 16:33:29 +00:00
Michael Marshall
873519e960
Enable existing conserved current code for CUDA (compiles OK for CUDA 10.1). Add option to Test_cayley_mres to load a configuration
2020-12-14 16:06:10 +00:00
Peter Boyle
9aec4a3c26
SYCL
2020-12-10 02:11:17 -08:00
Daniel Richtmann
c438118fd7
Change access specifier of clover fields in order to allow deriving classes to access these
2020-12-08 14:42:11 +01:00
Peter Boyle
70510d151b
Merge pull request #327 from paboyle/feature/gparity_twist_GPU
...
Feature/gparity twist gpu
2020-12-07 12:02:20 -05:00
Christopher Kelly
9e7bacb5a4
Merge branch 'develop' into feature/gparity_twist_GPU
2020-12-07 11:55:39 -05:00
Christopher Kelly
2ef1fa66a8
Improved performance of G-parity kernel for GPUs by simplifying multLink implementation
2020-12-07 11:53:35 -05:00
Peter Boyle
cf76741ec6
Intel DPCPP Gold happy now (compiles all, runs Benchmark_dwf_fp32 )
2020-12-03 03:47:11 -08:00
Peter Boyle
497e7c1c40
Duplicate code
2020-12-02 17:55:30 -08:00
Peter Boyle
888eacd3b8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-24 21:46:33 -05:00
Peter Boyle
321f0f51b5
Project to SU(N)
2020-11-24 21:46:10 -05:00
Christoph Lehner
17ec9c5545
Merge pull request #20 from paboyle/develop
...
Sync
2020-11-24 12:20:43 +01:00
Peter Boyle
30ad9578a2
Merge branch 'lehner-feature/gpt' into develop
2020-11-24 06:10:24 -05:00
Peter Boyle
9dce101586
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into lehner-feature/gpt
2020-11-24 06:10:16 -05:00
Peter Boyle
97e264d0ff
Christoph's changes
2020-11-23 15:46:11 +00:00
Peter Boyle
683a5e5bf5
Stencil use host vector for integera table on enable-shared=no and mirror it on device
2020-11-23 15:39:51 +00:00
Peter Boyle
d4861a362c
Stencil use non-UVM memory for look up table on enable-shared=no
2020-11-23 15:38:49 +00:00
Peter Boyle
5ff3eae027
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-20 13:14:44 -05:00
Peter Boyle
147dc15d26
Update
2020-11-20 13:13:59 -05:00
Christoph Lehner
c61ea72949
Merge pull request #19 from paboyle/develop
...
Sync
2020-11-20 17:31:13 +01:00
Peter Boyle
86e8b9fe38
ALLOC_ALIGN removed
2020-11-20 17:07:16 +01:00
Peter Boyle
612e468889
Configurable ALLOC_ALIGN and ALLOC_CACHE
2020-11-20 16:48:28 +01:00
Christoph Lehner
4ea8d128c2
Merge pull request #18 from paboyle/develop
...
Sync
2020-11-20 15:36:50 +01:00
Peter Boyle
e49b7f2f88
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-19 19:24:41 +01:00
Peter Boyle
aace3d47b9
partial work in progress
2020-11-19 19:24:14 +01:00
Peter Boyle
d5049949a4
Starting to fix reunitarise
2020-11-19 19:23:41 +01:00
Peter Boyle
f1c7480e3c
Warning remove
2020-11-19 19:23:03 +01:00
Peter Boyle
5adae5d6ff
Unused variable remove
2020-11-19 19:22:12 +01:00
Peter Boyle
a8412ace05
Merge pull request #317 from i-kanamori/develop
...
adding an error check for input: Parameters.StartingType
2020-11-18 23:09:40 -05:00
Peter Boyle
9fd1c2ad4b
Merge pull request #325 from DanielRichtmann/feature/threaded-clover-inversion
...
Threaded clover term inversion
2020-11-18 23:08:37 -05:00
Peter Boyle
4cf3575353
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-18 03:07:36 +00:00
Peter Boyle
804a810d68
Wildcard mismatch
2020-11-18 03:06:53 +00:00
Peter Boyle
8fcb392e24
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-17 04:51:31 -08:00
Peter Boyle
dd8d70eeff
Build without LIME
2020-11-17 04:41:15 -08:00
Peter Boyle
aa8aba6543
--shm-force-mpi
2020-11-16 20:15:50 -05:00
Peter Boyle
13df14f96e
Switch off SHM paths with --disable-shm
2020-11-16 18:07:15 -05:00
Peter Boyle
3aab983760
Flop count set as in DiRAC-ITT-2020 (mistaken 20% low, but must maintain consistency)
2020-11-16 17:13:58 +01:00
Peter Boyle
9c4dcc5ea3
Merge branch 'master' into develop
2020-11-16 16:34:57 +01:00
Peter Boyle
a1063ddbb9
Update options and simplify
2020-11-13 04:11:03 +01:00
Peter Boyle
18ef8056ec
Hide Shared Memory
2020-11-13 04:10:40 +01:00
Peter Boyle
1c673977fa
Must ask for COMMMS_THREADS
2020-11-13 03:59:36 +01:00
Peter Boyle
e9bc748828
Useful GPU machine benchmark for GDR used to shakeout Booster at Juelich - see slack earlyaccess channel
2020-11-13 03:58:34 +01:00
Peter Boyle
f48156529b
Work on 2,2,2,8 ranks
2020-11-13 03:57:58 +01:00
Peter Boyle
d05ce01809
TOFU behaviour now optional THREAD_MULTIPLE or THREAD_SERIALIZED
2020-11-13 03:52:19 +01:00
Peter Boyle
cf23eff60e
Device to Device, Memset, cannot assume UVM == Communicable
2020-11-13 03:51:08 +01:00
Peter Boyle
6e313575be
Use of default GPU is behaviour, not a system property. Move Summit specific to configure.ac
2020-11-13 03:50:16 +01:00
Peter Boyle
b13d1f7238
TOFU compat flag to help Isaaku
2020-11-13 03:49:44 +01:00
Peter Boyle
b5e7945dd9
Option for host or device Cshift implementation
2020-11-13 01:38:54 +01:00
Peter Boyle
7535566f54
Option for bounce through the SHM buffer
2020-11-12 22:54:27 +01:00
Peter Boyle
50b808ab33
Configure option between host and device
2020-11-12 22:28:12 +01:00
Peter Boyle
f16c2665f5
Host memory explict
2020-11-12 20:29:58 +01:00
Peter Boyle
41e28015ae
Volume divisible guarantee
2020-11-07 13:32:16 +01:00
3594ce877b
speedup in Sigma-to-nucleon
2020-11-03 20:04:30 +00:00
9bae6b889a
speedup in Sigma-to-nucleon
2020-11-03 20:03:09 +00:00
4014dfd5b9
first tested version
2020-11-03 16:13:08 +00:00
67023c334b
bugfix
2020-11-03 13:07:37 +00:00
a3de7026c8
bugfix
2020-11-03 12:51:50 +00:00
ee11678b1f
added Xi-to-Sigma rare decays
2020-11-03 12:41:35 +00:00
Peter Boyle
a0ccbb3bd6
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-11-01 01:16:35 +00:00
Peter Boyle
5eeabaa2bb
HIP fix
2020-11-01 01:16:01 +00:00
Peter Boyle
00d0d6d008
Hip Free managed
2020-10-31 18:14:31 -04:00
Peter Boyle
537a9f7030
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-10-31 18:13:30 -04:00
Peter Boyle
cc9c993f74
Project on group fix on GPU tracked to reciprocal sqrt collision between CUDA and Grid rsqrt
2020-10-31 18:12:47 -04:00
Peter Boyle
d10422ded8
Test project on group
2020-10-31 18:12:30 -04:00
Peter Boyle
f313565a3c
HiP compile
2020-10-31 12:12:40 +00:00
Daniel Richtmann
b3881d2636
Thread inversion of clover term
2020-10-30 16:18:58 +01:00
61d5860b46
Merge pull request #318 from rrhodgson/feature/BaryonSpinMat
...
Added untraced baryon contraction code
2020-10-28 18:39:59 +00:00
52d17987dc
BaryonUtils.h updated debug output
2020-10-23 11:41:08 +01:00
19d8bba97d
BaryonUtils function naming change
2020-10-21 11:58:53 +01:00
463d72d322
Added untraced baryon contraction code
2020-10-19 16:13:28 +01:00
d060341168
add an error check for Parameters.StartingType
2020-10-16 21:39:17 +09:00
c772bcd514
Merge https://github.com/paboyle/Grid into develop
2020-10-16 20:30:32 +09:00
Peter Boyle
3362f8dfa0
happy compile
2020-10-14 22:59:41 -04:00
Peter Boyle
bf3c9857e0
Closure changes
2020-10-14 21:37:14 -04:00
Peter Boyle
a88b3ceca5
Closure cases
2020-10-14 21:33:51 -04:00
Peter Boyle
aa135412f5
toComplex, toReal
2020-10-13 22:25:01 -04:00
Peter Boyle
9945399e60
Reaality issues fix by drop from ET
2020-10-13 22:24:32 -04:00
Peter Boyle
5eeffa49e8
Reality forced included
2020-10-13 22:23:57 -04:00
Peter Boyle
3f06209720
Pretty print
2020-10-13 22:18:51 -04:00
Peter Boyle
12e239dd9f
Merge branch 'release/dirac-ITT-2020'
2020-10-13 13:38:29 -04:00
Peter Boyle
af2301afbb
Merge pull request #312 from i-kanamori/debug_512
...
add reordring of random number generators in IO
2020-10-13 11:42:12 -04:00
Peter Boyle
f98856a26f
Merge pull request #314 from smangham/issue_readme_precision
...
Fix for deprecated configure options in documentation (issue #313 )
2020-10-13 11:41:38 -04:00
Sam Mangham
d55cc5b380
Fixed typo on --enable-comm, removed all references to --enable-precision except for config options, where it is listed as deprecated. Removed travis test for single precision.
2020-10-12 12:33:13 +01:00
c2b688abc9
Benchmark_IO: reducing max local volume to 32^4
2020-10-10 16:52:56 +01:00
b0d61b9687
Benchmark_IO cleaner output
2020-10-09 21:46:45 +01:00
5f893bf9af
Benchmark_IO procurement sizes
2020-10-09 21:31:59 +01:00
0e17bd6597
I/O benchmark cleanup
2020-10-09 20:29:57 +01:00
22caa158cc
multi-pass I/O benchmark, with statistic and robustness summary
2020-10-09 20:29:40 +01:00
b24a504d7c
hook to access last parallel I/O performance measurement
2020-10-09 20:28:54 +01:00
Peter Boyle
992ef6e9fc
more runtime
2020-10-08 22:19:20 -04:00
Peter Boyle
f32a320bc3
Single prec benchmark in double prec compile
2020-10-08 19:52:08 -04:00
Peter Boyle
5f0fe029d2
Improve meemory benchmarks for GPU (avoid host mem ping pong)
2020-10-08 19:51:28 -04:00
6b1486e89b
fixing number of colours defaulting to 4 in most cases
2020-10-08 16:31:24 +01:00
Peter Boyle
3f9c427a3a
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-10-07 13:12:57 -04:00
Peter Boyle
d201277652
Expose Nc as a compile time configure option.
...
Remove precision option
2020-10-07 13:07:00 -04:00
fdda7cf9cf
Merge branch 'feature/benchmark-io-update' into develop
2020-10-07 15:57:53 +01:00
e22d30f715
Merge branch 'develop' into feature/benchmark-io-update
2020-10-07 15:56:39 +01:00
1ba25a0d8c
more I/O benchmark code cleaning
2020-10-07 15:38:41 +01:00
9ba3647bdf
script to convert I/O benchmark logs to CSV
2020-10-07 15:35:03 +01:00
5ee832f738
I/O benchmark code cleaning
2020-10-07 15:31:51 +01:00
467deee46f
Merge branch 'debug_512' into develop
2020-10-07 15:18:44 +09:00
Peter Boyle
35a69a5133
SU4 x SU4
2020-10-06 21:48:35 -04:00
e9c5a271a8
fixing potential issues with log alignment and timer I/O
2020-10-06 17:58:16 +01:00
acac2d6938
standard C/C++ I/O in benchmark
2020-10-06 17:57:00 +01:00
97db2b8d20
add reordring of random number generator in IO
2020-10-06 17:25:59 +09:00
Christoph Lehner
80fd6ab407
Merge pull request #17 from paboyle/develop
...
sync upstream
2020-10-06 09:01:39 +02:00
Christoph Lehner
5534921bee
Merge pull request #16 from DanielRichtmann/feature/gpt-coarsenedmatrix
...
Enable checkerboard operations for CoarsenedMatrix
2020-10-01 10:55:13 +02:00
Peter Boyle
ace9cd64bb
dpcpp happy
2020-09-29 08:03:46 -07:00
Peter Boyle
a3e2aeb603
dpcpp options happiness
2020-09-29 06:50:10 -07:00
Peter Boyle
049dd25785
Revert accidental commit thanks michael
2020-09-23 04:13:50 -04:00
Peter Boyle
d43d372294
Merge pull request #311 from mmphys/bugfix/MPIasynch
...
Asynchronous calls removed - reflect this in Communicator_none.cc
2020-09-22 10:41:48 -04:00
Michael Marshall
b71a081cba
Asynchronous calls removed - reflect this in Communicator_none.cc
...
(Opportunistic doc update - OpenMP support on Mac OS)
2020-09-21 09:33:23 +01:00
Peter Boyle
c48909590b
MPI asynch call removal
2020-09-17 20:47:32 +01:00
Peter Boyle
446ef40570
HIP IPC
2020-09-17 20:31:46 +01:00
Peter Boyle
81441e98f4
HIP runs sensible
2020-09-16 03:35:03 +01:00
Peter Boyle
ecd3f890f5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-09-16 02:30:14 +01:00
Peter Boyle
1c881ce23c
HIP does not like half2 visible members x and y so must define own Half2
2020-09-16 02:28:33 +01:00
Peter Boyle
dacbbdd051
Hip Happy Birthday
2020-09-16 00:37:02 +01:00
Peter Boyle
2859955a03
HIP requires "inline"
2020-09-16 00:36:13 +01:00
Peter Boyle
cc220abd1d
inline for HIP
2020-09-16 00:35:38 +01:00
Peter Boyle
d1c0c0197e
HipCC requires inline on definition
2020-09-16 00:35:06 +01:00
Peter Boyle
fd9424ef27
innlines required to make HIP happy
2020-09-16 00:34:32 +01:00
Peter Boyle
a5c35c4024
Make HIP / Vega happy
2020-09-16 00:33:53 +01:00
Peter Boyle
e03b64dc06
HIP default flaags to work on ROCM
2020-09-16 00:33:09 +01:00
Peter Boyle
4677c40195
HIP improvements
2020-09-16 00:32:27 +01:00
Peter Boyle
288c615782
Hip improvements
2020-09-16 00:31:50 +01:00
Peter Boyle
48e81cf6f8
Hip Pragmas
2020-09-16 00:31:03 +01:00
Christoph Lehner
5cffa05c7e
remove slab allocator file
2020-09-13 14:06:25 -04:00
Christoph Lehner
d50a2164d7
remove slab allocator
2020-09-13 14:06:06 -04:00
Christoph Lehner
32ff766dbd
fix evict scheme, slab alloc
2020-09-13 14:02:53 -04:00
Christoph Lehner
01652d8cfe
SlabAllocator
2020-09-13 05:56:02 -04:00
Daniel Richtmann
4d2dc7ba03
Enable even-odd for CoarsenedMatrix
2020-09-11 20:32:02 +02:00
Christoph Lehner
51d1beb1f3
Merge pull request #15 from paboyle/develop
...
Sync with upstream
2020-09-07 14:20:33 +02:00
Peter Boyle
65b724bb5f
2 level hddcr
2020-09-03 21:46:43 -04:00
Peter Boyle
6dbd117aa5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-09-03 20:30:49 -04:00
Peter Boyle
198b29f618
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-09-03 20:29:54 -04:00
Peter Boyle
a8309638d4
UVM check in MPI calls
2020-09-03 20:29:26 -04:00
Peter Boyle
f98a4e880e
Merge pull request #310 from kostrzewa/accelerator_vector_stream_op_no_backspace
...
do not use backspace in AcceleratorVector (Coordinate) output stream operator
2020-09-03 20:24:59 -04:00
Peter Boyle
8244caff25
Remove the asynchronous non-Stencil calls.
2020-09-03 18:52:55 -04:00
Peter Boyle
bcd7895362
Include cuda.h
2020-09-03 15:49:13 -04:00
Peter Boyle
85b1c5df39
A never hit case that is not 100% confident is asserted for safety
2020-09-03 15:48:16 -04:00
Peter Boyle
b4255140d6
Stale data member eliminated
2020-09-03 15:47:46 -04:00
Peter Boyle
0c3095e173
Comms buffers to device memory
2020-09-03 15:45:35 -04:00
Peter Boyle
d3ce60713d
UVM, Device and Lattice/aligned allocators
2020-09-03 15:44:13 -04:00
Peter Boyle
eac1f08b7b
Close expressions passed as an argument
2020-09-01 15:30:33 -04:00
Peter Boyle
1654c4f3c0
Closure improved
2020-09-01 15:29:45 -04:00
Peter Boyle
8807d998bc
closure improved
2020-09-01 15:29:11 -04:00
Peter Boyle
5791021dcd
Speed up Cshift more with coalesced
2020-09-01 15:28:15 -04:00
Peter Boyle
c273fb051c
Peek poke laattice
2020-09-01 15:27:59 -04:00
Peter Boyle
c545530170
little worry large Nbasis doesnt compile GPU
2020-09-01 00:14:33 -04:00
Peter Boyle
d982a5b6d5
Fix coaarsened
2020-09-01 00:14:04 -04:00
Peter Boyle
15ca8637f3
No norms in HermOp
2020-09-01 00:13:32 -04:00
Peter Boyle
cbc995b74c
Made better interface
2020-09-01 00:12:54 -04:00
Peter Boyle
8b74174d74
Eigen tensor serialisatiino happy undeer GPU. Regret agreeing to let us couple Eigen types to Grid IO
2020-09-01 00:03:26 -04:00
Peter Boyle
e21fef17df
real and imag part not in ET
2020-08-31 23:56:26 -04:00
Peter Boyle
3d27708f07
Basic where test
2020-08-31 23:55:49 -04:00
Peter Boyle
b918744184
Prettificatoin
2020-08-31 23:54:46 -04:00
Peter Boyle
7d14a3c086
Where working
2020-08-31 23:53:46 -04:00
Peter Boyle
e14a84317d
GPU math unary calls
2020-08-31 23:50:49 -04:00
Peter Boyle
6c31b99f1f
I knew coupling Eigen Tensor to Grid serialisation was a bad iddea.
...
Now the complex is different on GPU creates probblems
2020-08-31 23:49:19 -04:00
Peter Boyle
9522dcd611
Remove dead commented ouot coode
2020-08-31 23:40:29 -04:00
Peter Boyle
ed469898dc
coalesced ET expressions
2020-08-31 23:38:40 -04:00
Peter Boyle
1eee94a809
Sorting real/im in read coalesced GPU ET
2020-08-31 23:36:49 -04:00
Bartosz Kostrzewa
54523369a3
do not use backspace in Coordinate output stream operator
2020-08-31 19:39:36 +02:00
Peter Boyle
a98c91c2a5
Merge pull request #309 from kostrzewa/format_benchmark_wilson_sweep
...
Format benchmark wilson sweep
2020-08-31 12:43:46 -04:00
Bartosz Kostrzewa
a9b92867a8
use tabulator
2020-08-31 18:41:17 +02:00
Bartosz Kostrzewa
65920faeba
correct formatting of Benchmark_wilson_sweep output
2020-08-31 18:39:27 +02:00
Christoph Lehner
249e2db87d
Merge pull request #14 from DanielRichtmann/feature/gpt-coarsenedmatrix
...
Expose more functions in CMat
2020-08-27 15:18:56 +02:00
Daniel Richtmann
cf3535d16e
Expose more functions in CMat
2020-08-27 14:06:48 +02:00
Christoph Lehner
d61ee817f4
Merge pull request #13 from DanielRichtmann/feature/gpt-coarsenedmatrix
...
Changes needed for GPT MG
2020-08-27 12:11:06 +02:00
Peter Boyle
3448b7387c
Almost there to coalesced ET
2020-08-26 17:04:49 -04:00
Peter Boyle
47b89d2739
Pragma protection improvementt
2020-08-26 17:04:27 -04:00
Christoph Lehner
2a75516330
state MPI/SLURM message only on world_rank zero
2020-08-26 12:34:17 -04:00
Daniel Richtmann
b2087f14c4
Fix CoarsenedMatrix regarding illegal memory accesses
...
Need a reference to geom since the lambda copies the this pointer which points to host memory, see
- https://docs.nvidia.com/cuda/cuda-c-programming-guide/#star-this-capture
- https://devblogs.nvidia.com/new-compiler-features-cuda-8/
2020-08-24 17:46:47 +02:00
Daniel Richtmann
dd1ba266b2
Fix mapping between dir + disp and point in CMat
2020-08-24 17:46:46 +02:00
Daniel Richtmann
1292d59563
Add a typedef + broaden interface of CMat
2020-08-24 17:46:45 +02:00
Christoph Lehner
9877ed9bf8
Merge pull request #12 from paboyle/develop
...
Sync
2020-08-22 16:35:35 +02:00
Christoph Lehner
f0dc0f3621
fix compile issue on Qpace3
2020-08-22 13:57:33 +02:00
Peter Boyle
1efe30d6cc
SLurm stop nodes using same GPU
2020-08-21 02:02:53 +02:00
Peter Boyle
0b787e9fe0
Avoid namespaec collision to make gcc happy
2020-08-20 22:23:29 +02:00
Peter Boyle
37ec4b241c
Default thread count sensible
2020-08-20 22:12:31 +02:00
Christoph Lehner
63b0a19f37
Merge pull request #11 from paboyle/develop
...
Sync
2020-08-20 20:53:39 +02:00
Peter Boyle
90ea7dfa99
Accelerator loops for device resident comms buf
2020-08-19 22:40:44 +02:00
Peter Boyle
f866d7c33e
Merge pull request #307 from lehner/feature/gpt
...
Merged Nils's A64FX and minor fixes (MemoryManager::InitMessage, Tensor_index zeroit, ...)
2020-08-18 23:27:21 -04:00
Christoph Lehner
542bdef198
cleanup comments
2020-08-14 18:39:44 +02:00
Christoph Lehner
06007db3d9
true shm_none implementation with GPUs that disables the use of device shared memory for the stencils
2020-08-14 18:37:00 +02:00
Christoph Lehner
12e6059a70
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt
2020-08-13 16:16:52 +02:00
Christoph Lehner
dbaa24ebf6
further GPU memory access fixes (with this GPT passes all single-rank tests on non-summit GPUs)
2020-08-13 16:14:15 +02:00
Peter Boyle
3276aa67dc
Update
2020-08-12 14:15:53 -04:00
Christoph Lehner
3b30b9f0c0
Merge branch 'feature/gpt' of https://github.com/lehner/Grid into feature/gpt
2020-08-06 16:59:17 +02:00
Christoph Lehner
69db4816f7
fix variable capture in Scatter_plane_merge on accelerators
2020-08-06 16:57:16 +02:00
Christoph Lehner
3abe09025a
when using SHM_NONE allow multiple ranks per node but without using shared memory
2020-08-06 14:42:38 +02:00
Christoph Lehner
e33878e0de
Trigger re-run of CI
2020-08-06 11:50:24 +02:00
Christoph Lehner
27b4fbf3f0
assert for forbidden code path and fix check for faster CPU codepath in basisRotate
2020-08-03 07:57:33 -04:00
Christoph Lehner
968a90633a
Zero -> zeroit in Tensor_index
2020-07-31 02:07:17 -04:00
Christoph Lehner
6365a89ba3
create separate InitMessage for MemoryManager that can be called after communicator setup
2020-07-30 07:25:05 -04:00
Christoph Lehner
ddbb008694
Merge pull request #10 from lehner/feature/gpt-sycl
...
Feature/gpt sycl
2020-07-30 13:12:09 +02:00
Christoph Lehner
7997e0a449
Merge branch 'feature/gpt' into feature/gpt-sycl
2020-07-30 13:11:31 +02:00
Christoph Lehner
197612bc7a
fast cpu basisRotate and other small cleanups
2020-07-30 07:08:54 -04:00
Christoph Lehner
0e88bf4bff
remove Nils's default pragma
2020-07-29 10:24:35 -04:00
Christoph Lehner
3e64d78469
include versions.h again and add back asserts in Test_simd
2020-07-29 10:18:05 -04:00
Christoph Lehner
2004611def
Merge pull request #9 from nmeyer-ur/feature/a64fx-2
...
Feature/a64fx 2
2020-07-29 14:54:20 +02:00
Christoph Lehner
a2868c96a4
Merge pull request #8 from paboyle/develop
...
Doc recompile
2020-07-29 14:10:07 +02:00
Peter Boyle
7cf7f11e1a
Doc recompile
2020-07-22 14:44:11 -04:00
nmeyer-ur
ea7f8fda5e
fix typo
2020-07-22 09:34:05 +02:00
nmeyer-ur
906b78811b
exit in Init when using --comms-overlap
2020-07-22 08:57:01 +02:00
Christoph Lehner
97703b181b
Merge pull request #7 from paboyle/develop
...
Merge current develop
2020-07-12 16:24:53 +02:00
nmeyer-ur
d9474c6cb6
compiler-independent build using --enable-simd=A64FX
2020-07-09 10:07:02 +02:00
nmeyer-ur
bbd145382b
enable --enable-simd=A64FX in configure
2020-07-08 12:43:51 +02:00
nmeyer-ur
1b08cb7300
Merge branch 'develop' into feature/a64fx-2
2020-07-08 08:18:18 +02:00
nmeyer-ur
337d9dc043
move barrier in Benchmark_wilson
2020-07-08 08:13:40 +02:00
nmeyer-ur
8726e94ea7
merge upstream develop
2020-07-07 20:26:47 +02:00
nmeyer-ur
67db4993c2
reset head, update SVE readme
2020-07-07 19:54:52 +02:00
f1f655d92b
Merge pull request #304 from Heinrich-BR/develop
...
ScalarImpl.h updates
2020-07-06 10:16:03 +01:00
43334e88c3
Tiny change in a comment for clarity
2020-07-04 16:11:16 +01:00
4f1e66b044
Fixed HMC SU(N) integrator which was causing fields to leave Lie Algebra manifold for N>2
2020-07-04 03:53:06 +01:00
nmeyer-ur
fd3c8b0e85
correct build instructions qp4
2020-07-01 09:00:38 +02:00
nmeyer-ur
1635c263ee
disable TOFU by default
2020-06-30 19:27:08 +02:00
64fe5b21b4
Merge pull request #298 from rrhodgson/feature/baryon
...
Update baryon 2pt and add 3pt function
2020-06-29 18:45:00 +01:00
Peter Boyle
ee9889821d
Runs through to coarse space solve
2020-06-29 12:59:52 -04:00
eb470aa6dc
Update to baryon and added comments/fix whitespace
2020-06-29 09:43:01 +01:00
77af9a3ddc
Baryon revert sign
2020-06-26 10:08:42 +01:00
102089798c
BaryonUtils: update to autoView
2020-06-25 16:41:58 +01:00
39cea8b5a7
Merge branch 'develop' into feature/baryon
2020-06-25 16:24:07 +01:00
a65f66d2db
Merge branch 'feature/baryon3pt' into feature/baryon
2020-06-25 16:20:59 +01:00
Peter Boyle
936c5ecf69
Reduction GPU no compile fix
2020-06-24 17:28:31 -04:00
Peter Boyle
22cfbdbbb3
Boost precision in inner products in single
2020-06-24 12:52:31 -04:00
Peter Boyle
093d1ee21b
Force initial values
2020-06-24 08:54:49 -04:00
Peter Boyle
d6ba2581ce
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-06-24 08:25:08 -04:00
Peter Boyle
577c064184
Memory manager initialise earlier
2020-06-24 08:24:38 -04:00
Peter Boyle
2ff1fa6fad
UVM used shared for CPU alloccations andd ddont migrate
2020-06-23 22:14:56 -04:00
Peter Boyle
70be1bd8be
Adding code under development
2020-06-23 10:24:21 -04:00
4ef50ba31f
Baryon speedup
2020-06-23 11:44:20 +01:00
3e97a26f90
BaryonGamm3pt threads -> accelerator
2020-06-23 11:35:32 +01:00
599f28f6ef
Baryon bug fixes
2020-06-23 11:10:26 +01:00
Peter Boyle
c48da35921
Memory Vector UVM and Lattice alignedAllocator separate
2020-06-22 20:21:53 -04:00
Peter Boyle
6c5fa8dcd8
Aligned allocate on CPU put through this interface
2020-06-20 14:34:29 -04:00
Peter Boyle
0d2f913a1a
String.h for linux
2020-06-20 09:37:31 -04:00
Christoph Lehner
5b117865b2
Merge pull request #6 from paboyle/sycl
...
Sycl
2020-06-20 09:44:44 +02:00
Peter Boyle
1a74816c25
Hopeefully fixed
2020-06-19 17:50:52 -04:00
Peter Boyle
73de335256
Merge branch 'develop' into sycl
2020-06-19 17:44:16 -04:00
Peter Boyle
228fd450ce
Typo fix (excusee - my keyboard is starting to break)
2020-06-19 17:36:05 -04:00
Peter Boyle
b949cf6b12
PeekLocal needs a view to keep thread safe.
...
ALLOCATION_CACHEE reenable
2020-06-19 17:13:27 -04:00
Peter Boyle
11bc1aeadc
TThread count defaultt to fastest
2020-06-19 14:30:35 -04:00
Peter Boyle
66005929af
Set up the cache size on all ranks
2020-06-19 12:50:54 -04:00
Christoph Lehner
05bbc49a99
Edge case in GetShmDim check
2020-06-19 12:01:23 -04:00
Peter Boyle
ff7c847735
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-19 01:22:16 -04:00
Peter Boyle
1aa988b2af
Comms overlap fix UVM case
2020-06-19 01:21:14 -04:00
Peter Boyle
edf17708a8
Range improvement
2020-06-18 22:41:06 -04:00
Christoph Lehner
81a8209749
ConvertType for blockInnerProduct
2020-06-18 11:53:21 -04:00
nmeyer-ur
a87e45ba25
SVE readme update
2020-06-18 11:23:08 +02:00
nmeyer-ur
465856331a
switch back to serialized; wrong results on single too
2020-06-15 15:39:39 +02:00
nmeyer-ur
cc958aa9ed
switch back to standard MPI_init due to wrong results in Benchmark_wilson using comms-overlap
2020-06-15 14:21:38 +02:00
Peter Boyle
f46f029dbb
Merge pull request #292 from lehner/feature/gpt-sycl
...
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units …
2020-06-14 13:43:27 -04:00
Christoph Lehner
3dccd7aa2c
Catch edge case in SharedMemoryMPI::GetShmDims; Change default units to consistent MB in init args; Want last element not past last element in MemoryManagerCache.cc
2020-06-14 13:26:01 -04:00
nmeyer-ur
a25e4b3d0c
pred 32/64 for float/double instead of 8 in VLA patch
2020-06-13 14:44:37 +02:00
nmeyer-ur
d1210ca12a
switch to double/float instead of float64_t/float32_t in VLA patch
2020-06-13 13:59:32 +02:00
nmeyer-ur
36ea0e222a
type traits for ComplexF/D in VLA patch; cosmetics in VLS intrinsics
2020-06-13 13:42:35 +02:00
Peter Boyle
65e6e7da6f
Merge pull request #291 from lehner/feature/gpt-sycl
...
Feature/gpt sycl
2020-06-12 20:42:32 -04:00
Christoph Lehner
b5e87e8d97
summit compile fixes
2020-06-12 18:16:12 -04:00
Christoph Lehner
5f5807d60a
cleanup
2020-06-12 14:48:23 -04:00
nmeyer-ur
92281ec22d
add 3 op Mult for VLA
2020-06-12 18:49:05 +02:00
nmeyer-ur
87266ce099
comment out fcmla in vector types: need also MultAddReal
2020-06-12 18:37:19 +02:00
nmeyer-ur
2a23f133e8
reenable fcmla for VLA
2020-06-12 17:30:38 +02:00
nmeyer-ur
8dbf790f62
correct tbl2 for sp
2020-06-12 17:12:34 +02:00
nmeyer-ur
2402b4940e
vec_imm in float
2020-06-12 15:17:38 +02:00
nmeyer-ur
2111052fbe
apply VLA patch for memcpy reduction suggested by Arm, CAS-162542-D6W7Z7
2020-06-12 14:49:19 +02:00
Christoph Lehner
7974acff54
merged sycl to feature-gpt
2020-06-12 06:49:38 -04:00
f0d17d2b49
Added Baryon3pt code
2020-06-12 11:35:52 +01:00
244c003a1b
Updated Baryon code
2020-06-12 11:00:25 +01:00
0174f5f742
look for librt when using shm=shmopen
2020-06-11 16:50:43 +01:00
Peter Boyle
32b2b59be4
Offload
2020-06-10 20:36:26 -04:00
Peter Boyle
86bb0cc24b
Keep on GPU
2020-06-10 20:00:00 -04:00
Peter Boyle
84c19587e7
Offload
2020-06-10 19:59:31 -04:00
Peter Boyle
237ce92540
Offload loops
2020-06-10 19:59:11 -04:00
Peter Boyle
a7ffc61e82
acceleratorSIMTlane()
2020-06-10 19:58:33 -04:00
Peter Boyle
fd97f64612
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-10 12:58:13 -04:00
Peter Boyle
8720aecb80
Offload more loops
2020-06-10 12:57:55 -04:00
Peter Boyle
cdf0a04fc5
Merge branch 'develop' into sycl
2020-06-09 04:00:12 -04:00
Peter Boyle
616d3dd737
CCommpile updates
2020-06-08 18:57:41 -04:00
Peter Boyle
8b066baca8
Implement transient mechanism
2020-06-08 18:28:53 -04:00
Peter Boyle
e97f3688db
Fix the HMC issue - kernel was launchnig asynchronously
2020-06-08 17:01:15 -04:00
nmeyer-ur
433766ac62
revert Add/SubTimesI and prefetching in stencil
...
This reverts commit 9b2699226c .
2020-06-08 12:02:53 +02:00
nmeyer-ur
93a37c8f68
test prefetch to L2 in stencil
2020-06-08 09:39:50 +02:00
Peter Boyle
89a1e78390
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-05 23:20:37 -04:00
Peter Boyle
ffbb3fc02c
Merge pull request #287 from felixerben/baryon-cleaner
...
slightly cleaner baryon 2pt code
2020-06-05 22:54:52 -04:00
Peter Boyle
5a73ef3647
Minor tweak to compile
2020-06-05 21:50:15 -04:00
Peter Boyle
87e5d2f4b7
Merge branch 'sycl' of https://www.github.com/paboyle/Grid into sycl
2020-06-05 17:32:21 -07:00
Peter Boyle
d720f10758
Liink error fix
2020-06-05 17:29:20 -07:00
Peter Boyle
14fcd0912a
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-05 19:14:17 -04:00
Peter Boyle
3111c0bd4f
Single precisiono hardwire
2020-06-05 19:13:27 -04:00
Peter Boyle
e03064490e
Merge branch 'sycl' of https://github.com/paboyle/Grid into sycl
2020-06-05 18:53:39 -04:00
Peter Boyle
1a4c8c3387
Global edit with change to View usage. autoView() creates a wrapper object that closes the view when scope closes.
2020-06-05 18:52:35 -04:00
Peter Boyle
2b1e259441
Decode of SYCL devices fix
2020-06-04 17:16:55 -07:00
Peter Boyle
f39c2a240b
Priintinig and device memory size detection
2020-06-04 14:58:03 -04:00
Peter Boyle
0d95805cde
Print improvement
2020-06-03 22:50:32 -04:00
Peter Boyle
f67830587f
Accelerator loop use
2020-06-03 22:50:09 -04:00
Peter Boyle
6bf7f839ff
Better printing and logging
2020-06-03 09:28:57 -04:00
Peter Boyle
e3147881a9
Cache scheme
2020-06-03 09:23:48 -04:00
nmeyer-ur
9872c76825
introduce AddTimesI and SubTimesI; slight benefit in operators, but < 1%; breaks all other impls
2020-06-03 15:20:13 +02:00
Peter Boyle
fb559614ad
Initialise meemory manager
2020-06-03 09:12:47 -04:00
Peter Boyle
e93e12b6a4
More verbose SYCL setup
2020-06-03 09:12:11 -04:00
Peter Boyle
0c3112cd94
Use view mechanism
2020-06-03 09:11:51 -04:00
Peter Boyle
8cfd5d2639
Need lattice view
2020-06-03 09:11:28 -04:00
Peter Boyle
1c9f20b15e
Views must be closed
2020-06-03 09:10:29 -04:00
Peter Boyle
32237895bd
Reorg memory manager for O(1) hash table
2020-06-03 09:09:52 -04:00
nmeyer-ur
5ee3ea2144
round-up after testing of prefetches in stencil close
2020-06-03 11:58:20 +02:00
Peter Boyle
c5c2dbc0ce
Optional CUDA info
2020-06-02 14:21:49 -04:00
Christoph Lehner
9fcb47ee63
Explicit error message instead of infinite loop in GlobalSharedMemory::GetShmDims
2020-06-02 07:44:38 -04:00
nmeyer-ur
5050833b42
revert changes due to performance penalty in Wilson using MPI
2020-06-02 13:08:57 +02:00
nmeyer-ur
7bee4ebb54
correct predication for svcadd
2020-06-02 10:51:39 +02:00
nmeyer-ur
71cf9851e7
correct type for vecd in TimesI and TimesMinusI
2020-06-02 10:44:15 +02:00
nmeyer-ur
b4735c9904
correct zero in svcadd
2020-06-02 10:38:05 +02:00
nmeyer-ur
9b2699226c
use fcadd in TimesI and TimesMinusI instead of tbl and neg
2020-06-02 10:32:44 +02:00
nmeyer-ur
5f52804907
update calculation of data
2020-05-30 10:55:17 +02:00
nmeyer-ur
936071773e
correct throughput in wilson and dwf
2020-05-29 22:15:59 +02:00
nmeyer-ur
1732f9319e
more mods; counters seem to work correctly
2020-05-29 18:44:00 +02:00
nmeyer-ur
91c81cab30
some corrections; compiles on my laptop; untested
2020-05-29 18:19:22 +02:00
nmeyer-ur
38164f8480
include counters in WilsonFermionImplementation.h
2020-05-29 17:59:26 +02:00
nmeyer-ur
f013979791
add counter support in WilsonFermion.h
2020-05-29 17:13:59 +02:00
nmeyer-ur
e947b563ea
add space in stencil output
2020-05-29 17:11:17 +02:00
nmeyer-ur
5cb3530c34
enable counters in Benchmark_wilson
2020-05-29 15:44:52 +02:00
nmeyer-ur
250008372f
update SVE readme
2020-05-29 15:44:25 +02:00
Peter Boyle
1d252d0922
Accelerator inline
2020-05-28 11:45:25 -04:00
Peter Boyle
006cc8a8f1
Staggereed move to accelerator
2020-05-28 08:33:06 -04:00
nmeyer-ur
4fedd8d29f
switch to MPI_THREAD_SERIALIZED instead of SINGLE
2020-05-27 14:08:34 +02:00
Peter Boyle
cf2938688a
Sycl unhappy fix
2020-05-25 08:36:53 -07:00
Peter Boyle
ee63721bad
int unhappiness sycl fix
2020-05-25 08:36:24 -07:00
Peter Boyle
22c5168d70
Sycl happier
2020-05-25 08:35:56 -07:00
Peter Boyle
949ac3cd24
Must avoid non-trivial copy constructors
2020-05-25 08:35:28 -07:00
Peter Boyle
7bc0166c1c
SYCLL maknig happy - must avoid non ttrivial copy constructors
2020-05-25 08:34:19 -07:00
Peter Boyle
cb0d1b3399
hopefullly fix buildd fail
2020-05-24 21:27:00 -04:00
Peter Boyle
d1f1ccc705
HIP changes
2020-05-24 21:18:49 -04:00
Peter Boyle
c7519a237a
Assertions fail on HIP foor unknown reasons - dedbugging
2020-05-24 14:02:47 -04:00
Peter Boyle
32be2b13d3
Updates for HiP
2020-05-24 14:00:55 -04:00
Peter Boyle
92b342a477
Hip reduction too
2020-05-24 13:50:28 -04:00
Peter Boyle
556da86ac3
HIP fp16
2020-05-24 13:41:58 -04:00
Peter Boyle
8285e41574
View location / access mode
2020-05-21 16:14:41 -04:00
Peter Boyle
f999408e92
View locatoin and access mode
2020-05-21 16:14:20 -04:00
Peter Boyle
a7abda89e2
View location & access mode
2020-05-21 16:13:59 -04:00
Peter Boyle
7860a50f70
Make view specify where and drive data motion - first cut.
...
This is a compile tiime option --enable-unified=yes/no
2020-05-21 16:13:16 -04:00
nmeyer-ur
6ddcef1bca
fix build error enabling fcmla/mac in vector types for VLA
2020-05-21 21:21:03 +02:00
nmeyer-ur
8c5a5fdfce
disable fcmla in vector type building for VLA
2020-05-21 19:41:42 +02:00
nmeyer-ur
046b1cbbc0
enable fcmla in tensor arithmetics; fixed-size works, VLA does not compile
2020-05-21 19:39:07 +02:00
nmeyer-ur
a65ce237c1
clean up; Exch1 VLA sp+dp integrate, tested, working
2020-05-21 09:48:06 +02:00
nmeyer-ur
cd27f1005d
clean up; Exch1 sp integrate, tested, working
2020-05-21 08:45:43 +02:00
nmeyer-ur
f8c0a59221
clean up; Exch1 dp integrate, tested, working
2020-05-21 02:48:14 +02:00
nmeyer-ur
832485699f
save some cycles in HtoD and DtoH by direct instead of multi-pass conversion
2020-05-20 23:04:35 +02:00
nmeyer-ur
81484a4760
symmetrize Mult and MultAddComplex
2020-05-20 22:36:45 +02:00
nmeyer-ur
9a86059761
symmetrize VLA and fixed size build messages
2020-05-20 20:05:42 +02:00
nmeyer-ur
b780b7b7a0
guard prevents multiple TOFU messages
2020-05-20 19:20:59 +02:00
nmeyer-ur
9e085bd04e
guard prevents multiple A64FX build messages
2020-05-20 19:16:30 +02:00
ferben
6c6812a5ca
GB/s output
2020-05-20 12:26:57 +01:00
Christoph Lehner
8358ee38c4
pull develop
2020-05-19 08:56:18 -04:00
ferben
1f154fe652
some cleanup in BaryonUtils
2020-05-19 13:48:56 +01:00
ferben
d708c0258d
some cleanup in BaryonUtils
2020-05-19 13:48:00 +01:00
Christoph Lehner
a7635fd5ba
summit mem
2020-05-18 17:52:26 -04:00
nmeyer-ur
6b6bf537d3
comment out mac in vector types
2020-05-18 20:36:16 +02:00
nmeyer-ur
323a651c71
correct typo
2020-05-18 19:58:27 +02:00
nmeyer-ur
9f212679f1
support fcmla in vector_types, untested
2020-05-18 19:55:18 +02:00
nmeyer-ur
032f7dde1a
update SVE readme, asm generator
2020-05-18 19:10:36 +02:00
Peter Boyle
ebb60330c9
Automatic data motion options beginning
2020-05-17 16:34:25 -04:00
5aa60be17d
SerialisableClassName method for serialisable enum, and boolean to test if a serialisable object is an enum
2020-05-15 20:00:34 +01:00
nmeyer-ur
50b1db1e8b
implemented correct _m form (using 3 operands instead of 2)
2020-05-15 10:01:05 +02:00
nmeyer-ur
015d8bb38a
introduced assertions in Benchmark_wilson, removed data output from Benchmark_dwf
2020-05-15 09:15:50 +02:00
nmeyer-ur
10a34312dc
some fixed-size code clean up
2020-05-14 23:20:16 +02:00
nmeyer-ur
db8c0e7584
replaced _x form with _m form when using even/odd predication
2020-05-14 23:17:35 +02:00
Christoph Lehner
32fbdf4fb1
Merge pull request #5 from paboyle/develop
...
Sync upstream
2020-05-13 09:02:56 +02:00
Peter Boyle
a9847aa866
Dependence fix
2020-05-12 20:03:37 -04:00
Peter Boyle
2e652431e5
No compile on summiit fix
2020-05-12 18:56:47 -04:00
Peter Boyle
8b5b55b682
Make tests all compile ccurrent Grid, mostly MdagM removal of norms fixes but a few minor
...
issues fiixed too
2020-05-12 17:57:24 -04:00
Peter Boyle
0e3c49f687
TransposeIndex was broken by Christoph
2020-05-12 17:57:01 -04:00
Peter Boyle
cb7ee37562
Close expressions in arg to cshift
2020-05-12 17:56:40 -04:00
Peter Boyle
82f71643a4
Remove the norm in MdagM
2020-05-12 17:55:53 -04:00
nmeyer-ur
d15ccad8a7
switched to vec* in Reduce
2020-05-12 20:41:14 +02:00
nmeyer-ur
0009b5cee8
updated SVE_README
2020-05-12 19:02:33 +02:00
nmeyer-ur
20d1941a45
enabled asm kernels for fixed-size A64FXFIXEDSIZE
2020-05-12 19:01:12 +02:00
Peter Boyle
d24d8e8398
Use X-direction as more bits meaningful on CUDA.
...
2^31-1 shoulddd always bee enough for SIMD and thread reduced local volume
e.g. 32*2^31 = 2^36 = (2^9)^4 or 512^4 ias big enough.
Where 32 is gpu_threads * Nsimd = 8*4
2020-05-12 10:35:49 -04:00
Christoph Lehner
162e4bb567
no automatic prefetching for now
2020-05-12 07:01:23 -04:00
Peter Boyle
07c0c02f8c
Speed up Cshift
2020-05-11 17:02:01 -04:00
Peter Boyle
8c31c065b5
Keep the Vector fixed to protect it from realloc
2020-05-11 17:00:30 -04:00
nmeyer-ur
b7c76ede29
Removed some assertions in Test_simd and removed exit() in Reduce
2020-05-11 22:43:00 +02:00
nmeyer-ur
05edf803bd
corrected typo
2020-05-12 03:59:59 +09:00
Christoph Lehner
b1c86900b2
Merge pull request #4 from paboyle/develop
...
merge
2020-05-11 20:59:29 +02:00
nmeyer-ur
78b8e40f83
switched to gcc's internal data types
2020-05-11 18:11:23 +02:00
nmeyer-ur
fc2e9850d3
temporarily enable TOFU by default when using A64FX or A64FXFIXEDSIZE
2020-05-11 13:25:02 +02:00
nmeyer-ur
ffaaed679e
MPI_THREAD_SINGLE hack for Fugaku, enabled by -DTOFU
2020-05-11 13:21:39 +02:00
Peter Boyle
bbbee5660d
First compiile on HiP
2020-05-10 05:28:09 -04:00
Peter Boyle
ea08f193e7
Allocator cache spliit into large/small pools
2020-05-10 05:24:26 -04:00
Peter Boyle
2bb2c68e15
Separate pools for small and large allocations cache
2020-05-09 22:57:21 -04:00
Peter Boyle
efe5bc6a3c
Split allocator cache into two pools of different sizes
2020-05-09 22:27:56 -04:00
nmeyer-ur
b2fd8b993a
fixed-size clean up
2020-05-09 22:53:42 +02:00
nmeyer-ur
291ee8c3d0
updated fixed-size implementation; only Exch1 and prefetches missing
2020-05-09 22:18:02 +02:00
nmeyer-ur
e1a5b3ea49
unions for tables eliminate explicit loads, gcc does not complain
2020-05-09 21:21:57 +02:00
nmeyer-ur
55a55660cb
reverted changes
2020-05-09 12:48:42 +02:00
Peter Boyle
384da487bd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 18:55:11 -04:00
Peter Boyle
ee1de82a53
Working ITT benchmark again
2020-05-08 18:54:50 -04:00
Peter Boyle
2b576fc185
Comment deadd codde remove
2020-05-08 18:54:29 -04:00
Peter Boyle
52081acfa5
NVCC compile fixes
2020-05-08 13:14:12 -04:00
Peter Boyle
b01b7f761a
Merge pull request #283 from DanielRichtmann/feature/minor-fixes
...
Some small fixes
2020-05-08 10:52:03 -04:00
Daniel Richtmann
c83471bfd0
Fix missing checkerboards for adj und conjugate
2020-05-08 16:44:03 +02:00
Daniel Richtmann
ab0c5d77fb
Correct NonHermitianSchurOperatorBase
2020-05-08 16:44:02 +02:00
Daniel Richtmann
779e3c7442
Const-correctness for retrieval routines of GridStopWatch
2020-05-08 16:43:52 +02:00
Daniel Richtmann
0c570824f2
Add missing declaration of GridCmdOptionInt
2020-05-08 16:43:51 +02:00
Peter Boyle
f8b8e00090
Systematise the accelerator primitives and locate to Grid/threads/Accelerator.h / Accelerator.cc
...
Aim to reduce the amount of cuda and other code variations floating around all over the place.
Will move GpuInit iinto Accelerator.cc from Init.cc
Need to worry about SharedMemoryMPI.cc and the Peer2Peer windows
2020-05-08 06:23:55 -07:00
Peter Boyle
0dd1bdfa94
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-08 09:21:43 -04:00
Peter Boyle
1d65e2f62c
Slightly faster Chebyshev; ifdef'ed out the fastest until tested numerics
...
Lifteed from HDCR setup
2020-05-08 09:20:54 -04:00
Peter Boyle
93920c4811
Remove verbose
2020-05-08 09:19:54 -04:00
Peter Boyle
6859a3e1d4
Schur operator
2020-05-08 09:19:12 -04:00
Peter Boyle
21ca182c36
Comments remove
2020-05-08 09:18:24 -04:00
nmeyer-ur
ceb8b374da
API change v3
2020-05-08 15:04:44 +02:00
nmeyer-ur
4bc2ad2894
API change v2
2020-05-08 15:00:25 +02:00
nmeyer-ur
798af3e68f
retry changing StoD API
2020-05-08 14:34:59 +02:00
nmeyer-ur
b0ef2367f3
testing alternate call to PrecisionChange
2020-05-08 14:22:44 +02:00
nmeyer-ur
71a7350a85
changed 2nd argument in Reduce to native vector type
2020-05-08 12:26:51 +02:00
nmeyer-ur
6f79369955
trying to get rid of macro definition error
2020-05-08 12:19:24 +02:00
nmeyer-ur
f9cb6b979f
corrected more typos
2020-05-08 12:11:01 +02:00
nmeyer-ur
ed4d9d17f8
corrected type
2020-05-08 12:09:22 +02:00
nmeyer-ur
fbed02690d
some changes in breaking out A64FX: use -DA64FXFIXEDSIZE for fixed size, but also define GEN
2020-05-08 12:05:31 +02:00
nmeyer-ur
39f3ae5b1d
corrected more types
2020-05-08 11:07:14 +02:00
nmeyer-ur
e64bec8c8e
pulled SVE typedefs out of Optimization
2020-05-08 11:04:21 +02:00
nmeyer-ur
0893b4e552
fixed typos in PrecisionChange
2020-05-08 10:59:07 +02:00
nmeyer-ur
92f0f29670
fixed double overloading vecf in Div, corrected typos
2020-05-08 10:57:23 +02:00
nmeyer-ur
48a340a9d1
GEN seems to defined by default -> some fixes applied
2020-05-08 10:47:49 +02:00
nmeyer-ur
f45621109b
placed typedefs in Optimization
2020-05-08 10:41:52 +02:00
nmeyer-ur
32d1a0bbea
added even more debug output
2020-05-08 10:39:26 +02:00
nmeyer-ur
267cce66a1
added more debug output
2020-05-08 10:29:28 +02:00
nmeyer-ur
3417147b11
added real fma, corrected typos in tbls; integrated, must supply A64FXGCC with GEN in configure
2020-05-08 10:20:19 +02:00
nmeyer-ur
b338719bc8
first transition to fixed-size done, excl. Exch; next step: integration
2020-05-07 22:33:28 +02:00
nmeyer-ur
2b81cbe2c2
first attempt to introduce tables using fixed-size; still incomplete
2020-05-07 22:01:19 +02:00
nmeyer-ur
acff9d6ed2
transition to fixed size data types almost done; still incomplete
2020-05-07 21:24:07 +02:00
053b4dd495
Merge pull request #282 from felixerben/baryon-reversal
...
Baryon reversal
2020-05-07 18:09:17 +01:00
nmeyer-ur
a306a49788
first mods for fixed size; still incomplete
2020-05-07 19:07:49 +02:00
ferben
42bb5f0721
asserrtion
2020-05-07 18:06:12 +01:00
ferben
253bcc3426
back to old version
2020-05-07 18:03:17 +01:00
a887206413
Merge pull request #281 from felixerben/feature/baryonSpeedup
...
Feature/baryon speedup
2020-05-07 13:41:29 +01:00
ferben
591ebb6213
Merge branch 'develop' of github.com:paboyle/Grid into feature/baryonSpeedup
2020-05-07 11:13:21 +01:00
ferben
56e2f7d088
deleted test routines. cleaned up fast version. assert Ns=4,Nc=3.
2020-05-07 10:03:45 +01:00
nmeyer-ur
7ef03c5368
updated SVE readme
2020-05-06 16:30:37 +02:00
Peter Boyle
525418abfb
Merge pull request #273 from lehner/feature/gpt
...
Feature/gpt
2020-05-06 10:10:51 -04:00
Peter Boyle
5f780806c2
Merge pull request #279 from paboyle/bugfix/nvcc-config
...
configure fix for nvcc with extra arguments as CXX
2020-05-06 10:07:52 -04:00
Christoph Lehner
3c6ffcb48c
Merge branch 'develop' into feature/gpt
2020-05-06 15:03:35 +02:00
Christoph Lehner
87984ece7d
add Lattice_basis.h
2020-05-06 08:47:18 -04:00
Christoph Lehner
e9b295f967
Synchronize blocking infrastructure with GPT
2020-05-06 08:42:28 -04:00
Peter Boyle
224cbf0453
Merge pull request #280 from mmphys/bugfix/ET_go_home
...
Bugfix/et go home
2020-05-05 17:56:51 -04:00
Michael Marshall
c1e57d4357
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
SYCL prep - no sycl just make it compile through DPC++
dpc++ didn't like rdtsc()
Make compile if HAVE_LIME=0
Lime optional
2020-05-05 22:35:04 +01:00
Peter Boyle
28a1fcaaff
First compile against SYCL
2020-05-05 11:13:27 -07:00
Christoph Lehner
6b64727161
disable comments
2020-05-05 05:05:36 -04:00
Christoph Lehner
04863f8f38
debug new AcceleratorView
2020-05-04 16:07:03 -04:00
u37294
04927d2e40
SYCL prep - no sycl just make it compile through DPC++
2020-05-04 10:28:29 -07:00
u37294
7caed4edd9
dpc++ didn't like rdtsc()
2020-05-04 10:27:05 -07:00
u37294
59c51d2c35
Make compile if HAVE_LIME=0
2020-05-04 10:26:20 -07:00
u37294
ff53b231c8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-05-04 10:25:10 -07:00
u37294
fc19cf905b
Lime optional
2020-05-04 10:24:48 -07:00
Christoph Lehner
2a1387e992
rankInnerProduct
2020-05-03 17:27:11 -04:00
Christoph Lehner
9bfa51bffb
cleanup comment
2020-05-03 09:12:52 -04:00
Christoph Lehner
38532753f4
interface cleanup
2020-05-03 08:58:32 -04:00
Christoph Lehner
949be9605c
fix pragmas
2020-05-02 16:20:03 -04:00
Christoph Lehner
63cf201ee7
Add AdviseInfrequentUse
2020-05-02 11:38:42 -04:00
Christoph Lehner
c8af498a2a
BinaryIO fix for alternative little-endian format name (used in 96I ensemble)
2020-05-01 03:45:50 -04:00
Christoph Lehner
ddb192bac7
re-work double precision promotion for summit
2020-04-30 16:09:57 -04:00
Michael Marshall
7666300a6f
Merge branch 'develop' into bugfix/ET_go_home
...
* develop:
Basis rotate stack passig to GPU reduction
Clean up warning
2020-04-30 20:10:32 +01:00
Michael Marshall
4a4b9e305d
Fix: strToVec enters infinite loop and exhausts memory if operator>> fails before the end of string, e.g. if parsing "0_0_0" for momentum instead of "0 0 0".
2020-04-30 19:40:04 +01:00
Peter Boyle
9b2d2d0fc3
Basis rotate stack passig to GPU reduction
2020-04-30 12:31:07 -04:00
Peter Boyle
5011753f4f
Clean up warning
2020-04-30 10:23:48 -04:00
Michael Marshall
dbaeefaeef
All Eigen::TensorMap objects are fixed (i.e. cannot be dynamically resized)
2020-04-30 15:02:51 +01:00
Christopher Kelly
dee96cbf82
Added workaround in configure to still catch Cuda compiler when nvcc with extra arguments (eg -ccbin) is used as CXX
2020-04-29 10:37:11 -04:00
Peter Boyle
dd3ebc2ce4
Slow compile on NVCC switch off conserved current
2020-04-29 08:43:12 -04:00
Peter Boyle
103e7ae2f0
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-29 03:05:36 -04:00
Peter Boyle
29ae5615c0
Seqeuential fix
2020-04-29 03:05:15 -04:00
ferben
6240e02619
added assertion to avoid potential infinite loop
2020-04-27 18:50:53 +01:00
ferben
f4033ad8cb
baryon speedup by a factor 2
2020-04-27 17:46:14 +01:00
nmeyer-ur
5abec5b8a9
SVE_readme update, update Grid_vector_types.h
2020-04-25 13:48:26 +02:00
nmeyer-ur
499edc0636
updated SVE_README.txt; defined ARMCLANGCOMPAT macro
2020-04-25 13:41:24 +02:00
nmeyer-ur
d990e61be3
armclang 20.1 settings in SVE readme
2020-04-25 12:11:43 +02:00
nmeyer-ur
3edb2dc2da
removed -static from gcc CXXFLAGS
2020-04-24 13:04:34 +02:00
Christoph Lehner
f1fe444d4f
blocked precision promotion infrastructure upgrade
2020-04-24 06:27:20 -04:00
nils meyer
345721220e
resolved merge conflict
2020-04-24 10:14:21 +02:00
nils meyer
6db68d6ecb
added SVE configure for armclang and gcc
2020-04-24 10:10:47 +02:00
Peter Boyle
dae820aa96
Merge pull request #277 from mmphys/bugfix/grid-config
...
Bugfix/grid config
2020-04-23 10:26:54 -04:00
Michael Marshall
5daf176f4a
Updated to expose GRID_CXXLD in addition to CXXLD.
...
NB: CXXLD required as this is what drives linking behaviour.
2020-04-23 15:25:53 +01:00
Michael Marshall
e96c86ec14
Make grid-config message more specific for --cxx and --cxxld
2020-04-23 13:10:45 +01:00
nmeyer-ur
09f0963d1f
changes in configure.ac ; to be verified
2020-04-23 11:27:03 +02:00
nils meyer
6f44e3c192
reverted changes in configure.ac ; included SVE configure readme
2020-04-23 11:18:50 +02:00
Peter Boyle
c2c3cad20d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2020-04-23 04:35:42 -04:00
Peter Boyle
edec9ee2e2
Conserved current rewrite done. Zmobius working
2020-04-23 04:34:01 -04:00
Peter Boyle
ed70cce542
Test for 5D DWF obserevables
2020-04-23 04:29:45 -04:00
Michael Marshall
4701201b5f
grid-config: Expose CXXLD (for GPU build) and update help
2020-04-22 18:42:30 +01:00
nils meyer
5893888f87
removed default no-strict-aliasing for gcc-10.0.1 exclusively
2020-04-22 19:29:55 +02:00
nmeyer-ur
39b448affb
Merge remote-tracking branch 'origin/develop' into feature/a64fx-2
2020-04-22 17:34:12 +02:00
nils meyer
e54a8f05a9
Exchange1 with generic version for now, should use svtbl2 in final version
2020-04-20 22:45:27 +02:00
Peter Boyle
0782b76ed4
Merge pull request #274 from paboyle/feature/zmobius_paramcompute
...
ZMobius parameter computation
2020-04-20 14:39:29 -04:00
Christopher Kelly
0896f2cead
Added missing include guards in bigfloat_double.h
2020-04-20 10:30:38 -04:00
Christopher Kelly
181709bba4
Merge branch 'develop' into feature/zmobius_paramcompute
2020-04-20 09:12:34 -04:00
nils meyer
64b72fc17f
testing gcc 10.0.1: build errors in Exchange1 using -DA64FX and in Lattice_base.h building Dslash only
2020-04-19 01:25:40 +02:00
Christoph Lehner
091d5c605e
towards more precise blocking
2020-04-17 04:25:28 -04:00
nils meyer
6fdce60492
revised BodyA64FX; 990 GiB/s Wilson, 687 GiB/s DW using intrinsics (armclang 20.0)
2020-04-16 22:43:32 +02:00
Peter Boyle
90229cfb0f
Merge pull request #270 from milc-qcd/feature/CGinfo
...
feature/CGinfo
2020-04-16 11:46:08 -04:00
Peter Boyle
0475c46ecb
Merge pull request #256 from djm2131/feature/BiCGSTAB
...
Import BiCGSTAB solvers and tests
2020-04-16 11:45:15 -04:00
Peter Boyle
3cca10e617
Merge pull request #276 from nils-asmussen/fix/regression_nt
...
fix regression in tests/core/Test_qed.cc
2020-04-16 11:42:39 -04:00
Christoph Lehner
327da332bb
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/gpt
2020-04-16 11:30:17 -04:00
nils meyer
852db4626a
re-introduced HOTFIX cause Grid binaries give wrong results otherwise; checked in good gridverter.py
2020-04-15 18:22:19 +02:00
43dc2814dd
fix regression in core/Test_qed.cc
2020-04-15 16:10:15 +01:00
nils meyer
6504a098cc
999 GiB/s Wilson; 694 GiB/s DW (DP)
2020-04-15 15:06:52 +02:00
nils meyer
79a385faca
disabled armclang hotfix cause armclang 20.0 performance gets a little
2020-04-15 11:46:55 +02:00
nils meyer
c12a67030a
980 GiB/s Wilson; 680 GiB/s DW (DP)
2020-04-15 10:55:06 +02:00
nils meyer
581392f2f2
now with pf, best results so far using intrinsics+pf
2020-04-12 22:06:14 +02:00
nils meyer
113f277b6a
enable dslash asm using -DA64FXASM, additionaly -DDSLASHINTRIN for intrinsics impl
2020-04-11 04:55:01 +02:00
Peter Boyle
f3a8d039a2
Merge branch 'feature/hdcr' into develop
2020-04-10 22:01:52 -04:00
nils meyer
974586bedc
Dslash finally works; cleaned up; uses MOVPRFX in assembly
2020-04-10 22:26:40 +02:00
4e864e56c9
develop pull
2020-04-10 17:19:18 +01:00
Peter Boyle
014dbfa464
Compile fix with OpDirAll
2020-04-10 11:57:09 -04:00
Peter Boyle
3b0e07882f
Adding another form of polynomial
2020-04-10 11:28:33 -04:00
Peter Boyle
8e81a811d0
Merge branch 'feature/hdcr' into develop
2020-04-10 11:14:49 -04:00
Peter Boyle
aa13118127
Missing conjugate already fixed in develop
2020-04-10 11:11:24 -04:00
Peter Boyle
6cdb09c884
Faster copy region
2020-04-10 11:10:52 -04:00
Peter Boyle
a65bc64f10
Accelerator peek poke
2020-04-10 11:09:59 -04:00
Peter Boyle
11dec4883c
Don't throw assert
2020-04-10 11:09:11 -04:00
Peter Boyle
afa458c812
Extra solvers
2020-04-10 11:08:19 -04:00
Peter Boyle
dc50190b8f
Faster GPU basis rotation
...
May need to later include Regensburg optimised CPU variant
2020-04-10 11:06:04 -04:00
nmeyer-ur
160f78c1e4
changed debug output to variable direct 3
2020-04-10 12:23:07 +02:00
nmeyer-ur
7e4e1bbbc2
changed debug output to variable direct 2
2020-04-10 12:22:04 +02:00
nmeyer-ur
e699b7e9f9
changed debug output to variable direct
2020-04-10 12:18:30 +02:00
nmeyer-ur
a28bc0de90
debug register address test in WilsonHand
2020-04-10 12:07:45 +02:00
nmeyer-ur
14d0fe4d6c
added predication in WilsonHand
2020-04-10 12:04:00 +02:00
nmeyer-ur
0ad2e0815c
debug output in WilsonHand
2020-04-10 11:56:29 +02:00
nils meyer
1c8ca05e16
Merge branch 'feature/a64fx-2' of https://github.com/nmeyer-ur/Grid into feature/a64fx-2
2020-04-09 23:32:19 +02:00
nils meyer
dc9c8340bb
switched to DSLASHINTRIN for A64FX Dslash intrinsics
2020-04-09 23:30:23 +02:00
nils meyer
19eef97503
specialized A64FX Dslash kernels
2020-04-09 23:25:25 +02:00
nmeyer-ur
635246ce50
corrected typo
2020-04-09 21:42:50 +02:00
nils meyer
5cdbb7e71e
fixed A64FX Dslash; compiles, but does not specialize -> assertion
2020-04-09 21:23:39 +02:00
nmeyer-ur
8123590a1b
changes
2020-04-09 16:45:47 +02:00
nmeyer-ur
86c9c4da8b
changes
2020-04-09 16:40:06 +02:00
nmeyer-ur
cd1efee866
changes
2020-04-09 16:35:13 +02:00
nmeyer-ur
bd310932f7
changes
2020-04-09 16:32:31 +02:00
nmeyer-ur
304762e7ac
changes
2020-04-09 16:26:01 +02:00
nmeyer-ur
d79ab03a6c
changes
2020-04-09 16:19:25 +02:00
nmeyer-ur
d5708e0eb2
more changes
2020-04-09 15:43:34 +02:00
nmeyer-ur
123f6b7a61
more changes
2020-04-09 15:17:19 +02:00
nmeyer-ur
2b6457dd9a
added xp/xm recon accum
2020-04-09 15:13:19 +02:00
nmeyer-ur
b367cbd422
defined ADD_RESULT
2020-04-09 15:08:45 +02:00
nmeyer-ur
e252c1aca3
addressing
2020-04-09 15:03:12 +02:00
nmeyer-ur
b140c6a4f9
addressing
2020-04-09 15:01:15 +02:00
nmeyer-ur
326de36467
revised sU addressing scheme
2020-04-09 14:44:25 +02:00
nmeyer-ur
9f224a1647
fixed typo in single
2020-04-09 14:30:21 +02:00
nmeyer-ur
bb46ba9b5f
fixed array size in single
2020-04-09 14:28:45 +02:00
nmeyer-ur
dd5a22b36b
revised declarations
2020-04-09 14:21:27 +02:00
nmeyer-ur
1ea85b9972
Disabled build message
2020-04-09 13:47:21 +02:00
nmeyer-ur
8fb63f1c25
added A64FX Wilson kernels single precision
2020-04-09 13:41:04 +02:00
nmeyer-ur
77fa586f6c
introduced A64FX Wilson kernels
2020-04-09 13:30:06 +02:00
Christoph Lehner
96e8e44fd4
Merge pull request #2 from DanielRichtmann/feature/fused-innerproduct-norm2
...
Fused innerProduct + norm2 on first argument operation
2020-04-06 13:16:58 +02:00
Daniel Richtmann
5fc8a273e7
Fused innerProduct + norm2 on first argument operation
2020-04-06 11:52:29 +02:00
d671a63e78
Update README.md
2020-04-03 19:52:15 +01:00
nmeyer-ur
15238e8d5e
reduce acle works, clean up
2020-04-03 20:40:44 +02:00
nmeyer-ur
b27e31957a
reduce acle revised
2020-04-03 19:46:15 +02:00
nmeyer-ur
46927771e3
reduce acle still needs overhaul
2020-04-03 19:30:48 +02:00
nmeyer-ur
d8cea77707
define simd width in header
2020-04-03 19:22:25 +02:00
nmeyer-ur
5f8a76d490
clean up, reduction in acle
2020-04-03 19:18:24 +02:00
nmeyer-ur
28d49a3b60
build problem resolved
2020-04-03 16:52:48 +02:00
nmeyer-ur
b4c624ece6
added A64FX support
2020-04-03 15:43:23 +02:00
2c22db841a
Added momentum scaling to scalar HMC theories in order to follow UKQCD/CPS conventions
2020-04-02 17:38:47 +01:00
Christoph Lehner
856d168e41
global sum over vectors of uint64_t
2020-03-29 07:56:05 -04:00
6235c7ba98
IPP path fix in configure
2020-03-27 17:23:29 +00:00
7e13724882
removing Hadrons
2020-03-27 12:03:32 +00:00
Christoph Lehner
b6cbdd2aa3
Merge pull request #1 from DanielRichtmann/feature/read-openqcd
...
Feature/read openqcd
2020-03-26 17:39:04 +01:00
Christoph Lehner
a2188ea875
remove debugging printf from WilsonKernelsImplementation
2020-03-26 09:12:36 -04:00
Daniel Richtmann
989af65807
Check in parallel reader for openqcd configs
2020-03-24 11:20:54 +01:00
Christoph Lehner
60db3133d3
make trace,adj,transpose unary operators
2020-03-16 17:59:56 -04:00
Christoph Lehner
c9b737a4e7
make trace,adj,transpose unary operators
2020-03-16 17:58:30 -04:00
Daniel Richtmann
037bb6ea73
Check in reader for openqcd configs
...
This reader is suboptimal in the sense that it opens the entire config on every MPI rank.
2020-03-16 14:28:02 +01:00
05ebc458e2
Merge pull request #260 from mmphys/feature/distil
...
Distillation: save eigenvalues of the Laplacian for all timeslices
2020-03-13 14:00:21 +00:00
Michael Marshall
3753508957
Making change 1) as simple as possible 2) as much like MSink/Point.hpp as possible
2020-03-12 13:47:51 +00:00
Michael Marshall
c1677fccf6
Merge branch 'develop' into feature/distil
...
* develop:
bugfix ZPerambulator
registered module supporting ZMobius action
changed to push_back according to request
Added Hadrons_Error in case blockSize is set too large
bugfix in perambulator module
# Conflicts:
# Hadrons/Modules/MDistil/Perambulator.hpp
2020-03-12 12:45:18 +00:00
35e8e31749
Merge pull request #272 from mmphys/feature/ZPeramb
...
bugfix ZPerambulator
2020-03-12 12:28:04 +00:00
34813e9b04
Merge branch 'develop' into feature/ZPeramb
2020-03-12 12:27:56 +00:00
Felix Erben
373cf61abb
bugfix ZPerambulator
2020-03-12 11:44:43 +00:00
4e8fbc4b49
Merge pull request #271 from mmphys/feature/ZDistil
...
registered module supporting ZMobius action
2020-03-12 10:54:07 +00:00
ferben
516ac1d4d5
registered module supporting ZMobius action
2020-03-12 10:52:27 +00:00
318f63eb34
Merge pull request #268 from mmphys/a2a-error-log
...
Added Hadrons_Error in case blockSize is set too large
2020-03-11 11:09:00 +00:00
16503d7532
Merge pull request #267 from mmphys/feature/distil-bugfix
...
bugfix in perambulator module
2020-03-11 11:08:23 +00:00
ferben
0fa93383b7
changed to push_back according to request
2020-03-11 09:05:01 +00:00
ferben
0a827aa7bf
Added Hadrons_Error in case blockSize is set too large
2020-03-11 08:52:52 +00:00
Carleton DeTar
165c68e28e
Change TrueResiduals to TrueResidualShift and IterationsToComplete to IterationsToCompleteShift
2020-02-29 17:51:51 -06:00
ferben
b32b1ca642
bugfix in perambulator module
2020-02-26 12:06:45 +00:00
Carleton DeTar
9479bc8486
Make IterationsToComplete and TrueResidual externally accessible
2020-02-19 17:43:57 -06:00
Peter Boyle
8a5c13d5fb
Still fast moving in changes
2020-02-06 17:57:26 -05:00
Peter Boyle
bdccb0c91f
Working 2 types of decomposition
2020-02-06 17:26:55 -05:00
Peter Boyle
68b45f6444
Lower left/upper right region cut paste
2020-02-06 15:50:26 -05:00
Peter Boyle
ef9b3e658a
extra typedef
2020-02-06 15:47:14 -05:00
Peter Boyle
b9ca40cc44
More precise power method at start
2020-02-06 10:09:14 -05:00
Peter Boyle
2f421a5db1
Commeent fix
2020-02-06 10:08:27 -05:00
Michael Marshall
10192dfc71
Wall source momenta must be specified for spatial components only.
...
So we don't break existing scripts, allow momentum in time direction as well, but only if zero.
Fail early, so do the check in setup()
2020-01-31 15:02:03 +00:00
Michael Marshall
c69a3b6ef6
When saving eigenvectors, LapEvec now saves eigenvalues for every timeslice as well.
...
I.e. nT x nVec eigenvalues are saved in FileName.evals.conf.h5.
A new named tensor, "TimesliceEvals" can be used to simplify restoring these from disk.
NB: The changes in BaseIO add support so that Eigen tensors can be easily used in MPI operations, e.g. GlobalSum.
See LapEvec.hpp for an example of how this is done.
2020-01-29 21:20:20 +00:00
Peter Boyle
852fc1b001
True Hierachical multigrid for DWF
2020-01-27 13:45:10 -05:00
Peter Boyle
2b5de5bba5
MdagM operator without norm option
2020-01-27 13:44:30 -05:00
Peter Boyle
2e85cae74e
Add Jacobi polynomials
2020-01-27 13:43:49 -05:00
Peter Boyle
76c823781e
Much faster coarsening
2020-01-27 13:43:19 -05:00
Peter Boyle
114db3b99d
Optional MdagM without norms
2020-01-27 13:42:51 -05:00
Peter Boyle
49e123dbda
Use explicit linalg calls to get coalesce optimisations on GPU
2020-01-27 12:44:51 -05:00
Peter Boyle
8cec294ec9
Make CG a bit less verbose as gettign annoying in nested algorithms.
...
Can use Iterative logging if you want to see more
2020-01-27 12:44:04 -05:00
Peter Boyle
eb5b720e94
Normal Equations can be used in HDCR now
2020-01-27 12:43:29 -05:00
Peter Boyle
b2736ec80b
Make PrecGCR recursive - it can precondition itself
2020-01-27 12:42:48 -05:00
Peter Boyle
086256a032
Less sloppy convergence test on PowerMethod
2020-01-27 12:41:59 -05:00
Peter Boyle
afc7426f39
Much bigger pointer cache in case of Nvidia due to cost of setting up UVM allocations
2020-01-27 12:41:16 -05:00
Peter Boyle
7c061e20c9
All directions of dirac operator for fastt coarsening
2020-01-27 12:40:13 -05:00
Peter Boyle
e5d1c09665
Faster DhopDirAll for little dirac operator coarsening
2020-01-27 12:38:54 -05:00
Peter Boyle
8016a465ae
Remove extraneous variable
2020-01-27 12:35:37 -05:00
Peter Boyle
d8b9742092
DhopDirAll for faster matrix elements of little Dirac operator
2020-01-27 12:34:54 -05:00
Peter Boyle
1bd87c35d7
Read coalescing on Nvidia
2020-01-27 12:29:56 -05:00
Peter Boyle
fa856c9669
Disable information message
2020-01-27 12:28:46 -05:00
Peter Boyle
48008e4d8b
Thread coordinate creation loop
2020-01-27 12:28:16 -05:00
Peter Boyle
55cdb17691
Integer divide for blocking
2020-01-27 12:27:45 -05:00
Michael Marshall
2ed39ebb7a
Perambulator won't even allocate memory for unsmeared sinks unless the filename is specified.
...
Prior to this update, memory is allocated regardless of whether these are requested.
2020-01-24 13:01:06 +00:00
Christopher Kelly
96671bbb24
Added ability to pass callback to MADWF that is called every inner iteration and allows user to, for example, adjust the inner solver tolerance depending on residual
...
Added a general implementation of the Remez algorithm for producing arbitrary rational polynomial approximation with optional restriction to even/odd polynomials
Added implementation of computation of ZMobius parameters
Added Test_zMADWF_prec to test ZMobius in MADWF
2020-01-17 12:45:30 -08:00
Peter Boyle
554542b773
Merge branch 'feature/hdcr' of https://github.com/paboyle/Grid into feature/hdcr
2020-01-06 11:47:56 -05:00
Peter Boyle
03da4040e2
Make summit happy
2020-01-06 11:47:48 -05:00
Peter Boyle
e583035614
Change to interface to minise comms in evaluating coarse space operator
2020-01-06 11:43:59 -05:00
Peter Boyle
3c3d6a94f3
OPtimising the force term a bit
2020-01-04 03:16:23 -05:00
Peter Boyle
205ea4bbb2
More verboose Lanczos
2020-01-04 03:13:40 -05:00
Peter Boyle
039eb7b2eb
Make the force term and coarsening multigrid more optimised
2020-01-04 03:12:17 -05:00
Peter Boyle
f7e4bd1f6d
Getting more optimised
2020-01-04 03:11:53 -05:00
Peter Boyle
0afecfcae7
Nearing well optimised state
2020-01-04 03:11:19 -05:00
Peter Boyle
ba40a3f763
Alternate low pass filter option
2020-01-03 05:29:09 -05:00
Peter Boyle
aa920aa532
Improved DWF multigrid
2019-12-28 10:32:35 -05:00
Peter Boyle
c0d8e4dce5
Improved Multigrid for DWF
2019-12-28 10:32:15 -05:00
Michael Marshall
0ca1992151
Remove warning in tensor layout comparison. Make default names and index names visible for PerambTensor and NoiseTensor
2019-12-20 13:53:27 +00:00
Michael Marshall
df2b0c4e79
Merge branch 'develop' into feature/distil
...
* develop:
Missing conjugate in MooeeInvDag
Allow subspace setup to no converge
fp16 mandatory. Use SFW is not available as hdw
2019-12-20 13:24:59 +00:00
Peter Boyle
9cfd64c604
Coarse grid on GPU, not fast enough yet. Need a 10x
2019-12-17 05:24:45 -05:00
Peter Boyle
e478404291
Tuned up significantly on GPU, but another 10x in coarse space required
2019-12-17 05:03:25 -05:00
Peter Boyle
9aafd20468
Simple block project promote runs faster on GPU
2019-12-17 05:01:39 -05:00
Peter Boyle
5d834486c9
Merge pull request #259 from grid-test-organisation/feature/5d-improvement-fix
...
Missing conjugate in MooeeInvDag
2019-12-16 04:20:37 -05:00
gfilaci
f7373e97a4
Missing conjugate in MooeeInvDag
2019-12-16 10:05:50 +01:00
Peter Boyle
9e15474999
Accelerator loop attempt at speed up
2019-12-14 05:28:16 -05:00
Peter Boyle
152b525a4d
Typo fix
2019-12-13 22:44:42 -05:00
Peter Boyle
d18994eddc
offload more of mgrid to GPU
2019-12-13 22:08:11 -05:00
Peter Boyle
b8bd8cd2ae
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-13 21:32:10 -05:00
Peter Boyle
736b19485e
Faster set up and some dead code ifdef'ed out
2019-12-13 21:30:48 -05:00
Michael Marshall
c7637a84ad
Documentation tweak for peculiarities of OpenMPI --prefix
2019-12-12 17:00:03 +00:00
Michael Marshall
a7772c827b
Documentation tweak
2019-12-12 16:05:22 +00:00
8e83398861
Merge pull request #257 from AndrewYongZhenNing/develop
...
Added NamedTensor.hpp
2019-12-11 21:36:59 +00:00
David Murphy
843ca9350a
Fix naming conventions to be consistent with Peter
2019-12-11 11:46:18 -05:00
f47b2b6e13
Added NamedTensor.hpp
2019-12-11 15:56:46 +00:00
Peter Boyle
5bfd1470ad
Merge branch 'develop' into feature/hdcr
2019-12-10 21:51:06 -05:00
Peter Boyle
6957b0b58a
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-10 21:50:42 -05:00
Peter Boyle
d73f0b8618
Verbose for temporary debug
2019-12-10 21:50:06 -05:00
Peter Boyle
0b3a3562c3
Some MPI (summit) create sigusr2, so trap that
2019-12-10 21:49:12 -05:00
Peter Boyle
710fee5d26
Subspace setup testing code
...
and timing verbose
2019-12-10 21:48:42 -05:00
Peter Boyle
bab0bf2e93
Merge branch 'develop' into feature/hdcr
2019-12-10 21:47:41 -05:00
Peter Boyle
848079e8ba
Merge pull request #235 from grid-test-organisation/feature/5d-improvement
...
MooeeInv and M5D optimisations + enable threading with nvcc
2019-12-10 21:45:03 -05:00
Peter Boyle
f2a4f13111
Must offload the Coarsened matrix if Stencil buffers are device resident
2019-12-10 19:32:12 -05:00
David Murphy
4180a4a8a7
Import BiCGSTAB solvers and tests
2019-12-10 17:20:35 -05:00
b9b9fcbfa0
Merge pull request #229 from nils-asmussen/feature/JacobiSmear
...
MSource::jacobi smear + sort file contents of Modules.hpp and modules.inc
2019-12-09 22:50:02 +00:00
bbe48998a8
sort Modules.hpp and modules.inc + add module JacobiSmear
2019-12-09 18:06:29 +00:00
6446671a9c
Merge pull request #241 from nils-asmussen/fix/remQCDns_ignore_ws
...
Undo whitespace changes in fix/removeQCDremnants to allow comparing relevant changes
2019-12-09 18:02:21 +00:00
110373ea79
Merge pull request #204 from nils-asmussen/sha256sum_Eigen_download
...
bootstrap.sh: verify checksum of Eigen tar file
2019-12-09 18:01:46 +00:00
a986786192
bootstrap.sh: verify checksum of Eigen tar file if sha256sum is installed
2019-12-09 17:11:21 +00:00
Peter Boyle
edd1c924eb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:53:01 -05:00
Peter Boyle
9b6b0caa55
Junk commit fix
2019-12-09 03:01:58 -05:00
Peter Boyle
2a48617ac5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-12-09 03:00:00 -05:00
Peter Boyle
876d9c957c
QMR
2019-12-09 02:59:49 -05:00
Peter Boyle
295e535f93
QMR
2019-12-09 02:59:35 -05:00
Peter Boyle
58a31f0763
QMR implemented, preserve even if not used much
2019-12-09 02:59:13 -05:00
Peter Boyle
3d2fe80780
Temporary size depends on checkerboard/uncheckerboard. The Mdir cares
2019-12-09 02:58:24 -05:00
Peter Boyle
e43fce1083
Clean up and simplify a little.
2019-12-09 02:55:45 -05:00
Peter Boyle
0dfdf80407
Logging
2019-12-09 02:54:52 -05:00
Peter Boyle
2912071f83
Add non hermitian operator
2019-12-09 02:51:53 -05:00
Peter Boyle
26605ef387
HDCR back to working
2019-12-09 02:51:01 -05:00
1e5ac576d9
Merge commit 'f7698b93ca57ea3aa4d72b133ad9ca5d1e703661' into develop
...
# Conflicts:
# Hadrons/Modules.hpp
# Hadrons/modules.inc
2019-12-06 11:59:21 +00:00
d5492b426f
Hadrons: better order in module list
2019-12-06 11:56:26 +00:00
d428858c9d
Merge pull request #255 from fionnoh/feature/sparseNoise
...
Feature/sparse noise
2019-12-06 11:43:27 +00:00
ferben
f7698b93ca
corrected comments about quark line directions
2019-12-06 09:46:52 +00:00
ferben
7ce77690b8
Naming conventon also applied to metadata
2019-12-05 17:38:43 +00:00
ferben
164ed9c434
Naming conventon also applied to metadata
2019-12-05 17:38:00 +00:00
ferben
a54157e682
more definitions changed
2019-12-05 17:08:09 +00:00
ferben
58b6a0d8d1
changed some naming conditions to resemble rare-kaons
2019-12-05 16:56:54 +00:00
ferben
1a5e562bde
only one FIMPL left!
2019-12-05 16:46:58 +00:00
Fionn O hOgain
45be26cf3f
Merge branch 'develop' of https://github.com/fionnoh/Grid into feature/sparseNoise
2019-12-05 16:18:47 +00:00
Fionn O hOgain
5227ffccb7
Added James' sparse noise code and a module to use it
2019-12-05 15:50:03 +00:00
a0b47cc0be
Merge pull request #254 from fionnoh/bugfix/eigenMigration
...
Updated Eigen URL after migration to gitlab
2019-12-05 15:26:38 +00:00
ferben
b766038810
new syntax after merge
2019-12-04 18:08:00 +00:00
ferben
cd9fd80a5d
merged in develop
2019-12-04 17:12:46 +00:00
d6100cc35a
Merge pull request #253 from mmphys/feature/distil
...
Fix phase convention adjustment error
2019-12-04 14:58:51 +00:00
Fionn O hOgain
29a1530510
Updated Eigen URL after migration to gitlab
2019-12-04 13:49:22 +00:00
Michael Marshall
15119eaf03
Fix phase convention adjustment error (and make no assumptions about node layout)
2019-12-04 09:59:58 +00:00
188e12ffbb
Merge pull request #249 from mmphys/feature/distil
...
Feature/distil
2019-12-03 18:06:00 +00:00
ferben
e940f4db7e
removed unused parameter parity
2019-12-03 12:01:31 +00:00
ferben
9c7f269489
typo in fimpl4
2019-12-03 11:19:54 +00:00
ferben
07feaf9531
updated ascii-doc preamble
2019-12-03 11:17:35 +00:00
Michael Marshall
7983ff2fdd
Merge branch 'develop' into feature/distil
...
* develop:
Change to reporting
NVCC timer support
Fix nocompilee under NVCC
--enable-summit flag
IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could be a little faster
Sliced propagator contraction was not producing any results because buf.size()=0
several typos in hadrons
2019-11-30 16:47:03 +00:00
Michael Marshall
2db814f2b7
Resolve conflicts in BaryonUtils (just use latest from develop)
2019-11-29 18:19:35 +00:00
Michael Marshall
6418f06771
Add option to save the eigenvectors of the Laplacian.
...
If they are saved, then metadata saved are:
solverXml Parameters for this LapEvec module instance
OperatorXml module type and parameters (if any) for the module that created the gauge field
2019-11-29 18:06:18 +00:00
8a5576f73c
cleared up how exactly q_spec has to be defined
2019-11-28 12:35:18 +00:00
Peter Boyle
997790ad24
Allow subspace setup to no converge
2019-11-26 14:04:28 -05:00
Peter Boyle
900d6fad21
fp16 mandatory. Use SFW is not available as hdw
2019-11-26 13:26:43 -05:00
799ff0c96e
speed-up
2019-11-26 15:28:47 +00:00
5fd5c25114
now two seperate functions for Eye and NonEye
2019-11-26 13:44:55 +00:00
62b3799c77
Merge pull request #251 from fionnoh/bugfix/WallWallMeson
...
MContraction::Meson bugfix
2019-11-26 12:46:03 +00:00
Peter Boyle
d1a89af8c9
Change to reporting
2019-11-22 10:49:10 -05:00
Peter Boyle
d91ba1f6cc
NVCC timer support
2019-11-21 20:11:19 +00:00
Peter Boyle
f4d27e7090
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-11-21 20:09:31 +00:00
Peter Boyle
feb1ff3494
Fix nocompilee under NVCC
2019-11-21 20:03:39 +00:00
Peter Boyle
8ef6175acc
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-11-21 15:02:21 -05:00
Peter Boyle
e4399e3ee1
--enable-summit flag
2019-11-21 15:02:10 -05:00
Peter Boyle
98ea67b636
IBM summit optimisation. Synchronise in node is still btweeen 2 halves of AC922, so could
...
be a little faster
2019-11-21 15:00:46 -05:00
ferben
421a4395af
Sigma to Nucleon contractions
2019-11-21 17:25:37 +00:00
Fionn O hOgain
cf95a460a5
Sliced propagator contraction was not producing any results because buf.size()=0
2019-11-21 17:17:55 +00:00
a60e20f265
Merge pull request #250 from mmphys/hadrons-typos
...
several typos in hadrons
2019-11-20 17:10:08 +00:00
ferben
9261c0da89
several typos in hadrons
2019-11-20 17:06:32 +00:00
ferben
b350a24ded
fixed test_distil
2019-11-18 15:29:20 +00:00
Michael Marshall
13a0db7162
Reverse changes not intended to be part of distillation release
2019-11-18 12:34:49 +00:00
Michael Marshall
18177d9709
Review changes
2019-11-18 11:59:13 +00:00
Michael Marshall
7bf42b9c0e
HADRONS_ERROR
2019-11-18 10:27:35 +00:00
ferben
2d6f4e0c09
fixed issue with HADRONS_ERROR, no idea why this works
2019-11-15 13:46:47 +00:00
ferben
7f06c40107
_var -> var_
2019-11-15 13:26:24 +00:00
ferben
9f75065205
eigen_strong_inline gone
2019-11-15 13:22:20 +00:00
ferben
271a02230e
assert -> ERROR
2019-11-15 11:11:50 +00:00
ferben
b1e8b5b5ce
changed default behaviour as discussed with antonin
2019-11-15 11:00:25 +00:00
ferben
25d2521d77
small stuff
2019-11-13 16:34:09 +00:00
ferben
500ef17143
beauty
2019-11-13 15:14:51 +00:00
ferben
ee9dd22643
worked on test_distil
2019-11-13 14:59:44 +00:00
ferben
a977d9901b
cleanup
2019-11-13 14:52:06 +00:00
ferben
667ffb70db
changed error type
2019-11-13 12:16:56 +00:00
ferben
65b3059bd7
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-11-13 11:51:14 +00:00
ferben
5238808ccd
No DistilVectors specified in xml no throws an error
2019-11-13 11:50:55 +00:00
Michael Marshall
8f88fee680
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
...
* 'feature/distil' of github.com:mmphys/Grid:
made notation DPar->dp consistent over modules
2019-11-13 11:34:10 +00:00
Michael Marshall
fcc412a1c2
Remove conditional compilation to support GPU build
2019-11-13 11:32:23 +00:00
ferben
12e415330f
made notation DPar->dp consistent over modules
2019-11-13 11:21:08 +00:00
Michael Marshall
66e0811317
Attempt to fix cuda build
2019-11-13 00:02:51 +00:00
Michael Marshall
55e743aad6
Streamline
2019-11-12 23:57:28 +00:00
Michael Marshall
e2ab0d671e
Implement destructors
2019-11-12 23:18:37 +00:00
Michael Marshall
7a4c5dbbd5
Restoring previous version for _reduced variables
2019-11-12 22:12:35 +00:00
Michael Marshall
3f00b8f6c7
Switch to std::unique_ptr<GridCartesian> grid3d;
...
Remove hand-coded reference to pi - switch to <math.h> definition
2019-11-12 21:53:09 +00:00
Michael Marshall
6d7043e0c2
NamedTensor changes done
2019-11-12 17:31:42 +00:00
ferben
b0f24ec302
Test works now
2019-11-12 15:14:13 +00:00
Michael Marshall
fb2834bf82
Oops
2019-11-12 14:01:20 +00:00
Michael Marshall
78f75b0e9f
Better than graffiti
2019-11-12 14:00:46 +00:00
Michael Marshall
62dd0bfe58
New parameter module compiles. Untested.
2019-11-12 13:59:53 +00:00
ferben
db952993fa
envCreate problem..
2019-11-12 12:23:34 +00:00
ferben
b8f0878981
removed most default behaviour
2019-11-11 17:49:38 +00:00
ferben
df586a142d
added DistilPar-module and cleaned up some code
2019-11-11 17:29:55 +00:00
ferben
7a446d5b7f
removed default filenames
2019-11-11 14:36:45 +00:00
ferben
e7d7ea4f8f
added LoadNoise module
2019-11-11 12:55:45 +00:00
Michael Marshall
f8e1941327
Implemented specialisations of NamedTensor as derived classes, however this suffers a number of problems:
...
1) virtual functions not available in base class constructor where I'd like to use them - e.g. IndexNames
2) Must define new constructors in derived classes
... so the specialisations are fatter than I'd like. Would prefer to revert to specifying tensor name and index name defaults in template
2019-11-08 11:55:00 +00:00
65aa54804e
added comments
2019-11-08 11:15:51 +00:00
ferben
293bfe17d1
added code to the noise module...
2019-11-07 14:00:40 +00:00
ferben
a8f3a111a5
added Serial RNG - code compiles but not tested!
2019-11-07 13:45:38 +00:00
ferben
5c23abe507
commented on Notation
2019-11-07 11:57:40 +00:00
Michael Marshall
22c654182a
Fixes for GPU compile
2019-11-04 17:24:34 +00:00
Michael Marshall
6f0439c0e4
Remove unnecessary cast
2019-11-04 15:50:14 +00:00
Michael Marshall
4f9a7c5d76
Back out unnecessary change
2019-11-02 16:50:29 +00:00
Michael Marshall
fcd90705bc
Beautification
2019-11-02 16:15:48 +00:00
Michael Marshall
4bcdb4ff95
Remove accidental check-in of local debugging
2019-11-02 15:24:12 +00:00
Michael Marshall
1c10933db1
Rationalisation of NamedTensor (Perambulator)
2019-11-02 14:58:32 +00:00
Michael Marshall
52d8d576d0
Removed SliceShare as a reusable routine
2019-11-01 20:10:51 +00:00
Michael Marshall
ada0a7a83b
C++11 case comparison of named tensor index names
2019-11-01 16:05:08 +00:00
Michael Marshall
efe2f2d48b
Merge branch 'develop' into feature/distil
...
* develop:
Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions Fixed Lattice::reset not compiling with new Grid explicit memory region handling Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-11-01 15:38:48 +00:00
Michael Marshall
45d4cf0971
Cleanup in progress
2019-11-01 15:35:07 +00:00
Peter Boyle
ac614cbc53
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-10-31 11:46:43 -04:00
Peter Boyle
ec8e060ec7
Summit jsrun GPU mapping updates. Conffigure with --enable-jsrun
2019-10-31 11:46:09 -04:00
Felix Erben
5c54f27ac1
some cleanup, but hard-coded src in LapEvec unclear
2019-10-31 11:51:05 +00:00
Felix Erben
4ed9379535
some cleanup
2019-10-31 11:45:50 +00:00
Michael Marshall
858e348a6d
Cleanup of messages
2019-10-31 11:11:52 +00:00
Michael Marshall
3b3680c64e
Reversed Felix's interim A2Autils.h changes ... these were finished and went into develop via a separate branch
2019-10-30 15:50:04 +00:00
Michael Marshall
2a926b3dc6
Merged latest changes from develop, in preparation for release.
2019-10-30 14:52:34 +00:00
Chris K
845a045493
Merge pull request #233 from giltirn/lanczos_fix
...
A few run /compile / memory leak fixes
2019-10-30 10:21:59 -04:00
Michael Marshall
eb8848a071
Merge branch 'develop' into feature/distil
...
* develop: (27 commits)
Update README.md
result layout standardised, iterator size more elegant
updated syntac in Test_hadrons_spectrum
chroma-regression test now prints difference correctly
baryon input strings are now pairs of pairs of gammas - still ugly!!
second update to pull request
Changing back interface for Gamma3pt
Removing old debug code
Changes to A2Autils
suggested changes for 1st pull request implemented
changed input parameters for easier use
Should compile everywhere now
changed baryon interface
added author information
ready for pull request
code compiling now - still need to test
Baryons module works in 1 of 3 cases - still need SlicedProp and Msource part!!
thread_for caused the problems - slow for loop for now
still bugfix
weird bug...
...
# Conflicts:
# Hadrons/Modules.hpp
# Hadrons/modules.inc
2019-10-30 14:13:00 +00:00
Peter Boyle
f31e3278a6
Update README.md
2019-10-25 11:43:55 -04:00
Michael Marshall
ca234325bc
Fix single-precision error
2019-10-23 21:49:32 +01:00
c97f780784
Merge pull request #243 from fionnoh/feature/A2A_current_insertion
...
Feature/a2 a current insertion
2019-10-22 13:55:53 +01:00
Michael Marshall
78bdb0ff6a
Grid
2019-10-20 14:22:45 +01:00
Michael Marshall
decab587a0
PerambFileName defaults to object name if empty
2019-10-20 14:14:06 +01:00
202f025fc7
Merge pull request #242 from mmphys/feature/baryons
...
Feature/baryons
2019-10-16 15:06:32 +01:00
Felix Erben
3c702b510b
result layout standardised, iterator size more elegant
2019-10-15 18:48:51 +01:00
Michael Marshall
519ce19128
Fixes to enable GPU build. NB: Contractor and ContractorBenchmark still not working
2019-10-14 22:40:13 +01:00
Felix Erben
8d166a81c0
updated syntac in Test_hadrons_spectrum
2019-10-14 13:41:08 +01:00
Felix Erben
aa62ca9046
chroma-regression test now prints difference correctly
2019-10-10 11:07:20 +01:00
Felix Erben
2dee4791db
baryon input strings are now pairs of pairs of gammas - still ugly!!
2019-10-09 17:56:09 +01:00
Felix Erben
548b3bf43c
second update to pull request
2019-10-09 14:52:33 +01:00
Fionn O hOgain
a55d0ba8fe
Changing back interface for Gamma3pt
2019-10-08 15:52:01 +01:00
Fionn O hOgain
5de9547db5
Removing old debug code
2019-10-08 15:51:28 +01:00
Fionn O hOgain
6a3b09cf02
Merge branch 'develop' of github.com:fionnoh/Grid into feature/A2A_current_insertion
2019-10-08 13:25:51 +01:00
Fionn O hOgain
10de4bfc23
Changes to A2Autils
2019-10-08 13:24:56 +01:00
Felix Erben
2ce7f2b4d8
suggested changes for 1st pull request implemented
2019-10-08 13:19:47 +01:00
Michael Marshall
88d6ff8f1d
Peter's bugfix in ImplicitlyRestartedLanczos.h
...
My bugfix in MomentumPhase.hpp
2019-10-07 17:36:11 +01:00
Michael Marshall
803329af99
Merge branch 'develop' into feature/distil
...
* develop:
Fix after GPU merge: Phase in Free Propagator
z2-momentum phase module
# Conflicts:
# Hadrons/Modules/MSource/MomentumPhase.hpp
2019-10-07 13:09:52 +01:00
Michael Marshall
9d96899aa8
Doc bugfix
2019-10-07 13:05:04 +01:00
Michael Marshall
86939dbf1a
Removed unnecessary function (for getting a parameter)
2019-10-04 13:59:59 +01:00
317645aaeb
undo (most) whitespace changes in the two files HMC/Mobius2p1fEOFA{,_F1}.cc
2019-10-02 16:25:23 +01:00
Felix Erben
e280ec6b0b
changed input parameters for easier use
2019-10-02 16:14:06 +01:00
d5a180d914
Merge branch 'fix/removeQCDremnants' into fix/remQCDns_ignore_ws
2019-10-02 16:11:27 +01:00
d2928761dd
Merge pull request #240 from guelpers/feature/bugfixafterGPUmerge
...
Fix after GPU merge: Phase in Free Propagator
2019-10-02 15:00:15 +01:00
f2a74c603f
Merge pull request #239 from mmphys/z2_momentum
...
z2-momentum phase module
2019-10-02 14:57:59 +01:00
5f22810f55
Fix after GPU merge: Phase in Free Propagator
2019-10-02 14:49:35 +01:00
Michael Marshall
92e25488f8
Added MomentumPhase Hadrons module from z2_momentum branch (thankyou, Felix) so I can run Z_2 wall with momenta easily
2019-10-02 14:13:35 +01:00
Michael Marshall
89ef2b7dc2
Should compile everywhere now
2019-10-02 13:20:07 +01:00
Michael Marshall
7606554b76
Remove references to unused modules (now part of separate Baryons branch)
2019-10-02 13:16:58 +01:00
Felix Erben
c8fc0b3e0c
changed baryon interface
2019-10-02 11:36:39 +01:00
Felix Erben
ccb5e8374b
z2-momentum phase module
2019-09-30 17:36:15 +01:00
Felix Erben
b88fd436e7
added author information
2019-09-30 17:07:46 +01:00
Felix Erben
155bcd4ff3
ready for pull request
2019-09-30 16:58:20 +01:00
Fionn O hOgain
d1daab601a
Merge branch 'develop' of github.com:fionnoh/Grid into feature/A2A_current_insertion
...
Peter's GPU branch changes merged with A2A CI code
2019-09-30 16:53:44 +01:00
Felix Erben
e5d7910fa7
code compiling now - still need to test
2019-09-30 13:55:26 +01:00
Felix Erben
94b9a9474c
Baryons module works in 1 of 3 cases - still need SlicedProp and Msource part!!
2019-09-27 15:08:56 +01:00
Felix Erben
bf62ec163d
thread_for caused the problems - slow for loop for now
2019-09-26 13:33:49 +01:00
Felix Erben
8415e23fc6
still bugfix
2019-09-26 11:09:09 +01:00
Felix Erben
76c93aa44e
weird bug...
2019-09-17 14:36:26 +01:00
Michael Marshall
3137628222
BaryonUtils.h is now part of Baryons
2019-09-17 13:19:20 +01:00
Michael Marshall
ce965ee6bb
Cleanup tests that are no longer required
2019-09-17 13:10:59 +01:00
Michael Marshall
911fbb0f36
Cleanup modules that are no longer required
2019-09-17 13:06:52 +01:00
Michael Marshall
eb293e9909
Restore Baryons modules per develop branch
2019-09-16 20:29:37 +01:00
Felix Erben
f548114ff6
bugfix
2019-09-16 17:55:58 +01:00
Felix Erben
dab8c01c3d
added Baryon code
2019-09-16 17:20:54 +01:00
Michael Marshall
2f3dd0703d
Ensure Distillation test (Test_distil) works
2019-09-16 17:00:46 +01:00
Michael Marshall
2e963d1a78
Fix location of Grid.h and remove reference to QCD namespace
2019-09-16 15:34:47 +01:00
Michael Marshall
bf52e7cc96
Latest BaryonUtils.h from Felix + my fixes
2019-09-13 18:11:10 +01:00
Michael Marshall
61d017d0a5
Merge GPU support (upstream/develop) into distillation branch.
...
This compiles and looks right ... but may need some testing
* develop: (762 commits)
Tensor ambiguous fix
Fix for GCC preprocessor/pragma handling bug
Trips up NVCC for reasons I dont understand on summit
Fix GCC complaint
Zero() change
Force a couple of things to compile on NVCC
Remove debug code
nvcc error suppress
Merge develop
Reduction finished and hopefully fixes CI regression fail on single precisoin and force
Double precision variants for summation accuracy
Update todo list
Freeze the seed
Fix compiling of MSource::Gauss for single precision
Think the reduction is now sorted and cleaned up
Fix force term
Printing improvement
GPU reduction fix and also exit backtrace option
GPU friendly
Simplify the comms benchmark
...
# Conflicts:
# Grid/communicator/SharedMemoryMPI.cc
# Grid/qcd/action/fermion/WilsonKernelsAsm.cc
# Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
# Grid/qcd/smearing/StoutSmearing.h
# Hadrons/Modules.hpp
# Hadrons/Utilities/Contractor.cc
# Hadrons/modules.inc
# tests/forces/Test_dwf_force_eofa.cc
# tests/forces/Test_dwf_gpforce_eofa.cc
2019-09-13 13:30:00 +01:00
Michael Marshall
04a661cafe
Remove unused modules BC2 and Baryon2
2019-09-10 14:49:24 +01:00
gfilaci
a7fa86dc29
MooeeInv improvement for DW EOFA + comments
2019-09-05 12:05:21 +01:00
gfilaci
0c1efa5235
pass OpenMP flag to host compiler
2019-09-03 12:12:25 +01:00
gfilaci
fdd9b14e82
speed up MooeeInvDag for DWF EOFA
2019-09-02 14:49:51 +01:00
gfilaci
e66669d300
fast MooeeInv for EOFA
2019-09-02 14:26:13 +01:00
gfilaci
0efaf3c4fa
access M5D coeffs through pointers
2019-09-02 11:33:00 +01:00
gfilaci
3ef519aaa4
fast MooeeInv
2019-09-02 11:18:14 +01:00
Peter Boyle
b473405652
Tensor ambiguous fix
2019-08-29 09:36:41 -05:00
Christopher Kelly
114ebb7914
Fixed Lanczos calling aligned alloc in threaded region hitting up against pointer-cache no-threading restrictions
...
Fixed Lattice::reset not compiling with new Grid explicit memory region handling
Fixed memory leak in Lattice::resize that occurs when data region has been previously allocated
2019-08-26 16:47:44 -04:00
Peter Boyle
9b7a6d197f
Fix for GCC preprocessor/pragma handling bug
2019-08-23 14:37:46 +01:00
Peter Boyle
59cd7f3b70
Trips up NVCC for reasons I dont understand on summit
2019-08-23 06:03:49 -04:00
Peter Boyle
28d6be2a4e
Fix GCC complaint
2019-08-22 18:56:37 +01:00
6b6c5aa626
remove namespace QCD from directory tests
2019-08-20 15:35:36 +01:00
9210b0aa6e
remove namespace QCD from directory HMC
2019-08-20 15:21:23 +01:00
ad01290545
remove remnants of the namespace QCD
2019-08-19 20:30:33 +01:00
Fionn O hOgain
25150eb2e0
3pt contraction now takes a list of gammas
2019-08-15 12:09:30 +01:00
Peter Boyle
95f66cc93c
Merge branch 'feature/gpu-port' into develop
2019-08-15 02:19:31 +01:00
Peter Boyle
12eb2a6a34
Zero() change
2019-08-15 01:43:00 +01:00
Peter Boyle
7c8902b04f
Merge branch 'develop' into feature/gpu-port
2019-08-15 01:33:07 +01:00
Peter Boyle
4278caa030
Force a couple of things to compile on NVCC
2019-08-15 01:32:03 +01:00
Peter Boyle
be37dfb6f8
Remove debug code
2019-08-15 01:31:40 +01:00
Peter Boyle
5e8437029f
nvcc error suppress
2019-08-15 01:31:12 +01:00
Peter Boyle
e279b2be29
Merge develop
2019-08-14 23:01:59 +01:00
Peter Boyle
48e6efc7c9
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/qcd/action/fermion/WilsonKernelsAsm.cc
Grid/qcd/action/fermion/implementation/ImprovedStaggeredFermionImplementation.h
Grid/qcd/action/fermion/implementation/StaggeredKernelsAsm.h
benchmarks/Benchmark_comms.cc
2019-08-14 18:56:54 +01:00
55c095f620
Merge pull request #226 from nils-asmussen/fix/Gauss
...
Fix compiling of MSource::Gauss for single precision
2019-08-14 17:50:38 +01:00
Peter Boyle
3e49dc8a67
Reduction finished and hopefully fixes CI regression fail on single precisoin and force
2019-08-14 15:18:34 +01:00
Peter Boyle
96ac56cace
Double precision variants for summation accuracy
2019-08-14 13:08:01 +01:00
Peter Boyle
2b037e3daa
Update todo list
2019-08-14 13:07:26 +01:00
Peter Boyle
2d2de7aede
Freeze the seed
2019-08-14 13:07:11 +01:00
e3966aa49b
Fix compiling of MSource::Gauss for single precision
2019-08-12 14:57:11 +01:00
Peter Boyle
ce97638bac
Think the reduction is now sorted and cleaned up
2019-08-11 11:09:01 +01:00
Peter Boyle
53e3ab4131
Fix force term
2019-08-11 11:06:13 +01:00
c2c4252a07
Merge pull request #216 from nils-asmussen/feature/GaussianSmearing
...
feature/gaussian smearing
2019-08-08 12:29:55 +02:00
Fionn O hOgain
d566637cec
Merge branch 'develop' of github.com:fionnoh/Grid into feature/A2A_current_insertion
2019-08-07 12:11:40 +01:00
Felix Erben
51bed48cd2
added selfcontract module
2019-08-05 17:46:42 +01:00
Felix Erben
b875edceab
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
...
Conflicts:
Grid/qcd/utils/BaryonUtils.h
Hadrons/Modules/MContraction/Baryon2.hpp
2019-08-05 14:19:43 +01:00
Felix Erben
29df60c0cb
some debugging stuff
2019-08-05 14:10:04 +01:00
Michael Marshall
8d97e2a02a
Say which A2AMatrix is being loaded, and which contraction is being performed (m of n)
2019-08-02 19:23:18 +01:00
Michael Marshall
ed23f6be20
Remove blank line from log
2019-08-02 15:59:18 +01:00
Michael Marshall
cad76827b0
Be consistent about separator usage. Log start / stop / duration
2019-08-02 15:47:20 +01:00
Michael Marshall
310867d46a
Additional option to specify the separator used between terms in correlator
2019-08-02 11:25:29 +01:00
Michael Marshall
e598178d94
TODO: Felix, please fix. I commented this out because of compiler errors
2019-08-01 20:51:51 +01:00
Michael Marshall
723457d467
Contractor updates ready for test on Tesseract:
...
1) Move definitions of serialisable objects into header for re-use by external programs/utilities
2) Add "-s" switch for "Simple" correlators, i.e. only include A2AMatrix info for the actual fields included in each contraction
2019-08-01 20:35:55 +01:00
Michael Marshall
6f40021842
Fixed compiler errors: TODO: Felix, please validate
2019-08-01 19:57:59 +01:00
Peter Boyle
9cd33a7b9c
Printing improvement
2019-07-31 08:01:24 +01:00
Peter Boyle
639dc1ab21
GPU reduction fix and also exit backtrace option
2019-07-31 01:23:23 +01:00
Peter Boyle
9117f61109
GPU friendly
2019-07-31 01:22:54 +01:00
Peter Boyle
bca36d9bc3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-07-30 22:51:23 +01:00
Peter Boyle
263dcbabab
Simplify the comms benchmark
2019-07-30 22:51:04 +01:00
Felix Erben
622d5eaa3e
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-07-30 13:47:22 +01:00
Felix Erben
e66d48c142
second way to compute baryons - qdp style
2019-07-30 13:46:59 +01:00
Peter Boyle
9dad7a0094
Reproducible reduction and axpy_norm offload from Gianluca.
...
Hopefully get CG running entirely on GPU
2019-07-30 00:14:12 +01:00
Peter Boyle
8c6016f717
Merge pull request #219 from mmphys/feature/include
...
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-07-29 23:08:01 +01:00
Peter Boyle
1282e1067f
Do the force term on the accelerator too. Needed particularly because comms buffers
...
are device memory.
2019-07-29 22:58:35 +01:00
Michael Marshall
f5ad4f3de8
Added the ability to write a version of the validated XML file excluding any of the module IDs supplied in a separate exclude file
2019-07-26 19:46:55 +01:00
Peter Boyle
275c1c920f
More info dump on error from CUDA
2019-07-26 12:18:53 +01:00
Peter Boyle
fe700a183a
Getting HMC to run
2019-07-26 12:18:29 +01:00
Peter Boyle
34108296cd
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
Grid/simd/Grid_avx512.h
2019-07-20 17:05:35 +01:00
Peter Boyle
76c704b84b
Intrinsics for CLANG are now fixed in v6
2019-07-20 16:52:24 +01:00
Peter Boyle
ce255ec359
Relocate to fix build failure for comms none
2019-07-20 16:37:03 +01:00
Peter Boyle
1c096626cb
Hypercube defaults to on if HPE detected, but override to off possible
2019-07-20 16:06:16 +01:00
Peter Boyle
ce8b247426
Compiles
2019-07-20 15:16:02 +01:00
Peter Boyle
80481f81be
Constructor typo
2019-07-20 09:58:24 +01:00
Peter Boyle
d85dcc72df
Multinode fix
2019-07-20 07:13:28 +01:00
Peter Boyle
3fedcd6d52
Compiles
2019-07-20 07:12:44 +01:00
Michael Marshall
e7050a7aed
Support gamma structure names that have trailing white space
2019-07-19 11:58:56 +01:00
Felix Erben
e138bc7204
debug output
2019-07-19 11:16:35 +01:00
Peter Boyle
25ba4c5f80
Merge branch 'develop' into feature/gpu-port
...
Conflicts:
HMC/Mobius2p1fEOFA.cc
tests/forces/Test_rect_force.cc
2019-07-19 11:01:55 +01:00
Peter Boyle
671bcbcccb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-07-19 10:48:22 +01:00
Peter Boyle
ff325376cb
Fix single precision deriv test fail
2019-07-19 10:47:44 +01:00
Michael Marshall
6d4fb35d84
Ready for testing
2019-07-19 10:33:03 +01:00
Peter Boyle
9e926e3fc5
Build fix in develop
2019-07-19 10:01:52 +01:00
Peter Boyle
775eaee199
Fix for suspected Intel 2018.1 compiler bug under O3
2019-07-19 07:57:34 +01:00
Peter Boyle
0fd2827d5d
Fix fail in single
2019-07-19 05:28:26 +01:00
Peter Boyle
bdd79f9ef8
TODO update
2019-07-18 22:04:28 +01:00
Peter Boyle
0695f8cec2
Single precision compile fix. Soon deprecate single precision
2019-07-18 22:02:31 +01:00
Peter Boyle
9fa705c5a0
comma fix
2019-07-18 21:38:11 +01:00
Felix Erben
56cefadf9b
gamma matrices as input
2019-07-18 17:46:43 +01:00
ferben
9d82855c5d
bugfix in Baryonutils
2019-07-18 15:45:43 +01:00
ferben
97d61f2564
bugfix in Baryonutils
2019-07-18 14:57:10 +01:00
Peter Boyle
331f5a53dc
New header
2019-07-18 14:51:09 +01:00
Peter Boyle
a23dc295ac
Remove compiler errors and warnings
2019-07-18 14:47:02 +01:00
ferben
11a8668d19
bugfix in Baryonutils
2019-07-18 14:44:55 +01:00
ferben
cded7670d0
new utils for baryons
2019-07-18 14:29:04 +01:00
ferben
feb029fb66
new utils for baryons
2019-07-18 14:24:16 +01:00
Peter Boyle
08904f830e
Merge develop
2019-07-16 11:59:56 +01:00
Peter Boyle
fa9cd50c5b
Merge branch 'develop' into feature/gpu-port
2019-07-16 11:55:17 +01:00
Felix Erben
5a62ebe7b1
general baryons case added
2019-07-15 15:26:30 +01:00
Peter Boyle
7c11525d1a
Local stencil for complex wilson loops etc
2019-07-14 14:05:09 +01:00
Peter Boyle
42c1dbb1d1
General local stencil first cut for Patrick force term
2019-07-14 14:04:28 +01:00
Peter Boyle
6179acfda0
Put back a call that was required
2019-07-14 13:59:54 +01:00
Michael Marshall
fa747173d1
Debugging references were to l-values, so added const to stop errors
2019-07-14 11:08:00 +01:00
Peter Boyle
07601ac1f5
Replace instantiation of Gparity
2019-07-12 17:18:12 +01:00
Peter Boyle
705a8098b2
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
...
Conflicts:
Grid/stencil/Stencil.h
2019-07-12 17:14:11 +01:00
Peter Boyle
a29b43d755
Stencil comms cleaner
2019-07-12 17:12:25 +01:00
Peter Boyle
368c8369ce
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
2019-07-12 17:11:29 +01:00
Peter Boyle
c0d89a2dbb
TODO updates
2019-07-12 17:11:15 +01:00
Peter Boyle
78ebd93281
Cuda 9.1 happy
2019-07-12 17:11:00 +01:00
Peter Boyle
3d58daf70f
Safety check
2019-07-12 17:10:35 +01:00
Peter Boyle
bd155ca5c0
Overlap comms with comput now supported
2019-07-12 09:09:40 +01:00
Peter Boyle
91e2cf9b40
All axes can be used for comms now
2019-07-12 09:08:26 +01:00
Peter Boyle
3cc9947731
Better welcome printing
2019-07-12 06:47:51 +01:00
Peter Boyle
f15eeb0283
localise scope of variables declared in macro
2019-07-12 06:47:01 +01:00
Peter Boyle
0996ba9396
Pretty messaging
2019-07-12 06:45:31 +01:00
Michael Marshall
12afb0395f
Debugging transposeSpin - seems just not to be implemented for Lattice<x>
2019-07-11 17:42:26 +01:00
Felix Erben
ec4aa978ab
why cant I spinTranspose
2019-07-11 14:01:41 +01:00
Peter Boyle
966a203dcb
Interactions with GPU compilation
2019-07-11 03:16:17 +01:00
Peter Boyle
44170cc15f
Initialise CUDA device prior to entering MPI.
...
This may or may not interact with Summit which configures MPI - CUDA mapping with jsrun.
TBD
Cases of OpenMPI and MVAPICH are covered, and default to cudaSetDevice(0) otherwise
2019-07-11 03:14:23 +01:00
Michael Marshall
7bc4a06f3f
This is probably what you want ...
2019-07-10 12:29:33 +01:00
Michael Marshall
cd659525e1
You probably want to add this to the build. And you may need to do a bootstrap
2019-07-10 12:08:37 +01:00
Felix Erben
dc2240d2d8
why does sliceSum in Nucleon.hpp not work
2019-07-10 11:34:16 +01:00
Felix Erben
98cf20cf06
continued work on baryons
2019-07-09 17:42:36 +01:00
Felix Erben
cc3346073e
continued work on baryons
2019-07-09 17:30:32 +01:00
Felix Erben
3848da7c50
added nucleon module (non-distillation)
2019-07-08 17:43:14 +01:00
c3d0c176ab
cleaning up Kl2 contraction
2019-07-05 16:29:46 +01:00
0a71f8bb10
Merge pull request #222 from guelpers/feature/kl2QEDseq
...
EMLepton: Multiple source-sink separations at once
2019-07-05 16:22:34 +01:00
Felix Erben
b7d0cf6751
buxfix in diquark sum / baryons
2019-07-04 22:06:37 +01:00
3a31ba2ea2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QEDseq
2019-07-03 14:37:56 +01:00
eac6337466
Hadrons: EMLepton: multiple source-sink separations at once
2019-07-03 14:36:34 +01:00
ab7537e002
Merge pull request #221 from fionnoh/bugfix/A2ALoop
...
Bugfix for A2ALoop module
2019-07-03 14:13:51 +01:00
Felix Erben
2c1a077369
continued on baryons
2019-07-02 17:55:28 +01:00
Peter Boyle
6e3c3214a3
Offload loops
2019-07-02 17:25:40 +01:00
Peter Boyle
d6ffadb33b
Coalesced write
2019-07-02 17:25:13 +01:00
Michael Marshall
ae3abbe53d
Added the ability for Perambulator module to save unsmeared sinks through the addition of two optional parameters:
...
UnsmearedSinkFileName: If present, specifies the filename to write to
UnsmearedSinkMultiFile: defaults to true to write each sink vector to a different file, but can be set to 0 for a single file
2019-07-01 17:28:27 +01:00
Felix Erben
5fc0188205
started saving sinks
2019-07-01 14:51:59 +01:00
Peter Boyle
4c3225412b
Drop 5dVEC
2019-07-01 07:31:26 +01:00
Peter Boyle
b8f7bfbb26
Dont stream as poor perf in some cases
2019-07-01 07:30:25 +01:00
Peter Boyle
7b7c470917
Accelerator loop
2019-07-01 07:29:51 +01:00
Peter Boyle
532e226b22
cuda 9.1 fixes
2019-07-01 07:29:22 +01:00
Peter Boyle
6a13731818
Move GPU cuda call earlier
2019-07-01 07:28:41 +01:00
fionnoh
67690df3bd
Changes nedded to have a current insertion on every second time slice - avoids unnecessary contractions
2019-06-28 15:18:28 +08:00
fionnoh
1059189abf
Bugfix for A2ALoop module
2019-06-27 13:49:55 +08:00
fionnoh
ce29b18dc9
New modules for loading in MFs as diskvectors and producing propagaotrs from 4 quark contractions
2019-06-27 13:46:06 +08:00
fionnoh
421a0a8a36
Changes to A2Autils, A2AMatirx and DiskVector code that is needed for Hadrons 4 quark contraction module
2019-06-27 13:45:20 +08:00
fionnoh
ac530636ca
A2Aloop bugfix
2019-06-27 13:44:47 +08:00
Michael Marshall
2d940a598c
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:37:50 +01:00
Michael Marshall
c28c5fc61b
Inserted four extra parameters just to make this test compile. Needs to be fixed properly
2019-06-19 10:31:41 +01:00
Michael Marshall
015340d60c
Elided superfluous copy on write
2019-06-19 09:37:03 +01:00
Peter Boyle
1cd4ee0706
Thrust used on GPU builds
2019-06-18 12:50:35 +01:00
Peter Boyle
b8f71b6777
Fix NVCC warning unused variable
2019-06-17 13:58:45 +01:00
Peter Boyle
703dc20377
Compile tests fix
2019-06-16 13:59:29 +01:00
Peter Boyle
d976e5c514
Pow is being awkward in thrust for reasons I don't understand. Possible thrust bug.
2019-06-16 12:05:11 +01:00
Peter Boyle
d7b3efe893
Compile fix
2019-06-15 17:03:15 +01:00
Peter Boyle
f710d7bd45
TODO list update
2019-06-15 12:54:27 +01:00
Peter Boyle
cb336aa8f8
Thread loop constructs changing a little
2019-06-15 12:54:11 +01:00
Peter Boyle
462900b48d
Modified entire test directory to suit new GPU constructs for looping
2019-06-15 12:53:27 +01:00
Peter Boyle
0561c2edeb
Benchmarks modified for new GPU constructs
2019-06-15 12:52:56 +01:00
Peter Boyle
0184719216
Change to predicate type
2019-06-15 12:52:26 +01:00
Peter Boyle
24202dbc51
Thread loop construct change
2019-06-15 12:52:07 +01:00
Peter Boyle
d763c303c5
Clean acceleerator barrier
2019-06-15 12:51:45 +01:00
Peter Boyle
8e394d3bf9
New loop construct
2019-06-15 12:51:15 +01:00
Peter Boyle
b881d5489b
Move SchurDiagTwoKappa to Algorithms
2019-06-15 12:50:45 +01:00
Peter Boyle
82306913a8
Move Schur operator into correct place
2019-06-15 12:49:22 +01:00
Peter Boyle
49f90cc7eb
use pragma once
2019-06-15 12:45:22 +01:00
Peter Boyle
b77af0210b
Thread loop. Probably deprecate this impl
2019-06-15 12:44:56 +01:00
Peter Boyle
5254ede2d8
New loops. Revisit as accelerator loop in future audit
2019-06-15 12:44:29 +01:00
Peter Boyle
16e5d7945e
Hard to make 5D vec work with GPU code
2019-06-15 12:43:43 +01:00
Peter Boyle
decc99ca76
Accelerator version
2019-06-15 12:43:00 +01:00
Peter Boyle
464cd65931
Still to test this fully
2019-06-15 12:35:14 +01:00
Peter Boyle
a1ec2f4723
Still to test this routine fully
2019-06-15 12:33:55 +01:00
Peter Boyle
ea9662ec85
Thread loop changes
2019-06-15 09:09:57 +01:00
Peter Boyle
52c74f1cac
Thread loop changes
2019-06-15 09:08:16 +01:00
Peter Boyle
9a13d2992c
lean up
2019-06-15 09:05:16 +01:00
Peter Boyle
b0449ae270
Thread loop changes
2019-06-15 09:04:19 +01:00
Peter Boyle
1299225105
Accelerator loop changes
2019-06-15 09:03:46 +01:00
Peter Boyle
5925e7f405
Thread for changes
2019-06-15 09:01:30 +01:00
Peter Boyle
be1fd4930f
Template instantiation make happy changes
2019-06-15 08:37:34 +01:00
Peter Boyle
377fa5dec1
looping construct
2019-06-15 08:36:48 +01:00
Peter Boyle
e8b78f596e
Looping construct changes
2019-06-15 08:35:57 +01:00
Peter Boyle
09720c40cd
Coalesced loops
2019-06-15 08:35:26 +01:00
Peter Boyle
bb024dd114
Loop construct changed
2019-06-15 08:30:05 +01:00
Peter Boyle
52456b9ec7
New loop construct
2019-06-15 08:28:45 +01:00
Peter Boyle
b285138be4
Better checking on types
2019-06-15 08:27:48 +01:00
Peter Boyle
c7dbf4c87e
Scalar support for GPU threads
2019-06-15 08:25:43 +01:00
Peter Boyle
1e889c93b8
Insert a GPU synchronise
2019-06-15 08:23:26 +01:00
Peter Boyle
7379047482
Threading and acceleration primitives further changes. accelerator_barrier() needed and used
2019-06-15 08:22:48 +01:00
Peter Boyle
d836ce3b78
Clean up of acceleration and threading primitives
2019-06-15 08:14:21 +01:00
Peter Boyle
cefaacbc07
Changing accelerator loop. Still have work to do for multi-GPU code
2019-06-15 08:10:24 +01:00
Peter Boyle
0074ef7f69
thread loops
2019-06-15 08:04:29 +01:00
Peter Boyle
20359ca15f
Coalesced loops.
2019-06-15 08:03:57 +01:00
Peter Boyle
736358b0cb
Coalesced loops
2019-06-15 08:03:13 +01:00
Peter Boyle
6b692aa726
Thread loops
2019-06-15 08:02:26 +01:00
Peter Boyle
7f99e1cd3b
Coalesced loops
2019-06-15 08:01:39 +01:00
Peter Boyle
f3c89df948
Thread loop changes
2019-06-15 08:00:37 +01:00
Peter Boyle
b7e6d111d7
Thread loop changes. Need to offload this file
2019-06-15 07:59:10 +01:00
Peter Boyle
f39cf69c33
Accelerator loop change
2019-06-15 07:58:23 +01:00
Peter Boyle
8e27338df2
Rationalise number of loop macros
2019-06-15 07:57:40 +01:00
Peter Boyle
bcbb5e9d26
Remove assembly tests
2019-06-15 07:57:05 +01:00
Peter Boyle
0ea7f5279d
Accelerator loop changes
2019-06-15 07:56:14 +01:00
Peter Boyle
18e5de426d
There is a stray use of predicatedWhere introduced by Andrew Lawson in the conserve currents.
...
The conserved currents need rewritten using data parallel operations.
2019-06-15 07:53:58 +01:00
Peter Boyle
e896d81235
Accelerator loop redefine. Coalesce most accesses, but ET engine still to go clean.
2019-06-15 07:52:44 +01:00
Peter Boyle
7b8ccff4f4
Accelerated coalesced loops in most cases
2019-06-15 07:48:00 +01:00
Peter Boyle
68541606ab
Thread loop changes. Soon try these with accelerator loops and benchmark
2019-06-15 07:46:42 +01:00
Peter Boyle
339ea10cc7
First touch only on CPU code
2019-06-15 07:45:43 +01:00
Peter Boyle
d0d8dc8042
Thread loop changes
2019-06-15 07:45:09 +01:00
Peter Boyle
81eb1fd9f2
Accelerator loop changes for coalesced access
2019-06-15 07:44:47 +01:00
Peter Boyle
cb93d32cd9
Thread loop changes
2019-06-15 07:44:08 +01:00
Peter Boyle
8f223962ff
Thread loop changed
2019-06-15 07:43:42 +01:00
Michael Marshall
9a8a63467e
BC2 now runs. setup() runs twice, which had resulted in doubling up of momenta. Also fixed initialisation of momentum phases.
2019-06-12 15:25:59 +01:00
Peter Boyle
36f06555a2
Simplify Impl
2019-06-09 22:26:27 +01:00
Peter Boyle
d6c0e0756d
Remove GPU version
2019-06-09 11:23:42 +01:00
Peter Boyle
3e41b1055c
Remove Gpu only kernels.
2019-06-09 11:20:01 +01:00
Peter Boyle
9fbcfe612c
Update TODO list
2019-06-09 11:19:38 +01:00
Peter Boyle
e78a5e7838
ASM instantiation without link errors
2019-06-09 01:25:21 +01:00
Peter Boyle
da8d87e9da
Cuda switch off
2019-06-08 17:11:38 +01:00
Peter Boyle
8e3a05d89b
Moving the instantiation into a cleaner structure
2019-06-08 13:48:33 +01:00
Peter Boyle
8adc5da7dd
Testig out approaches to kernel writing introducing SIMT_loop temporarily
2019-06-08 13:47:04 +01:00
Peter Boyle
29a244e423
Test of using a lane variable instead of repeated reference to threadIdx.y
2019-06-08 13:46:26 +01:00
Peter Boyle
18cbfecf02
Use symlinks in find command
2019-06-08 13:45:46 +01:00
Peter Boyle
c933ac2248
Temporarily introduce a SIMT_loop to test out approaches prior to making a global change to
...
accelerator_loop
2019-06-08 13:44:27 +01:00
Peter Boyle
ad2c433574
Instantiations move. Tried using Gianluca's suggestion about avoiding threadIdx but doesn't
...
seem to make a difference. Will revisit this and probably remove the lane parameter from the coalescedRead
2019-06-08 13:43:12 +01:00
Peter Boyle
86e7fb6e86
Instantiation relocation
2019-06-08 13:42:46 +01:00
Peter Boyle
fb91dda7be
Hand instantiation moved location
2019-06-08 13:42:26 +01:00
Peter Boyle
82cf7bc5ab
Move instantiation into fermion/instantiation
2019-06-08 13:41:46 +01:00
Peter Boyle
e452cc0a22
Move static variables into instantiation .cc file
2019-06-08 13:41:20 +01:00
Peter Boyle
4d2b938166
Remove explict instantiation from here
2019-06-08 13:41:01 +01:00
Peter Boyle
10d16ab76c
Remove explict instantiation from here
2019-06-08 13:40:32 +01:00
Peter Boyle
1f997fa484
Instantiate via explict .cc files for parallel make.
2019-06-08 13:39:51 +01:00
Peter Boyle
dc5024e88c
The GPU reduction was not working for me and causing errors. Need to revisit.
...
Gianluca is working on deterministic reduction/
2019-06-08 13:39:11 +01:00
Peter Boyle
6d77941990
Drop the 5D vec actions
2019-06-08 13:38:05 +01:00
Peter Boyle
0ee6e77cbc
Compiles GPU and CPU, still gives good performance on CPU
2019-06-05 13:28:16 +01:00
Peter Boyle
18d3cde29a
Compile on GPU workd
2019-06-05 00:14:58 +01:00
Peter Boyle
7323099966
Instatiation fix
2019-06-05 00:14:38 +01:00
Peter Boyle
6379651cdd
Generic or GPU ready for benchmark test on GPU
2019-06-05 00:13:52 +01:00
Peter Boyle
ba4fd756b9
Fix signature, but deprecating this loops style
2019-06-05 00:12:36 +01:00
Peter Boyle
d185fc1ebf
clean up instantiation
2019-06-05 00:11:52 +01:00
Peter Boyle
96b36d8367
Instantiation clean up
2019-06-05 00:11:27 +01:00
Peter Boyle
899f8b5065
Instantiation clean up 5d vec removal
2019-06-05 00:11:05 +01:00
Peter Boyle
c8d0483fe9
Remove 5d vectorisation
2019-06-05 00:10:37 +01:00
Peter Boyle
0f214e5f76
Clean up instantiation
2019-06-05 00:10:13 +01:00
Peter Boyle
8eea568426
GPU loop ; presently differentiated with ifdef, find a way to unify.
2019-06-05 00:09:28 +01:00
Peter Boyle
9636324069
GPU happy code
2019-06-05 00:08:54 +01:00
Peter Boyle
8a5489d9e6
Move the loop into a central kernel call.
2019-06-05 00:08:13 +01:00
Peter Boyle
8113845f9c
coalesce loop. Need to rationalise this file
2019-06-04 23:49:29 +01:00
Peter Boyle
b47f73c222
GPU happy
2019-06-04 21:30:39 +01:00
Peter Boyle
5720ced0fd
Simplifying
2019-06-04 21:30:08 +01:00
Peter Boyle
2c87b56b53
Making GPU happier
2019-06-04 21:29:44 +01:00
Peter Boyle
dbad48d802
Remove Ls vectorised DWF
2019-06-04 21:27:40 +01:00
Peter Boyle
4557a1365a
Remove Ls vectorised DWF
2019-06-04 20:59:59 +01:00
Peter Boyle
16e9b87d98
Remove Ls vectorised DWF as unused and hard to maintain
2019-06-04 20:59:01 +01:00
Peter Boyle
685eea3d0f
Small cosmetic
2019-06-04 20:58:14 +01:00
Peter Boyle
65b48831fb
Simplify code
2019-06-04 20:56:30 +01:00
Peter Boyle
57396fc595
Simplify code
2019-06-04 20:56:23 +01:00
Peter Boyle
a2e199df50
Simplifying Cayley cases.
2019-06-04 20:54:52 +01:00
Peter Boyle
020346c848
WOrk list. Will have to clean up Fermion sector.
2019-06-04 20:54:00 +01:00
Peter Boyle
c2625a127e
Non blocking loop. Want to change the naming here.
2019-06-04 20:52:59 +01:00
Peter Boyle
8794d35c78
GPU
2019-06-04 20:52:27 +01:00
Peter Boyle
24bff6dbe6
Minor improvements
2019-06-04 20:51:48 +01:00
Peter Boyle
45b15d10d3
GPU happy changes
2019-06-04 20:49:16 +01:00
Peter Boyle
33d6bbe32b
GPU must use accelerator vectors
2019-06-04 20:48:52 +01:00
Peter Boyle
7a1569bd46
Annoying, cannot rely on equivalence of Grid ComplexD adn Eigen Complex type on GPU.
...
Solve with ComplexD typecasts but must be a better way
2019-06-04 20:47:49 +01:00
Peter Boyle
6e2e904a0e
NVCC compiles happy. Start to develop strategy for writing generic
...
code for GPU kernels and CPU kernels.
2019-06-04 20:46:35 +01:00
Peter Boyle
d92a17f359
Suppress NVCC warnings in pugixml with pragma
2019-06-04 20:45:53 +01:00
Peter Boyle
47c063f984
Remove Ls Vec cases from benchmarks
2019-06-04 20:45:35 +01:00
Peter Boyle
7e27a5213a
Tests builds clean.
2019-06-04 20:45:20 +01:00
Michael Marshall
fe72dc099b
Upgrade to Mojave forced me to reinstall MacPorts. These are the ports I installed to get Grid working
2019-06-04 16:12:24 +01:00
Peter Boyle
ade4a126da
Getting closer on the GPU port, but will start deleting 5th dim vectorised variants
...
for code maintainability
2019-06-04 11:53:44 +01:00
Peter Boyle
7b59ab5bd7
Compiling after reorganisation
2019-06-03 15:46:26 +01:00
Peter Boyle
fcd8cfe257
Gparity in
2019-06-03 15:45:09 +01:00
Peter Boyle
b4b53812cb
Move implementation to specific implementation headers
2019-06-03 15:43:01 +01:00
Peter Boyle
085cac583f
Implementation in header
2019-06-03 15:42:36 +01:00
Peter Boyle
25e3b8640c
Move to header
2019-06-03 15:42:05 +01:00
Michael Marshall
c81d3d422d
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:25:05 +01:00
Michael Marshall
54edb9906e
Housekeeping. #include <Grid.h> ---> #include <Grid/Grid.h>
2019-06-03 15:20:46 +01:00
Peter Boyle
44bbec50b0
Making GPU compile happy
2019-06-03 14:57:04 +01:00
Peter Boyle
ec68b67d5d
Attempt at unified GPU and CPU kernel
2019-06-03 14:55:51 +01:00
Peter Boyle
778450e0c8
Move to implementation subdir
2019-06-03 14:53:56 +01:00
Peter Boyle
567aa5f366
Move to implementation subdir
2019-06-03 14:53:33 +01:00
Peter Boyle
2ab7e2b175
Force instantiation in .cc files.
...
Eventually move into multiple files
2019-06-03 14:52:59 +01:00
Peter Boyle
6f61be044d
Dont instantiate in header
2019-06-03 14:52:01 +01:00
Peter Boyle
269e00509e
Don't instantiate in header
2019-06-03 14:51:24 +01:00
Peter Boyle
a5e90b0ddc
Making the kernels more GPU happy
2019-06-03 14:50:54 +01:00
Peter Boyle
5622faf226
pragma once ifdef guard
2019-06-03 14:50:26 +01:00
Peter Boyle
82ecd520c7
Macos happy fix under nvcc
2019-06-03 14:48:50 +01:00
620965781e
MSource::Convolution remove test code
2019-06-02 13:44:19 +01:00
9c18638b24
MSource::Convolution let mom argument be Nd dimensional
2019-06-02 13:41:39 +01:00
4bfe678218
MSource::Gauss Integer is unsigned...
2019-06-02 12:36:57 +01:00
fc6e584f2c
MSource::Gauss fix sign in exponent of normalization + use correct types
2019-06-02 11:52:05 +01:00
7c3f400fc5
MSource::Gauss add parameters tA and tB
2019-06-02 00:12:15 +01:00
4bca2c17ce
MSource::Convolution rename parameters
2019-06-02 00:04:07 +01:00
8d540a4e85
MSource::Gauss add mom parameter + avoid Cshifts
2019-06-01 23:56:14 +01:00
Michael Marshall
9ff459816f
ReadBinary needs to do case insensitive name comparison (since I changed the default case of perambulator column names)
2019-06-01 13:50:27 +01:00
Michael Marshall
eb737daeb5
Merge branch 'develop' into feature/distil
...
* develop: (34 commits)
Hadrons: EMLepton: Wall source
Revert "cleaning up Kl2 contraction"
cleaning up Kl2 contraction
posibility to save/load schedules directly from the application parameters
moving VERSION file to the empty ChangeLog one, this create compilation problems with #include <version> in recent versions of LLVM and case-insensitive FS (typically macOS)
Added precision tuning to Hadrons parameterfile writing
Kl2 QED cleanup
Added ZFIMPL to SeqGamma
Added ZFIMPL to SeqConserved module
F1 ensemble running with 96%~ acceptance etc..
Make detection of HPE 8600 automatic
Added variables that were missing from wall source setup
Exposed a coulomb/landau enum to the gauge fixing module
Coulomb gauge added as an option
More logging, timing, and 4d/5d logic for eigpack gauge transforms
Added gauge transform option to eigpack IO
Hadrons: Lepton Propagator for kl2, sign swap for antiperiodic boundary
A2A Lepton-Meson Field contraction
Verbose
Iteratoin range fix
...
2019-05-31 18:20:43 +01:00
b120ef1fe4
Merge pull request #217 from guelpers/feature/EMlepwall
...
Hadrons: EMLepton: Wall source
2019-05-30 11:13:27 +02:00
166feb6483
Hadrons: EMLepton: Wall source
2019-05-30 10:07:08 +01:00
f569813b60
remove commented code
2019-05-29 17:07:07 +01:00
0190ada714
Merge branch 'develop' into feature/GaussianSmearing
2019-05-29 17:01:17 +01:00
de1a1dccb3
MSource::Gauss and MSource::Convolution: change LatticeComplex to ComplexField
2019-05-29 16:25:45 +01:00
0b3f40ce16
MSource::Convolution fix sign in Momentum
2019-05-29 16:06:10 +01:00
e35e8da111
Revert "cleaning up Kl2 contraction"
...
This reverts commit f244fed6ab .
2019-05-29 11:23:17 +02:00
6fdf93d695
move momentum phase from MSource::Gauss to MSource::Convolution
2019-05-28 17:26:55 +01:00
Peter Boyle
ffde81f22a
Nsimd() and coalesced support
2019-05-25 12:44:07 +01:00
Peter Boyle
d8098f1ecd
coalesced support
2019-05-25 12:43:31 +01:00
Peter Boyle
aca788cf4f
Move coalesced read into tensors
2019-05-25 12:43:00 +01:00
6064f96fde
MSource::Gauss remove superfluous comment
2019-05-24 20:18:37 +01:00
4e52e46a2c
MSource::Gauss fix missing factor
2019-05-24 20:16:09 +01:00
6b27369ade
MSource::Convolution use type PropagatorField
2019-05-24 16:07:08 +01:00
ab2e5f88cd
add fields as input (for scheduler)
2019-05-24 15:57:30 +01:00
f244fed6ab
cleaning up Kl2 contraction
2019-05-24 13:08:35 +01:00
9b3701ae27
posibility to save/load schedules directly from the application parameters
2019-05-24 13:08:20 +01:00
4ac27340b9
moving VERSION file to the empty ChangeLog one, this create compilation problems with #include <version> in recent versions of LLVM and case-insensitive FS (typically macOS)
2019-05-24 13:05:17 +01:00
c7c0a1065f
Merge pull request #214 from guelpers/feature/kl2QEDseq
...
Kl2 contraction with sequential propagators
2019-05-23 20:31:41 +01:00
80947130f9
Merge pull request #215 from fionnoh/develop
...
Added precision tuning to Hadrons parameterfile writing
2019-05-23 18:44:58 +01:00
fionnoh
0aee73ea6b
Added precision tuning to Hadrons parameterfile writing
2019-05-23 18:43:25 +01:00
e43d59045e
add option mom to MSource::Gauss
2019-05-23 17:33:32 +01:00
e553678599
add modules MSource::Gauss and MSource::Convolution
2019-05-23 16:38:13 +01:00
0290ee1f6d
Merge pull request #213 from fionnoh/develop
...
Added ZFIMPL to SeqConserved module
2019-05-23 13:46:02 +01:00
9a34edcf9f
Kl2 QED cleanup
2019-05-23 13:43:22 +01:00
fionnoh
246f10001e
Added ZFIMPL to SeqGamma
2019-05-23 12:42:40 +01:00
e675c6a48c
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-23 12:41:54 +01:00
fionnoh
a66d110b88
Added ZFIMPL to SeqConserved module
2019-05-23 11:49:54 +01:00
Peter Boyle
918e673078
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-05-22 09:57:02 +01:00
Peter Boyle
44b53c3ba2
F1 ensemble running with 96%~ acceptance etc..
2019-05-22 09:56:26 +01:00
Peter Boyle
2095c12eac
Make detection of HPE 8600 automatic
2019-05-22 09:54:21 +01:00
Peter Boyle
a0e9f3b0a0
Plan for GPU port
2019-05-20 09:46:19 +01:00
ae5ad986e2
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-05-19 14:35:46 +01:00
Peter Boyle
a9342c6ae5
Udpdate TODO afer gianluc marge
2019-05-18 22:58:25 +01:00
Peter Boyle
ee6f96d85c
Merge pull request #210 from grid-test-organisation/feature/gpu-port-develop
...
Cayley fermion functions for GPUs
2019-05-18 19:06:20 +01:00
Peter Boyle
77ca45ff49
Merge pull request #211 from fionnoh/develop
...
Enum for gaugefix and bug fix for wall source
2019-05-18 18:57:52 +01:00
Peter Boyle
4e9df9e93c
GPU patches
2019-05-18 17:43:11 +01:00
Peter Boyle
9fe68857a9
Runs multiGPU with coalesced access on tesseract
2019-05-18 17:42:41 +01:00
Peter Boyle
37336c9e0c
Allow compress to be either vector or scalar types
2019-05-18 17:41:13 +01:00
Peter Boyle
6c4da3bbc7
Stencil now runs with coalesced accesses
2019-05-18 17:40:35 +01:00
Peter Boyle
a584b16c4a
Adding a non-blocking kernel launch
2019-05-18 17:39:54 +01:00
fionnoh
dbd7f3f0fc
Added variables that were missing from wall source setup
2019-05-17 19:10:09 +01:00
fionnoh
d14512ee03
Exposed a coulomb/landau enum to the gauge fixing module
2019-05-17 19:01:52 +01:00
Peter Boyle
48b1c806ed
Coulomb gauge added as an option
2019-05-17 17:36:32 +01:00
Felix Erben
8ce7ebdca3
fixed contraction issue
2019-05-17 10:52:55 +01:00
Felix Erben
435653490e
fixed contraction issue
2019-05-17 10:50:15 +01:00
Michael Marshall
10a052d695
3 issues preventing compilation under clang. Marked these with FELIX_ISSUE and made minimal change to make compile (as fix not obvious)
2019-05-17 09:59:01 +01:00
Felix Erben
acd5a01b65
some work on baryons
2019-05-16 15:11:50 +01:00
0a8b6724ef
Merge pull request #209 from fionnoh/develop
...
Added gauge transform option to eigpack IO
2019-05-15 18:09:44 +02:00
fionnoh
ce102ac550
More logging, timing, and 4d/5d logic for eigpack gauge transforms
2019-05-15 14:31:25 +01:00
fionnoh
94accec311
Added gauge transform option to eigpack IO
2019-05-15 13:35:47 +01:00
gfilaci
1a82533d22
fix inner product with thrust reduction
2019-05-14 15:35:54 +01:00
Michael Marshall
ec7d96ce3b
Merge branch 'develop' into feature/distil
...
* develop:
Hadron WeakEye and A2ALoop bug fixes, and WWVVContraction bug fix
DiskVector: fix of memory bug triggering segfault when the cache is accessed following a certain pattern
MFermion::GaugeProp fix for 4d fields
2019-05-14 13:10:40 +01:00
gfilaci
e3c56fd9b3
CayleyZeroCounters before benchmark loop
2019-05-13 15:52:00 +01:00
gfilaci
955cc7790f
MooeeInvDag offloaded to GPU
2019-05-13 14:25:29 +01:00
gfilaci
1179123ac2
MooeeInv offloaded to GPU
2019-05-13 12:37:12 +01:00
d8512b03f8
Merge pull request #195 from nils-asmussen/fix_GaugeProp_4d
...
MFermion::GaugeProp fix for 4d fields
2019-05-12 21:31:18 +02:00
d90cf9d022
Merge pull request #207 from fionnoh/develop
...
Weak Hamiltonian and contraction bug fixes
2019-05-12 21:30:20 +02:00
79e930ba12
Hadrons: Lepton Propagator for kl2, sign swap for antiperiodic boundary
2019-05-10 12:46:18 +01:00
gfilaci
22e35c9ddd
M5Ddag offloaded to GPU
2019-05-10 12:23:39 +01:00
gfilaci
698b45e163
remove unused typedef
2019-05-09 11:19:39 +01:00
gfilaci
f1744b3f01
M5D offloaded to GPU
2019-05-09 11:17:55 +01:00
gfilaci
2b3c22f03d
bandwidth dependent on grid default precision
2019-05-08 12:01:11 +01:00
gfilaci
8423a05940
duplicate CayleyFermion5D for gpu
2019-05-08 11:51:37 +01:00
fionnoh
2acd8ece65
Hadron WeakEye and A2ALoop bug fixes, and WWVVContraction bug fix
2019-05-08 10:57:36 +01:00
fionnoh
b638509c61
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-05-08 10:51:04 +01:00
Michael Marshall
c16916cc45
Multiple local slice fixes
2019-05-06 10:35:42 +01:00
Michael Marshall
a865caf0d2
Forgot a const in IndexName only version of NamedTensor constructor
2019-05-03 22:17:25 +01:00
Michael Marshall
9ae4d369f3
Use the definition of the Perambulator Index names given in Hadrons::MDistil
2019-05-03 22:00:50 +01:00
edeb590818
DiskVector: fix of memory bug triggering segfault when the cache is accessed following a certain pattern
2019-05-03 17:09:47 +01:00
Michael Marshall
ec24a1f828
Fixed 2 bugs in LapEvec: 1) InsertLocalSlice 2) ensure convergence assertion stops entire machine
2019-05-03 16:03:56 +01:00
Michael Marshall
0efe63f6fa
3D smearing fix
2019-05-02 19:37:59 +01:00
Michael Marshall
b7ead6c16a
Fixed bug: iff stout smearing disabled then gauge field uninitialised
2019-05-02 18:20:49 +01:00
gfilaci
d9438627d9
M5D benchmark without vector copy overhead
2019-05-02 11:10:57 +01:00
gfilaci
b23305dbe2
fix M5D flop count
2019-05-02 11:08:21 +01:00
gfilaci
d3b5c02e2d
measure M5D bandwidth and fix M5D flop count
2019-05-02 11:02:39 +01:00
gfilaci
8b6541fb60
Fix gpu MultRealPart and MaddRealPart bug
2019-05-02 10:58:17 +01:00
gfilaci
6da9aa9971
replace std::vector with Vector in benchmark
2019-05-02 10:56:22 +01:00
gfilaci
44e0360b97
replace std::vector with Vector
2019-05-02 10:55:36 +01:00
gfilaci
9003c4a07c
allocator copy constructor (to be fixed)
2019-05-02 10:53:37 +01:00
gfilaci
b52fa38f8c
seed initialisation of RNG5
2019-05-02 10:36:09 +01:00
gfilaci
3f1c4d8789
fix comment hash
2019-05-02 10:24:36 +01:00
Michael Marshall
62692b68b9
I'd forgotten that Intel '17 doesn't like auto var{value}; syntax
2019-05-01 20:45:16 +01:00
Michael Marshall
311c35a15c
Looking for fixes for Intel '17 compiler errors. std::cout << complex number ?
2019-05-01 18:22:08 +01:00
Michael Marshall
a3fe57f430
NamedTensor writes to tag NamedTensor by default (not filename) - so still usable in case user renames file.
...
Also tweaked tensor index name checking (which is used to ensure tensor is correct type)
2019-05-01 18:11:37 +01:00
Michael Marshall
8dc0587621
Post Michael / Felix review. Ready for Peter / Antonin review
2019-05-01 13:04:51 +01:00
Michael Marshall
cfe5fa7a35
1) Don't write Laplacian eigenvectors to disk 2) Add a test that loads perambulators from disk
2019-05-01 09:50:23 +01:00
Michael Marshall
e72e26c899
Get rid of unnecessary multiFile options
2019-05-01 08:53:08 +01:00
Michael Marshall
334f29becb
Fairly close to ready for release. Felix and I to review, then submit for release
2019-04-30 23:53:57 +01:00
Michael Marshall
e56ead55ef
WIP
2019-04-30 14:41:48 +01:00
4f0631615f
A2A Lepton-Meson Field contraction
2019-04-30 12:04:59 +01:00
Michael Marshall
d74d443d1b
Pre-release cleanup in progress
2019-04-29 22:18:29 +01:00
Michael Marshall
4203105104
Part-way through release tidy-up
2019-04-29 18:40:38 +01:00
c2cd0e15d7
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
...
Conflicts:
Grid/qcd/action/fermion/DomainWallFermion.h
Grid/qcd/action/fermion/FermionOperator.h
2019-04-29 12:07:20 +01:00
Michael Marshall
ac19c0e04f
This will need to be removed eventually, but should save us fiddling about with each release
2019-04-29 09:20:08 +01:00
Michael Marshall
b48ca8a6ef
Merge branch 'develop' into feature/distil
...
* develop: (36 commits)
Mobius 2+1f sign off.
Integrator logging on by default
RHMC for mobius
HMC make file
Update
Simple check
Simple checks
Monius HMC
Changes locally
Power method
Momentum rescaling
Bounds checking
Bounds checking
Scale momentum convention to CPS/UKQCD MD time
Add bounds checking
Updated documentation after Peter's review. 1) Removed version numbers from Grid dependencies 2) Explained in a little more detail how to use Xcode to build Grid and Hadrons libraries
Remove bundled Eigen stuff
Fix typo so it matches develop
Remove bundled source from my local repository
Slightly generalize interface to SchurRedBlackBase and derived solver classes so we can pass forecasted initial guesses in EOFA heatbath correctly
...
2019-04-29 08:37:39 +01:00
Michael Marshall
c48ae4f3ad
1) Only the boss should write the perambulator - possibly was a source of intermittent corruption?
...
2) Implemented and test a perambulator conversion utility in Test_distil (commented out near the start of main)
2019-04-28 23:24:57 +01:00
Michael Marshall
fb74de0798
Making sure Hdf5 is an optional dependency (default to binary writer if not present)
2019-04-28 20:23:44 +01:00
Michael Marshall
adc1eaee68
Switched to Hdf5 format for perambulators. Ready for first test on Tesseract.
2019-04-28 17:53:42 +01:00
Peter Boyle
60330e05a3
NVCC wacky compiler options frozen. Possibly Cuda 9.2 specific
2019-04-28 07:39:33 +01:00
Peter Boyle
f9b8c0cccf
Vector changes for UVM
2019-04-28 07:38:57 +01:00
Peter Boyle
3cad67e569
Compile on tesseract
2019-04-28 07:38:09 +01:00
Peter Boyle
170ba4e619
Ensure different MPI ranks use different GPUs. The mapping works on Tesseract.
2019-04-28 07:32:30 +01:00
Peter Boyle
204a090497
Inner product is not working on GPU. Why?
2019-04-28 07:31:56 +01:00
Peter Boyle
3c717c47ef
GPU no compile on Wilson Multigrid fixed
2019-04-28 07:31:19 +01:00
Michael Marshall
5aca4e8670
Just realised that the trace is at every lattice site, so moved the check for no smearing further up
2019-04-26 17:23:18 +01:00
Michael Marshall
e223d0b99f
Need to validate range about which exp^iQ is considered unity
2019-04-26 16:00:35 +01:00
Michael Marshall
2e220456d3
First attempt at minimising smearing
2019-04-26 15:54:05 +01:00
Felix Erben
4333d97958
fixed parameter
2019-04-26 14:29:21 +01:00
Felix Erben
55c9c45d4b
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-04-26 14:28:01 +01:00
Felix Erben
e70e03f560
started stout smearing for small w
2019-04-26 14:27:40 +01:00
Michael Marshall
ff5e2e0f47
Debug output fix. Meant to print the rho matrix for stout smearing ... not the address of the function that creates it
2019-04-26 12:30:41 +01:00
Michael Marshall
4f3d1ea6e8
Two heads are better than one. Combined effort and hopefully spatial smearing now fixed!
2019-04-26 12:18:11 +01:00
Author Name
b1768ba820
Urgh!
2019-04-26 10:04:27 +01:00
Michael Marshall
3ac5a69a57
Ready to test spatial smearing (again)
2019-04-26 08:54:30 +01:00
Michael Marshall
50a74eaea3
Doesn't compile. Does it still need to be maintained?
2019-04-26 08:33:10 +01:00
Michael Marshall
8419fbb335
Renamed PerambLight module. Check with Felix whether Test_24 and Test_tesseract still need to be maintained
2019-04-26 08:23:15 +01:00
Michael Marshall
23a9b93cda
More dependencies for Distil.hpp move and (C) 2019 only
2019-04-26 07:39:05 +01:00
Michael Marshall
ecdc3ddebf
Moved Distil.hpp and added GNU license to all files
2019-04-26 07:24:56 +01:00
fionnoh
df41de4cb6
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-24 12:02:50 +01:00
Peter Boyle
6d0b985697
Verbose
2019-04-24 06:29:52 +01:00
Peter Boyle
94ebcf551c
Iteratoin range fix
2019-04-24 06:28:14 +01:00
Peter Boyle
6fd4b0be91
Evolving HMC status
2019-04-23 21:54:45 +01:00
Peter Boyle
7894ea6263
Now have mixed precision solves in the 2f sector
2019-04-23 21:54:19 +01:00
Peter Boyle
73d4676997
Action and Deriv solvers allowed to differ
2019-04-23 21:53:44 +01:00
Peter Boyle
262a73c964
COmment improvement
2019-04-23 21:52:58 +01:00
Peter Boyle
5921b1d2b9
Layout/whitespace changes
2019-04-23 21:52:33 +01:00
Peter Boyle
6505efcb57
Set iteration count if guess is already good
2019-04-23 21:51:57 +01:00
Peter Boyle
b595f58e4c
Allow HMC to acces matrix
2019-04-23 21:51:23 +01:00
Peter Boyle
b0de7ab7db
Extra do nothing guesser
2019-04-23 21:50:45 +01:00
Peter Boyle
e1124d9572
Integrator verbosity updates
2019-04-23 21:50:15 +01:00
Michael Marshall
606698511c
Seems we've not been keeping the test up-to-date
2019-04-22 19:03:24 +01:00
Michael Marshall
a97b814f0c
Remove redundancy in LapEvec filename
2019-04-19 14:09:36 +01:00
Michael Marshall
7214681e11
Spatial smearing doesn't work yet. Fixed inconsistency in naming of perambulator in PerambLight.hpp
2019-04-19 13:54:25 +01:00
Michael Marshall
143b75956c
Stout smearing 3D fixes. Changed LapEvec to perform spatial smearing only
2019-04-19 11:54:02 +01:00
Peter Boyle
d416156c16
Mobius 2+1f sign off.
2019-04-19 07:57:08 +01:00
Peter Boyle
cd8d939a1a
Integrator logging on by default
2019-04-19 07:54:17 +01:00
Peter Boyle
760cfe294c
RHMC for mobius
2019-04-19 07:53:54 +01:00
Felix Erben
4a4203c610
fixed stout smearing for now
2019-04-18 19:10:49 +01:00
Felix Erben
2b598294c9
added distil source module
2019-04-18 17:47:09 +01:00
Peter Boyle
13eaf21b5c
HMC make file
2019-04-18 11:53:26 +01:00
Peter Boyle
1403ab231b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-18 11:06:02 +01:00
Peter Boyle
0368fbcde8
Update
2019-04-18 11:05:53 +01:00
Peter Boyle
2dd0ec7862
Merge pull request #186 from djm2131/feature/eofa-bug-fixes
...
Merge feature/eofa-bug-fixes into develop
2019-04-17 14:54:06 +01:00
Peter Boyle
f4241e59ba
Merge pull request #200 from mmphys/feature/XcodeDoc
...
Updated documentation after Peter's review.
2019-04-17 14:51:19 +01:00
Peter Boyle
26b1d2df2d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-17 12:08:06 +01:00
Peter Boyle
bc14e86812
Simple check
2019-04-17 12:07:42 +01:00
Peter Boyle
780a67844e
Simple checks
2019-04-17 12:07:17 +01:00
Peter Boyle
8b7805200f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-17 12:05:09 +01:00
Peter Boyle
2871dec6c0
Monius HMC
2019-04-17 12:04:57 +01:00
Peter Boyle
abde12433e
Changes locally
2019-04-17 12:03:20 +01:00
Peter Boyle
1f88ba4e39
Power method
2019-04-17 12:03:05 +01:00
Peter Boyle
ea5b3ed8a2
Momentum rescaling
2019-04-17 12:01:06 +01:00
Peter Boyle
a104115c7d
Bounds checking
2019-04-17 11:56:46 +01:00
Peter Boyle
b899042d81
Bounds checking
2019-04-17 11:55:43 +01:00
Peter Boyle
3e712fe643
Scale momentum convention to CPS/UKQCD MD time
2019-04-17 11:54:17 +01:00
Peter Boyle
f4723e07c5
Add bounds checking
2019-04-17 11:52:23 +01:00
fionnoh
9ed2d02bb2
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-12 12:11:06 +01:00
Michael Marshall
d111c70c38
Merge branch 'develop' into feature/distil
...
* develop:
Make sure Grid::Serializable can write Eigen Tensors to output streams. NB: 1) The Eigen package defines operator<< for Eigen tensors, but this format is different, hence Grid::Serializable::WriteMember 2) For simplification, the contents are written in memory order. I.e. Different results will be obtained depending on whether the tensor is row- or column-major
... this time without the new Distillation modules ...
Eigen tensor serialisation fixes after Antonin's review
Iterator added. Will wait for review comments before finalising.
Fix build with Intel '17 compiler, i.e. workaround incorrect auto types for c++ style definitions. E.g. assuming T::rank is an int, then objects defined like so: const auto rank{T::rank}; should also be int. Unfortunately, Intel '17 instead defines them to be std::initializer_list<int>, then proceeds to complain where these variables are used that they cannot be converted to int. NB: This was fixed under Intel '18
Pushed paboyle's changes: Updates for clang happy
Merge paboyle's no compile in single precision Intel 2019 fix
Eigen::Tensor serialisation. Tested on single and double precision builds
2019-04-10 13:14:24 +01:00
50d016340c
Merge pull request #190 from mmphys/feature/distil-checkin
...
Eigen::Tensor serialisation. Tested on single and double precision builds
2019-04-10 12:49:06 +01:00
Michael Marshall
f7b4fd0f69
Make sure Grid::Serializable can write Eigen Tensors to output streams. NB:
...
1) The Eigen package defines operator<< for Eigen tensors, but this format is different, hence Grid::Serializable::WriteMember
2) For simplification, the contents are written in memory order. I.e. Different results will be obtained depending on whether the tensor is row- or column-major
2019-04-06 15:40:23 +01:00
Michael Marshall
ed2427d5f7
Make sure Grid::Serializable can write Eigen Tensors to output streams. NB:
...
1) The Eigen package defines operator<< for Eigen tensors, but this format is different, hence Grid::Serializable::WriteMember
2) For simplification, the contents are written in memory order. I.e. Different results will be obtained depending on whether the tensor is row- or column-major
2019-04-06 15:37:53 +01:00
Michael Marshall
1f1aa92f14
Updated documentation after Peter's review.
...
1) Removed version numbers from Grid dependencies
2) Explained in a little more detail how to use Xcode to build Grid and Hadrons libraries
2019-04-06 13:42:39 +01:00
Michael Marshall
ea2f34de7b
Updated documentation after Peter's review.
...
1) Removed version numbers from Grid dependencies
2) Explained in a little more detail how to use Xcode to build Grid and Hadrons libraries
2019-04-06 13:37:47 +01:00
00963a7499
twist and boundary conditions for free propagator
2019-04-05 10:08:27 +01:00
Michael Marshall
63dc0fa7e9
Fixed memory leak ... without breaking semantics of prior code. Possibly should change the semantics? For Peter / Antonin to comment
2019-04-04 16:00:17 +01:00
Felix Erben
5e6104e683
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-04-04 12:13:35 +01:00
Felix Erben
25e4ee3a49
3D Stout smearing added
2019-04-04 12:13:16 +01:00
Michael Marshall
82a77f9960
... this time without the new Distillation modules ...
2019-04-03 23:02:26 +01:00
Michael Marshall
00b4139c16
Eigen tensor serialisation fixes after Antonin's review
2019-04-03 22:48:07 +01:00
Michael Marshall
4161429dcc
Serialisation fixes after Antonin's review
2019-04-03 22:30:07 +01:00
fionnoh
3e9c757b3b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-03 16:56:48 +01:00
Michael Marshall
b5eb97206b
Merge branch 'develop' into feature/distil
...
* develop:
MGauge::GaugeFix use standard convention for fields
fix bug: MGauge::GaugeFix should not modify its input
add gauge transformation matrix as output to module MGauge/GaugeFix
2019-04-03 16:24:49 +01:00
ecf736e6bf
Merge pull request #193 from nils-asmussen/fix_GaugeFix_inputmod
...
Fixes #192 and adds gauge transformation matrix as output.
2019-04-02 18:28:50 +01:00
fionnoh
f22ab5e1bc
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-04-02 11:29:59 +01:00
72f959c0b8
MGauge::GaugeFix use standard convention for fields
2019-03-29 16:51:21 +00:00
63001d3fa6
fix bug: MGauge::GaugeFix should not modify its input
2019-03-29 16:51:11 +00:00
b1d3d1f1a9
add gauge transformation matrix as output to module MGauge/GaugeFix
2019-03-29 16:51:00 +00:00
c2250fa124
MFermion::GaugeProp fix for 4d fields
2019-03-29 16:36:56 +00:00
84940fbdf0
Hadrons: Lepton Propagator for kl2
2019-03-28 10:15:09 +00:00
Michael Marshall
0da906cf66
Merge branch 'develop' into feature/distil
...
* develop:
Documentation for using Grid with Xcode on Mac OS
2019-03-27 23:08:29 +00:00
Peter Boyle
0a270b3e93
Merge pull request #191 from mmphys/GridXcode
...
Documentation for using Grid with Xcode on Mac OS
2019-03-27 22:53:35 +00:00
Michael Marshall
6536bed8a4
Documentation for using Grid with Xcode on Mac OS
2019-03-27 20:51:20 +00:00
Michael Marshall
3decb5f886
Merge branch 'develop' of github.com:paboyle/Grid into feature/distil
...
* 'develop' of github.com:paboyle/Grid:
endianness fix in resilient IO
2019-03-27 20:39:23 +00:00
Michael Marshall
faa8bb9bc6
Fixed funny memory leak
2019-03-27 17:55:52 +00:00
Michael Marshall
4c02ed6d0c
Updated GridXcode documentation
2019-03-27 13:54:39 +00:00
ferben
f757b80e1c
tried to fix mem leak
2019-03-27 12:00:36 +00:00
ferben
b8581be1da
:
2019-03-27 11:59:06 +00:00
79160011a1
endianness fix in resilient IO
2019-03-26 16:06:13 +00:00
Michael Marshall
9fce1263be
Fixed bug in LapEvec if machine running spread-out in time
2019-03-26 13:24:39 +00:00
Michael Marshall
ae565b006a
Compiling in single-precision now works
2019-03-25 22:56:01 +00:00
Michael Marshall
8502660023
Begin fixes for single precision
2019-03-25 20:40:05 +00:00
Michael Marshall
47f5b1e2b5
Iterator added. Will wait for review comments before finalising.
2019-03-25 18:19:55 +00:00
Michael Marshall
625a97a466
cosmetic
2019-03-25 18:16:04 +00:00
Felix Erben
bce2766fef
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-03-25 16:38:42 +00:00
Felix Erben
ce501afec6
bugfix
2019-03-25 16:38:25 +00:00
Michael Marshall
1d10a3b3de
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
...
* 'feature/distil' of github.com:mmphys/Grid:
bugfix
2019-03-25 15:50:57 +00:00
Michael Marshall
d1e02f50ff
Added iterator for Eigen tensors
2019-03-25 15:50:29 +00:00
Felix Erben
48b03c4590
bugfix
2019-03-25 15:45:35 +00:00
ferben
b3b9e608e1
added new module for noises
2019-03-25 14:13:03 +00:00
Michael Marshall
4e87cbd400
Fix build with Intel '17 compiler, i.e. workaround incorrect auto types for c++ style definitions.
...
E.g. assuming T::rank is an int, then objects defined like so:
const auto rank{T::rank};
should also be int. Unfortunately, Intel '17 instead defines them to be std::initializer_list<int>, then proceeds to complain where these variables are used that they cannot be converted to int. NB: This was fixed under Intel '18
2019-03-23 09:28:41 +00:00
Michael Marshall
a381d34f37
Fix build with Intel '17 compiler, i.e. workaround incorrect auto types for c++ style definitions.
...
E.g. assuming T::rank is an int, then objects defined like so:
const auto rank{T::rank};
should also be int. Unfortunately, Intel '17 instead defines them to be std::initializer_list<int>, then proceeds to complain where these variables are used that they cannot be converted to int. NB: This was fixed under Intel '18
2019-03-23 09:24:15 +00:00
ferben
4fc045b563
added module to load perambulators from disk
2019-03-22 13:50:47 +00:00
ferben
fbf286b0e3
added Spin dilution
2019-03-22 13:30:11 +00:00
Michael Marshall
9dc3fe9922
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
...
* 'feature/distil' of github.com:mmphys/Grid:
modules list
2019-03-22 13:00:06 +00:00
Michael Marshall
f0c2108acf
Pushed paboyle's changes: Updates for clang happy
2019-03-22 12:59:14 +00:00
Felix Erben
6c9029fab7
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-03-22 12:41:56 +00:00
Felix Erben
8700dd4d0d
modules list
2019-03-22 12:41:53 +00:00
Michael Marshall
9c16391e55
Merge branch 'develop' into feature/distil
...
* develop:
Updates for clang happy
2019-03-22 12:08:50 +00:00
Peter Boyle
93a5fc083f
Updates for clang happy
2019-03-22 11:39:22 +00:00
Michael Marshall
6d1de8ed2e
Merge paboyle's no compile in single precision Intel 2019 fix
2019-03-21 16:48:08 +00:00
Michael Marshall
685d9bafef
Merge branch 'develop' into feature/distil
...
* develop:
No compile in single precisoin Intel 2019 fix
2019-03-21 16:36:48 +00:00
Peter Boyle
116dde31eb
No compile in single precisoin Intel 2019 fix
2019-03-21 14:13:33 +00:00
Michael Marshall
d2d26b302d
Removed the module we don't need from modules.inc (so make now works)
...
i.e. removed Modules/MDistil/PerambMultipleSolves.hpp from Hadrons/modules.inc
2019-03-20 22:59:20 +00:00
Michael Marshall
12d8bf1ced
Eigen::Tensor serialisation. Tested on single and double precision builds
2019-03-20 22:27:41 +00:00
Michael Marshall
88cb004731
Fixed single-precision issues in Test_serialisation
2019-03-20 22:05:16 +00:00
ferben
a66bb8acba
fixed possible memory leak
2019-03-20 14:41:36 +00:00
ferben
4ae35000a9
removed module which we do not need
2019-03-20 13:36:57 +00:00
Michael Marshall
02b96b4602
Fixed module list (messed up when I merged from develop)
2019-03-20 11:20:40 +00:00
Michael Marshall
11dded61e8
Merge branch 'develop' into feature/distil
...
* develop: (29 commits)
precision fix
Updates after review with Peter.
Wilson clover multi grid for lime lattice
Recommendations for Traits classes
Hadrons: uninitialised pointer fix (might have been harmless)
Hadrons: beware of the nasty uninitialised twists
Smearing test. Test on free field.
Smearing for quark observables
Smearing
Hadrons: XML validator utility
display relative norm during field IO norm check
possibility to set a build number
IO norm check on relative norm
Output field norm check during IO
Hadrons: random vector utility module I/O
quieter initialisation
fix patch command for eigen in bootstrap.sh
Mres changes and gauge xform mat changes
Hadrons: 32 bit I/O directly in Lanczos module
Hadrons: copyright update
...
# Conflicts:
# Grid/tensors/Tensor_traits.h
# Hadrons/Modules.hpp
# Hadrons/modules.inc
2019-03-20 10:35:36 +00:00
d921a99b1a
precision fix
2019-03-19 17:07:40 +00:00
fionnoh
9790926cc5
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-03-19 16:35:48 +00:00
Michael Marshall
24cf3b9df5
Ignore Version.h as it's created by automake/autoconf
2019-03-19 12:12:39 +00:00
Michael Marshall
9c8aa2047d
Put GridXcode doc in subdirectory
2019-03-19 07:33:19 +00:00
Michael Marshall
204cfa1c5a
Added documentation for Grid using Xcode
2019-03-19 07:28:29 +00:00
Michael Marshall
fe6845d38b
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-03-18 14:44:18 +00:00
Peter Boyle
a6adb85a1b
Merge pull request #185 from mmphys/feature/trait-recommend
...
Recommendations for Traits classes
2019-03-18 12:22:42 +00:00
bff4eeec41
Added disclaimer on half-precision types
2019-03-18 12:15:25 +00:00
David Murphy
98b5b61fea
Remove bundled Eigen stuff
2019-03-15 19:44:28 -04:00
David Murphy
6896c57d7c
Fix typo so it matches develop
2019-03-15 19:34:36 -04:00
David Murphy
b3d480a978
Remove bundled source from my local repository
2019-03-15 19:17:03 -04:00
David Murphy
bb731c97d6
Slightly generalize interface to SchurRedBlackBase and derived solver classes so we can pass forecasted initial guesses in EOFA heatbath correctly
2019-03-15 19:10:56 -04:00
David Murphy
974003ae96
Fix sign convention of ExactOneFlavourRatioPseudoFermionAction::deriv() to match force conventions for Integrator class
2019-03-15 19:04:29 -04:00
David Murphy
93348775af
Resolved merge conflict
2019-03-15 19:01:37 -04:00
Felix Erben
d1fe4dce33
new idea to get multiple perambulators
2019-03-15 10:28:02 +00:00
ferben
50ca3101de
bug in multiSolves and new test prog
2019-03-13 17:25:55 +00:00
ferben
0faf40e207
last commit did not compile - fxied this
2019-03-13 13:24:18 +00:00
ferben
5313e44d11
some cleanup
2019-03-13 13:15:12 +00:00
ferben
6bb9b67c93
externalised gauge field reading to hadrons module
2019-03-13 12:09:12 +00:00
a0405c6d84
PerambMultipleSolves.hpp compiles (not had time to test)
2019-03-12 14:01:29 +00:00
ferben
c2a3231cdf
added testing module for multiple perambulators
2019-03-11 18:05:39 +00:00
ferben
5fb2ee89bb
modified test so that it runs
2019-03-08 16:50:21 +00:00
ferben
608a98d870
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-03-08 16:28:34 +00:00
ferben
2df396380d
solver is now external
2019-03-08 16:28:21 +00:00
Felix Erben
64ba664637
changed debug options
2019-03-08 12:25:00 +00:00
ferben
4a70b2ffd4
Aslash insertions work now?
2019-03-08 12:23:22 +00:00
2d659015ff
Serialisation is fully functional. Ready for review.
2019-03-08 00:30:43 +00:00
e63019ac50
Tensor serialisation is fully functional
2019-03-08 00:01:45 +00:00
Felix Erben
dde118fed9
added everythong to compute sequential aslash fields
2019-03-07 17:36:53 +00:00
Felix Erben
1538bf8c34
added everythong to compute sequential aslash fields
2019-03-07 17:36:22 +00:00
Felix Erben
4abc498ae3
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-03-07 15:34:10 +00:00
Felix Erben
93dfbfbfcd
added module to compute perambulator from a solve
2019-03-07 15:33:50 +00:00
f9e273d4bf
Making sure same as Traits-recommend
2019-03-07 14:33:04 +00:00
91cffef883
Updates after review with Peter.
2019-03-07 14:30:35 +00:00
584fa0a633
Changes after review with Peter
2019-03-07 12:53:34 +00:00
d3935ae7fc
Hadrons: some updates in WeakMesonDecayKl2
2019-03-06 15:27:59 +00:00
ferben
73cdca3973
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-03-06 13:55:51 +00:00
ferben
d716f8a0c9
new module for baryon contraction
2019-03-06 13:55:36 +00:00
aa24f04911
Changed EigenIO to use GridTypeMapper type traits
2019-03-06 12:55:05 +00:00
0b426bf9f6
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
...
Conflicts:
Hadrons/Modules.hpp
Hadrons/modules.inc
2019-03-06 11:28:59 +00:00
1880e6d12d
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-03-06 11:16:34 +00:00
4a00513e65
Moving Eigen trensor utilities to separate (optional) header
2019-03-06 11:16:22 +00:00
ferben
7718ee199a
efficient baryon test program
2019-03-05 17:16:42 +00:00
ferben
d7c7bff065
added output for source meson fields on all tsrc
2019-03-05 12:01:55 +00:00
ferben
802675f062
baryons should compile now...
2019-03-04 17:31:21 +00:00
Azusa Yamaguchi
acd25d0d01
Wilson clover multi grid for lime lattice
2019-03-04 11:30:15 +00:00
d56d8c923f
Replaced an error in A2AUtils.h that was stopping the build with an assert()
2019-03-02 00:36:53 +00:00
00c3c6fc54
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-03-02 00:24:47 +00:00
b3d4ba8657
Fixed issues with Eigen Tensor serialisation. Fixed issues with precision to text streams
2019-03-02 00:24:37 +00:00
Felix Erben
a4d578bd5d
baryons work now???
2019-03-01 14:44:39 +00:00
Felix Erben
7653649389
baryons working now
2019-03-01 12:57:41 +00:00
a344a2227e
Fixing build errors
2019-02-28 20:30:16 +00:00
b7db99967a
Recommendations for Traits classes
2019-02-28 20:06:59 +00:00
4b9200b35c
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-28 19:06:36 +00:00
91be028507
Still one issue on write
2019-02-28 19:06:25 +00:00
3b05f91f5c
Prototype for template traits recommendations
2019-02-28 19:04:44 +00:00
Felix Erben
8804271339
efficient baryons compile!
2019-02-28 16:32:40 +00:00
6d9f377913
added parity
2019-02-28 11:05:31 +00:00
18b603c5ae
simple but hopefully efficient baryon field
2019-02-28 10:27:05 +00:00
ferben
e9784572af
baryons...
2019-02-27 17:51:25 +00:00
b930eda69d
Merge branch 'develop' of github.com:paboyle/Grid into develop
2019-02-27 02:27:46 +00:00
7852181c2c
Hadrons: uninitialised pointer fix (might have been harmless)
2019-02-27 02:27:40 +00:00
bdf87bc994
Hadrons: beware of the nasty uninitialised twists
2019-02-27 02:27:09 +00:00
Felix Erben
f168a9e7ee
continued with baryons
2019-02-26 16:41:52 +00:00
Felix Erben
50b6db75da
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-02-26 15:57:09 +00:00
Felix Erben
df065f1d57
first test configs
2019-02-26 15:57:01 +00:00
Azusa Yamaguchi
136e7b2314
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-02-26 11:31:36 +00:00
Azusa Yamaguchi
1ea64b24fe
Smearing test. Test on free field.
2019-02-26 11:31:17 +00:00
Azusa Yamaguchi
8f661f6c05
Smearing for quark observables
2019-02-26 11:31:00 +00:00
Azusa Yamaguchi
ae9e248c95
Smearing
2019-02-26 11:29:12 +00:00
578eb177e7
Tweaked format and memory use on Xml format. Still crashes (out of memory) on large read on my laptop
2019-02-25 22:03:21 +00:00
81b3f3d2ca
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 15:39:07 +00:00
7c7ffa3b10
Added text read/write
2019-02-25 15:38:47 +00:00
ferben
1f098ceecf
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 15:36:23 +00:00
ferben
c47c1a2472
started working on baryons - this time efficiently
2019-02-25 15:36:11 +00:00
ec45b16840
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 14:10:34 +00:00
9288019789
Added Xml IO (has one deficiency: the format for multi-dimensional data is flat)
2019-02-25 14:10:24 +00:00
fionnoh
351ffe73cd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-02-25 14:06:09 +00:00
ferben
9c04139362
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 12:40:44 +00:00
ferben
cfc14a7432
more adjustments to test
2019-02-25 12:40:32 +00:00
31e40c26fa
Oops. Forgot to delete SortNode (prevented linking)
2019-02-25 11:35:33 +00:00
ferben
3f2fe5c7e7
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 11:18:34 +00:00
ferben
76b6e8a01e
first tesseract test
2019-02-25 11:18:25 +00:00
f9543982e4
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-25 11:07:43 +00:00
3c9f2d4106
Chunking layout reasonably efficient. Looks for small prime factors of each dimension, falling back to approximate size if needed.
2019-02-25 11:07:29 +00:00
Peter Boyle
6160795a43
dt^2 term comments
2019-02-24 15:23:20 +00:00
Peter Boyle
ded2d5c3ab
HMC directory
2019-02-24 15:22:57 +00:00
Peter Boyle
04255128ef
HMC directory
2019-02-24 15:22:17 +00:00
Peter Boyle
a9a3248cb5
More precision in rect force test
2019-02-24 15:21:19 +00:00
Peter Boyle
7c461dc664
Bounds checking plan setup
2019-02-24 15:19:48 +00:00
Peter Boyle
15fddde9bf
ConstEE override in Clover
2019-02-24 14:44:43 +00:00
Peter Boyle
048397d880
Default tau spacing should be longer c.f. Zbigniew Srocinsky thesis
2019-02-24 14:43:22 +00:00
Peter Boyle
196c9e4a4a
Better conformable check with message
2019-02-24 14:42:52 +00:00
Peter Boyle
6a0823718e
Make ConstEE except override in clover
2019-02-24 14:41:59 +00:00
Peter Boyle
22476cc5a3
Power method estimator of spectral range
2019-02-24 14:37:56 +00:00
cb16c96dc7
Hadrons: XML validator utility
2019-02-22 18:41:26 +00:00
ferben
cad26a736e
quick&dirty fix for g5*field
2019-02-22 17:05:16 +00:00
ferben
4f2ac433f1
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-22 16:31:26 +00:00
ferben
f9e505108b
test Aslash
2019-02-22 16:31:17 +00:00
Felix Erben
d2aced13da
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-02-22 16:30:40 +00:00
Felix Erben
03d031d623
tesserct test
2019-02-22 16:30:22 +00:00
44a2d4854a
Ensured Hdf5 chunk size always less than 4GB
2019-02-22 15:14:32 +00:00
292ff33f7f
Removed issue with std::string_literal
2019-02-21 16:51:05 +00:00
55886cf9db
ran make_module_list.sh
2019-02-21 16:14:13 +00:00
c640923159
Fixed reference to depth from test
2019-02-21 15:48:52 +00:00
752530f352
Gotten rid of c++17 in Test_serialisation.cc
2019-02-21 14:43:07 +00:00
34b9450fc9
Gotten rid of c++17
2019-02-21 14:22:48 +00:00
ferben
5d6462b706
bugfix
2019-02-21 11:13:10 +00:00
f70c5b004a
some cleanup in Baryon2pt
2019-02-20 12:56:13 +00:00
5bb9de9242
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-19 17:37:37 +00:00
982a24514b
Binary IO also implemented and tested
2019-02-19 17:37:21 +00:00
ferben
97c6f770b4
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-19 17:22:41 +00:00
ferben
4522f1e831
separated final 2pt Contraction
2019-02-19 17:22:30 +00:00
c14547ddbe
EigenIO writing rationalised. All indices (trivial or not) written
2019-02-19 16:12:55 +00:00
63c97db414
Prior to rationalising 2 versions of BaseIO::write (scalar and vector)
2019-02-19 13:29:08 +00:00
6ebb32ffbf
Rationalised Test_serialisation
2019-02-18 21:40:53 +00:00
07c97cb424
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-18 17:12:36 +00:00
04b58de5de
Read-back working.
2019-02-18 17:12:27 +00:00
ferben
6e822b7201
added sign for contraction sum
2019-02-18 15:21:13 +00:00
ferben
625ccfcd72
continued baryon contraction code
2019-02-18 13:10:34 +00:00
c77069244d
Nearly ready. Just finishing off readback and compare
2019-02-18 08:55:50 +00:00
9815ddb853
Started read routines. Introduced readMultiDim and tested I didnt break anything
2019-02-16 19:30:33 +00:00
74a3a5b825
Fixed existing bug in Hdf5Reader::readDefault for std::vector<U>
2019-02-16 18:45:46 +00:00
00e9416e0a
Tweak to initialisation example
2019-02-16 17:08:22 +00:00
b6803a070a
Making sure I understand row-major vs column-major ordering
2019-02-16 16:18:28 +00:00
ferben
bfd2770657
started on baryon flavour sums
2019-02-15 15:51:46 +00:00
ferben
668b1e77c7
small changes
2019-02-15 15:31:53 +00:00
ferben
e51744260f
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-15 14:32:26 +00:00
ferben
e0987d7d81
first contraction version done
2019-02-15 14:32:17 +00:00
26b94d7bda
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-15 13:53:00 +00:00
df0c8b5d84
Test of Eigen slices
2019-02-15 13:52:49 +00:00
ferben
a111d814db
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-15 10:47:41 +00:00
e8bd8767c0
Get rid of declarations inside constexpr functions. if constexpr warning remains
2019-02-15 10:06:15 +00:00
8cb96cb693
Hmmm lots of warnings depending on compiler ...
2019-02-14 19:17:12 +00:00
b9bee45277
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-14 19:05:47 +00:00
bee24655cd
Finalising traits
2019-02-14 19:05:35 +00:00
886c895f81
baryon field structure is now eigentensor - started on contractions for 2pt functions
2019-02-14 16:44:54 +00:00
e37614bde4
display relative norm during field IO norm check
2019-02-14 16:23:50 +00:00
042bad2ced
possibility to set a build number
2019-02-14 13:58:17 +00:00
59c8cc1588
Minor bugfix
2019-02-13 22:11:24 +00:00
11467a994d
Enough for tonight
2019-02-13 21:48:35 +00:00
ferben
9f2ca98dfc
enseble can now be specified in LapEvec
2019-02-13 13:54:31 +00:00
bf434b6bef
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-13 12:14:18 +00:00
41ff592515
Moved serialisation tests into Test_serialisation
2019-02-13 12:14:01 +00:00
ferben
48ec937c55
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-13 11:48:57 +00:00
ferben
65731546b7
merge...
2019-02-13 11:48:34 +00:00
76c6a6772a
Added rank_non_trivial
2019-02-12 22:15:55 +00:00
e7048231bc
Working version with additional Grid traits pre: review by Antonin
2019-02-12 13:59:48 +00:00
49babeab19
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-11 23:26:46 +00:00
fb2cb3015e
Writing of Eigen::Tensor of grid objects now works (for Hdf5)
2019-02-11 23:26:18 +00:00
ferben
53f45d2c7e
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-11 17:39:55 +00:00
ferben
d889cebc60
unique string is now used
2019-02-11 17:39:42 +00:00
9a225235b6
Can write both fixed and dynamic sized tensors (small tidy)
2019-02-11 17:15:38 +00:00
dff7d9261d
Can write both fixed and dynamic sized tensors
2019-02-11 15:47:40 +00:00
6f2663edf6
Serialisation of an object containing an Eigen::Tensor works for Hdf5. Still quite a lot of tidying up to do.
2019-02-10 23:19:20 +00:00
5bc0857412
IO norm check on relative norm
2019-02-10 22:12:47 +00:00
b540dc1cee
Output field norm check during IO
2019-02-10 21:41:17 +00:00
7672bb6434
Hadrons: random vector utility module I/O
2019-02-10 21:25:25 +00:00
f80c548365
quieter initialisation
2019-02-10 20:47:35 +00:00
d5024bd07e
Hdf5 writing of scalar (i.e. no Grid subtypes) Eigen::Tensor works. But issues when adding Eigen::Tensor to serialisable object.
2019-02-10 15:33:16 +00:00
9c4189484a
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-09 17:12:43 +00:00
3720103f41
Adding Eigen::Tensor still WIP
2019-02-09 17:12:36 +00:00
ferben
c4d27ee30f
added parity operator to baryon fields
2019-02-08 15:49:52 +00:00
ferben
d26a5dce12
bugfix
2019-02-08 14:37:09 +00:00
ferben
5843a943d9
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-08 14:36:08 +00:00
ferben
c1341b8ed2
bugfix
2019-02-08 14:33:06 +00:00
6a4515d0cd
baryons have now the correct (?) structure - also easier!
2019-02-07 12:27:57 +00:00
a0a39e4b00
Fixed initialisation of vector of Complex
2019-02-06 21:56:44 +00:00
b9fb16077c
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-06 21:37:54 +00:00
4b3c566c89
../tests/hadrons/Test_hadrons_distil.cc
2019-02-06 21:36:46 +00:00
Peter Boyle
c8bcee6e97
Merge pull request #183 from nils-asmussen/fix-eigen-patch
...
fix patch command for eigen in bootstrap.sh
2019-02-06 14:36:36 +00:00
ferben
cbd2dfe53f
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-06 12:56:49 +00:00
ferben
6cdb1eb62c
BContraction now computes what might be a baryon function, but probably isn't
2019-02-06 12:23:52 +00:00
6e0d43aef5
fix patch command for eigen in bootstrap.sh
2019-02-06 11:25:51 +00:00
ed7175076b
Turned off warning of unused variable line 150
2019-02-06 09:32:13 +00:00
27677b3870
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-06 09:25:39 +00:00
7423f5af1a
Examples of how to access Grid Tensors
2019-02-06 09:25:24 +00:00
Peter Boyle
c1257208e2
Mres changes and gauge xform mat changes
2019-02-05 23:43:00 +00:00
74c38822ed
Hadrons: 32 bit I/O directly in Lanczos module
2019-02-05 21:56:51 +00:00
318c64adc2
Hadrons: copyright update
2019-02-05 19:13:37 +00:00
d5b053f86f
Hadrons: 1 propagator loop construction now using A2A vectors
2019-02-05 19:12:38 +00:00
c60e50e3cb
Hadrons: copyright update
2019-02-05 18:55:24 +00:00
08d8b1d5fb
Hadrons: 4-quark eye 3-pt contractions
2019-02-05 18:53:20 +00:00
ferben
21d6dbe0b6
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-02-05 17:32:39 +00:00
ferben
1ee84509b5
added baryons project - not working yet
2019-02-05 17:32:26 +00:00
57e57d162f
Removed Eigen::DontAlign attribute
2019-02-05 12:50:28 +00:00
90d6d28547
Hadrons: non-eye weak 3pt fix
2019-02-05 11:35:10 +00:00
5b0870bb19
Added Scalar_ length and Scalar_Unit_Size to Perambulator file for validation
2019-02-05 09:07:05 +00:00
7f5354630a
Updated perambulator binary format to save payload in big endian format on disk
2019-02-04 23:07:59 +00:00
9c31305b8d
Hadrons: test cleaning
2019-02-04 21:26:25 +00:00
2eb584fdf0
Hadrons: 4-quark non-eye 3-pt contractions
2019-02-04 21:24:07 +00:00
6b46834af8
Hadrons: archiving unmaintained or exotic modules
2019-02-04 21:23:30 +00:00
3692c7f1ef
Hadrons: type alias cleaning and global correlator class (need to propagate)
2019-02-04 21:21:51 +00:00
0cf94587cd
array with all gammas for convenience
2019-02-04 21:20:16 +00:00
008ac6b5ae
Permabulator is read back from disk if it exists instead of being created
2019-02-04 12:06:32 +00:00
c7aa4e0c1f
Perambulator filename can be specified in xml. NB: Perambulator binary format now includes data size in bytes to avoid type mismatches.
2019-02-04 11:30:30 +00:00
43bd918a47
Logging tweak
2019-02-03 21:48:50 +00:00
7eda54bb87
Only write indices with dimesion!=1
2019-02-03 20:58:58 +00:00
bd75b843fa
Added checksum to data
2019-02-03 20:31:42 +00:00
8865bf5d7c
Implemented perambulator read/write ... but in binary format. Will switch to Hdf5 when I have Antonins feedback
2019-02-03 17:05:19 +00:00
ferben
caabbcd951
minor change
2019-02-01 17:50:18 +00:00
48528c5b1d
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
...
Added index names to Perambulator
2019-02-01 15:31:27 +00:00
f7b90a0c14
Added index names to perambulator
2019-02-01 15:20:35 +00:00
ferben
a9848becb0
unsmeared sinks can now be computed - new test program available
2019-02-01 13:23:42 +00:00
7cc13f48d5
added some TODO comments; needs discussion
2019-01-31 16:54:11 +00:00
b6b267fd4b
Fixed new test parameters
2019-01-31 15:11:12 +00:00
9671a61bb2
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-31 15:07:45 +00:00
d7dc617746
Switched perambulator to sue Eigen::Tensor (file write temporarily excluded)
2019-01-31 15:06:52 +00:00
Fionn O hOgain
68868c83ff
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2019-01-31 14:46:56 +00:00
32cb2e1a9a
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-31 13:01:31 +00:00
3d31113337
added test t5 to compute meson fields of different quarks. Different nvec are allowed.
2019-01-31 13:01:16 +00:00
48b6f7e6ad
Changed PerambLight<FIMPL> to PerambLight<GIMPL>
2019-01-31 12:37:00 +00:00
0da411fe60
LapEvec fixes
2019-01-31 12:28:38 +00:00
d7b9ed199d
PerambLight fixes
2019-01-31 12:24:32 +00:00
7e74f7bec4
tsrc != 0 now works
2019-01-31 11:35:05 +00:00
dae7b30b92
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-30 21:16:30 +00:00
f7e4661ca0
Fixed grid3d leak in PerambLight
2019-01-30 21:16:09 +00:00
ferben
7b66197534
meson fields are now the same
2019-01-30 18:03:34 +00:00
ferben
c3273eff20
agreement up to laph vectors
2019-01-30 11:20:22 +00:00
ferben
67a3d7aeed
added debug output, perambulators now agree up to 8 digits
2019-01-29 16:24:59 +00:00
ferben
d8831fe925
changed parameters to match Test_Distil
2019-01-29 13:40:26 +00:00
c7ceff6a21
Switched to Gauge field (GIMPL)
2019-01-28 12:28:35 +00:00
ferben
5580b3a7d1
bugfix in DistilVectors
2019-01-28 12:24:47 +00:00
33d8fb2dd9
Default
2019-01-25 19:21:12 +00:00
9f6f776460
ensured there is a default test to run
2019-01-25 19:14:22 +00:00
ferben
84fe36d084
meson functions work until to be saved
2019-01-25 17:26:43 +00:00
ferben
3438dde8df
test prog now computes everything up to meson fields
2019-01-25 15:19:18 +00:00
ferben
aea49bc349
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-25 13:44:30 +00:00
ferben
9ef6f9878e
test works up to perambulators now
2019-01-25 13:44:19 +00:00
708ca8585a
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-25 13:26:56 +00:00
d15bf4b8e1
Added trajectory number to output file
2019-01-25 13:26:48 +00:00
ferben
7496da0987
bugfix in prambLight
2019-01-25 13:08:56 +00:00
ferben
2568f5b925
bugfix in prambLight
2019-01-25 12:37:18 +00:00
577cdf1d72
Simplified tests
2019-01-24 18:50:18 +00:00
f92ed659a7
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-24 16:30:28 +00:00
dfb7fb1d9f
LapEvec test works on --grid 4.4.4.8
2019-01-24 16:30:13 +00:00
a4c1ab6147
all modules linked in test prog
2019-01-24 16:12:19 +00:00
cf85f0388d
Still debugging eigenvector parameters
2019-01-24 13:26:05 +00:00
00b0f75b0d
Eigenvectors created. Still need to correctly set parameters for test.
2019-01-24 12:44:06 +00:00
b45586e81c
Discovered bug root cause. setup() is called multiple times. Now ready to copy-paste the LapEvec code
2019-01-23 21:17:56 +00:00
2c7e6bf58b
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-23 15:20:06 +00:00
7c5a06f6d0
Trying to work out why LapEvec constructor not being called
2019-01-23 15:19:51 +00:00
ferben
068ef85b05
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-23 15:08:24 +00:00
ferben
a6ab742fdb
added perambs to test
2019-01-23 13:58:20 +00:00
2062a8d578
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-23 13:00:20 +00:00
3d3e8f4f9f
Structured objects passed into LapEvec
2019-01-23 12:59:55 +00:00
ferben
2756f16a5e
created test prog for perambs
2019-01-23 12:49:20 +00:00
ferben
d7908c33de
moved hard-coded parameters in DistilVectors to module input
2019-01-23 11:32:53 +00:00
ferben
4cc2ebc9e4
moved hard-coded parameters to module input
2019-01-23 11:26:07 +00:00
ferben
b8afa7314c
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-01-23 10:51:23 +00:00
ferben
be5605931c
merge
2019-01-23 10:51:09 +00:00
09fa821510
Added remaining methods to Permabulator
2019-01-22 17:59:55 +00:00
ferben
f45d2d5dcc
perambLight done, but SliceShare and Write does not work yet
2019-01-22 15:52:26 +00:00
ferben
0a82fae45c
moved perambulator definition to shared header file
2019-01-22 15:06:45 +00:00
ferben
46b05aa9c5
cleaned up, deleted commented out old code
2019-01-22 13:48:44 +00:00
ferben
813c1ab1f1
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-01-22 13:28:09 +00:00
ferben
b1c27a141d
DistilVectors complete and compiling - not tested at all!
2019-01-22 13:27:51 +00:00
81bb361299
Test program ready
2019-01-22 13:19:39 +00:00
ferben
79d533550d
continued on DistilVectors.hpp
2019-01-21 16:45:31 +00:00
ferben
b8c106f320
working on DistilVectors, initialisation done and compiles
2019-01-21 16:04:18 +00:00
b74492a805
Merge branch 'feature/distil' of github.com:mmphys/Grid into feature/distil
2019-01-21 10:40:01 +00:00
c93a43f158
Added test program
2019-01-21 10:39:28 +00:00
Felix Erben
0ff410ae19
copied perambulato code into PerambLight.hpp
2019-01-18 17:47:41 +00:00
Felix Erben
ced30b61e2
added phi vectors - still commented out and does not compile otherwise
2019-01-18 16:38:13 +00:00
Felix Erben
2b782df290
Merge branch 'feature/distil' of https://github.com/mmphys/Grid into feature/distil
2019-01-18 15:58:51 +00:00
Felix Erben
f0f1ba0307
uses evec4d now
2019-01-18 15:58:10 +00:00
2343e621e6
Bananas
2019-01-18 13:32:27 +00:00
Felix Erben
2568504821
small change
2019-01-18 13:23:03 +00:00
b821dde020
Initial version
2019-01-18 13:14:28 +00:00
ae3b053334
Initial version
2019-01-18 13:10:02 +00:00
Peter Boyle
9b6ddb6e54
Adding a norm of a general field check, so that for things other than gauge configs there is an analogue of plaquette norm.
...
Improve argument checking in the BinaryIO.h, as there looks to be some corruption issue intermittently on tesseract jobs.
Not clear where the root bug is.
2019-01-16 22:35:58 +00:00
Vera Guelpers
447b772136
Merge remote-tracking branch 'upstream/develop' into feature/kl2QED
2019-01-07 15:09:18 +00:00
Peter Boyle
c5e081d69c
Re-Merge branch 'develop' into feature/gpu-port
...
Pull in Regensburg MultiGrid pull request
2019-01-03 01:50:16 +00:00
Peter Boyle
535a6aaf05
Update todo list
2019-01-02 22:07:51 +00:00
Peter Boyle
91a7fe247b
Merge branch 'DanielRichtmann-feature/wilsonmg' into develop
2019-01-02 14:40:31 +00:00
Peter Boyle
8a1be021d3
Merge branch 'feature/wilsonmg' of https://github.com/DanielRichtmann/Grid into DanielRichtmann-feature/wilsonmg
2019-01-02 14:39:59 +00:00
Peter Boyle
e73b909a48
Make tests running past nvcc. Different NVCC versions proving tricky to keep happy. This is 9.2
2019-01-02 12:05:30 +00:00
Peter Boyle
a4d9200293
Fixing AVX 512 instantiation error. Need to move to extern templates urgently.
2019-01-02 00:27:07 +00:00
Peter Boyle
350508bdb3
pugixml problem
2019-01-01 16:38:54 +00:00
Peter Boyle
38852737e4
No compile fix on clang
2019-01-01 15:55:13 +00:00
Peter Boyle
802404c78c
Remove warnings under NVCC and move parallel_for to thread-loop
2019-01-01 15:08:09 +00:00
Peter Boyle
0e9b591c1c
NVCC warning suppression
2019-01-01 15:07:47 +00:00
Peter Boyle
c43a2b599a
GPU support
2019-01-01 15:07:29 +00:00
Peter Boyle
8c91e82ee8
GPU clean up, remove parallel_for. Split into accelerator_loop, thread_loop
...
cases, and collides with parallel_for in thrust
2019-01-01 15:06:46 +00:00
Peter Boyle
9d866d062a
GPU support improvements
2019-01-01 15:05:03 +00:00
Peter Boyle
3a4e397e72
Deprecating JSON, too hard to support under NVCC
2019-01-01 15:04:33 +00:00
Peter Boyle
2b6cfe555f
Disable JSON on NVCC. Maybe unsupport JSON full stop. XML and JSON is too many formats in my view.
2019-01-01 15:03:50 +00:00
Peter Boyle
7df58dd883
Photon syntax gave problems with NVCC
2019-01-01 15:03:29 +00:00
Peter Boyle
4bf86ae60a
NVCC clean up
2019-01-01 15:02:50 +00:00
Peter Boyle
07ee87ff5a
GPU happy. Still need to prevent hand kernels being callable under NVCC
2019-01-01 15:00:33 +00:00
Peter Boyle
0c2498fe2f
Explicit instantiation needed for NVCC
2019-01-01 13:55:12 +00:00
Peter Boyle
ad2e65dad5
GPU related updates
2019-01-01 13:54:40 +00:00
Peter Boyle
715babeac8
GPU reductions first cut; use thrust, non-reproducible. Inclusive scan can fix this if desired.
...
Local reduction to LatticeComplex and then further reduction.
2019-01-01 13:53:37 +00:00
Peter Boyle
3eae9a9e3f
update NVCC flags
2019-01-01 13:49:15 +00:00
Peter Boyle
186aad065f
Roll forward Eigen in attempt to make CUDA happy
2019-01-01 13:48:32 +00:00
Peter Boyle
bf5685eb11
Update todo list
2019-01-01 13:48:06 +00:00
Peter Boyle
4a96c067ae
Remove warnings from NVCC
2019-01-01 13:43:09 +00:00
Peter Boyle
ab063f33c0
Offload the linear combinations in CG
2019-01-01 13:42:13 +00:00
Peter Boyle
9efcc535bc
Cleaner drop from CUDA mode around Eigen includes. Remains difficult to let Eigen compile under nvcc with version issues.
2019-01-01 13:39:10 +00:00
Peter Boyle
231b61d012
std::array by default
2019-01-01 13:37:35 +00:00
Peter Boyle
e898f4f0b0
Whitespace
2019-01-01 13:36:55 +00:00
Peter Boyle
d5db5f5242
Wrong dimension used in a temporary
2018-12-20 10:49:45 +00:00
Peter Boyle
2fcedb13dd
Step size modification in HMC; ICC happy thread pragmas
2018-12-20 09:32:33 +00:00
Peter Boyle
35ed1defac
Passes make check now single and double compile
2018-12-19 11:09:32 +00:00
Peter Boyle
4e95accf80
Namespace fix
2018-12-15 21:46:17 +00:00
fd66325321
pure QED test and copyright update
2018-12-14 17:39:11 +00:00
c637c0c48c
James H.'s code for general size Wilson loops
2018-12-14 17:37:09 +00:00
c4b472176c
Photon code fix
2018-12-14 17:36:38 +00:00
Peter Boyle
422764757d
Updates in tests to make all of Grid compile
2018-12-14 16:55:54 +00:00
Vera Guelpers
943fa48ce4
Hadrons: Kl2 contraction using sequential propagators
2018-12-14 13:45:30 +00:00
Vera Guelpers
fa97a56fdd
Hadrons: sequential Aslash insertion on propagator
2018-12-14 12:40:26 +00:00
856476a890
big cleanup of the Photon class + QED Coulomb gauge
2018-12-13 21:52:38 +00:00
Peter Boyle
afc462bd58
Bracketing issue in macro
2018-12-13 10:53:22 +00:00
Peter Boyle
b57a4d32aa
Merge branch 'develop' into feature/gpu-port
2018-12-13 05:11:34 +00:00
c509bd3fe2
Merge branch 'feature/resilient-io' into develop
2018-12-01 12:57:43 +00:00
49b934310b
resilient I/O fix
2018-11-27 20:17:09 +00:00
01e8cf5017
Merge branch 'develop' into feature/resilient-io
2018-11-27 19:09:59 +00:00
12f4499502
HDF5 serialiser fix
2018-11-27 19:09:50 +00:00
05aec72887
Hadrons: application parameter for resilient I/O
2018-11-27 18:46:43 +00:00
136d3802cb
binary parallel IO can do read tests and eventually re-write in case of failure
2018-11-27 18:38:24 +00:00
a4c55406ed
checksummed HDF5 IO
2018-11-27 17:43:19 +00:00
c7f33ca2a8
Revert "Hadrons: A2A vector write can fail and retry"
...
This reverts commit 10fc263675 .
2018-11-27 17:27:26 +00:00
0e3035c51d
Revert "optional non-fatal checksum fail in Lime lattice read (with error codes)"
...
This reverts commit bccfd4cbb3 .
2018-11-27 17:27:20 +00:00
10fc263675
Hadrons: A2A vector write can fail and retry
2018-11-26 19:47:03 +00:00
bccfd4cbb3
optional non-fatal checksum fail in Lime lattice read (with error codes)
2018-11-26 19:45:51 +00:00
0b50d4a328
log time fix
2018-11-23 15:51:27 +00:00
fionnoh
b74940b3d4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-11-23 14:08:29 +00:00
e232257cb6
Hadrons: A2AAslashVector modul cleaning and renaming
2018-11-22 19:43:49 +00:00
09451b5e48
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-11-22 15:45:24 +00:00
6364aa8acf
Merge branch 'feature/contractor' into develop
2018-11-22 15:44:46 +00:00
b9e84ecab7
Hadrons: minor code cleaning
2018-11-22 15:44:30 +00:00
41032fef44
Optional RW mode for Hdf5Reader
2018-11-21 18:36:50 +00:00
d77bc88170
Optional support for faster CRC32C checksum through Intel IPP
2018-11-19 17:21:53 +00:00
494b3c9e57
Hadrons: contractor more IO fix
2018-11-19 16:26:53 +00:00
2ba19a9e07
Hadrons: contractor IO fix
2018-11-19 16:17:51 +00:00
5d7cc29eaf
Hadrons: contractor token @traj@ for trajectory number in input file
2018-11-19 16:04:01 +00:00
f22a27d7f9
Hadrons: contractor trajectory loop and file output
2018-11-19 15:45:04 +00:00
Peter Boyle
33a0bbb17b
Const correctness
2018-11-19 11:27:57 +00:00
f592ec8baa
Hadrons: contractor performance fix
2018-11-16 20:59:49 +00:00
8b007b5c24
Hadrons: remove the use of OpenMP reductions
2018-11-16 20:00:29 +00:00
fionnoh
17b3f47b1e
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-11-16 16:32:12 +00:00
9bb170576d
Merge pull request #177 from guelpers/develop
...
Hadrons module to electrify a gauge
2018-11-14 16:04:09 +00:00
Vera Guelpers
a7e3977b75
Merge remote-tracking branch 'upstream/develop' into develop
2018-11-13 14:56:23 +00:00
Vera Guelpers
995f20e45d
Hadrons: some renamings
2018-11-13 14:54:48 +00:00
Vera Guelpers
d058b4e681
Merge branch 'feature/seqA2A' into develop
2018-11-13 13:27:24 +00:00
8e0d2f3402
Hadrons: support for twisted boundary conditions
2018-11-12 17:16:18 +00:00
2ac57370f1
Hadrons: contractor translation average normalisation
2018-11-12 16:04:35 +00:00
344e832a4e
Hadrons: contractor faster transpose and finer timings
2018-11-12 15:59:54 +00:00
cfe281f1a4
Hadrons: diskvectors measure hash performance in debug output
2018-11-12 15:59:11 +00:00
f5422c7334
Hadrons: more contractor instrumentation
2018-11-09 16:23:53 +00:00
68c76a410d
Hadrons: more contractor improvements
2018-11-08 19:24:29 +00:00
69b6ba0a73
Hadrons: contractor fixes and improvements
2018-11-08 18:46:28 +00:00
65349b07a7
Hadrons: simpler A2A perf functions
2018-11-08 18:44:44 +00:00
7cd9914f0e
Hadrons: automatically resize output in MKL A2A matrix kernels
2018-11-08 17:40:57 +00:00
Peter Boyle
f3f24b3017
Optional Twisted BC's added, in "DoubleStore" for WilsonImpl.
...
Untested but doesn't affect answers when twists are all zero. The zero is the default behaviour
for ImplParams.
2018-11-08 12:55:25 +00:00
Vera Guelpers
8ef4657805
Merge remote-tracking branch 'upstream/develop' into feature/seqA2A
2018-11-08 09:00:06 +00:00
Vera Guelpers
78c1086f8b
Hadrons: sequential Aslash insertion and propagator on A2A vector
2018-11-08 08:58:09 +00:00
Peter Boyle
68c13045d6
Added a test for Felix and Michael to look at
2018-11-07 23:40:15 +00:00
Peter Boyle
e9b6f58fdc
Allow shrinking machine in orthog direction for extract slice local
2018-11-07 23:39:18 +00:00
Peter Boyle
839605c45c
Verbose reduce
2018-11-07 23:38:46 +00:00
1ff1422e07
Hadrons: contractor lighter output
2018-11-07 20:02:53 +00:00
32376f0437
Hadrons: contractor performances
2018-11-07 19:59:11 +00:00
0c6e581336
Hadrons: first stab at general contraction code, needs serious testing
2018-11-07 19:16:55 +00:00
Vera Guelpers
e0a79a5bbf
Hadrons: PR#177: Electrify gauge: Single Precision fix
2018-11-07 15:01:22 +00:00
Vera Guelpers
4c016cc1a4
Merge remote-tracking branch 'upstream/develop' into develop
2018-11-07 14:03:12 +00:00
Peter Boyle
2205b1e63e
Add CXX to grid-config
2018-11-07 13:32:46 +00:00
Peter Boyle
6f421c7a6f
Block solver in the SchurRedBlack plus timing report cleaner
2018-11-07 12:26:56 +00:00
Peter Boyle
b62b9ac214
Patch to broken assertion
2018-11-06 22:18:17 +00:00
88d9922e4f
Hadrons: fast A2A matrix contraction kernels
2018-11-06 19:49:09 +00:00
9734e3ee58
Hadrons: (somewhat) faster build
2018-11-06 19:47:41 +00:00
Peter Boyle
8c3a599148
Block solver test
2018-11-06 16:44:58 +00:00
Azusa Yamaguchi
4a47b11876
Block CG improvements to develop
2018-11-06 12:49:05 +00:00
Vera Guelpers
f1382cf81d
Merge remote-tracking branch 'upstream/develop' into develop
2018-11-06 10:29:52 +00:00
Vera Guelpers
85699daef2
Hadrons: Module to electrify a gauge field
2018-11-06 10:27:18 +00:00
1651111d18
Hadrons: final, portable form of the contractor benchmark
2018-11-05 21:29:13 +00:00
1ed4ea344d
Merge branch 'develop' into feature/contractor
2018-11-05 11:42:02 +00:00
8f514ae550
Hadrons: Lanczos 32bit IO
2018-11-05 11:41:10 +00:00
4a7415e83c
Hadrons: contractor benchmark update
2018-10-23 21:00:54 +01:00
0ffcfea724
Hadrons: contractor benchmark
2018-10-23 17:08:16 +01:00
febe41cc1d
Hadrons: improvement on PR #176
2018-10-23 12:48:15 +01:00
62173395b8
Merge pull request #176 from guelpers/develop
...
Hadrons: full volume noise source for A2A
2018-10-23 12:29:35 +01:00
b48611b80f
Merge branch 'develop' into feature/contractor
2018-10-22 18:27:18 +01:00
6b559d68aa
Hadrons: eigenpack converter can do test reads
2018-10-22 11:10:18 +01:00
1982cc58dd
Hadrons: A2A vectors I/O filename fix
2018-10-21 01:20:05 +01:00
2e2e5ce596
SciDAC I/O print data checksums
2018-10-19 20:36:32 +01:00
7d84dca8e9
Merge branch 'develop' into feature/contractor
2018-10-18 23:46:58 +01:00
2d3916418e
Hadrons: more precision fix
2018-10-18 23:45:13 +01:00
21304e2139
Hadrons: fix to allow single-prec build again
2018-10-18 19:58:50 +01:00
7b850eb48b
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-10-18 19:46:25 +01:00
a3ace57e01
Hadrons copyright update
2018-10-18 19:46:11 +01:00
b1c3cbe35e
Hadrons: A2A vectors I/O
2018-10-18 19:44:58 +01:00
f31d6bfec2
Hadrons: contractor cleaning and better error check
2018-10-18 17:50:35 +01:00
a7cfa26901
Hadrons: reverse A2A matrix load for better DiskVector cache reuse
2018-10-18 17:50:16 +01:00
f333f3e575
Hadrons: DiskVector save-on-eviction and faster CRC32 for Eigen matrices
2018-10-18 17:48:25 +01:00
2b4e253473
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-10-17 20:28:20 +01:00
0ba3d469c7
Benchmark IO in single and double precision
2018-10-17 20:27:34 +01:00
f709329d96
Hadrons: first version of a contractor utility
2018-10-17 20:26:48 +01:00
f05b25dae4
Hadrons: A2AMatrix load
2018-10-17 20:26:26 +01:00
3e1d268fa3
Hadrons: DiskVector optimisation
2018-10-17 20:25:32 +01:00
Vera Guelpers
109c74bed8
Hadrons: full volume noise source for A2A
2018-10-16 14:56:12 +01:00
3023287fd9
Hadrons: 3-index RO access to Eigen disk vector
2018-10-16 14:44:14 +01:00
b3d6805638
Merge branch 'feature/contractor' into develop
2018-10-16 11:29:37 +01:00
291bc2a1f0
IO benchmark on a list of directories
2018-10-15 17:25:08 +01:00
2f368c33fc
Hadrons: copyright update
2018-10-15 15:51:45 +01:00
9592115341
Hadrons: NPR and gauge fixing linking fix
2018-10-15 15:49:42 +01:00
Peter Boyle
24c07694bc
Mixed precision now supported in MADWF
2018-10-14 00:22:52 +01:00
Peter Boyle
f0229025e2
MADWF working across a range of actions
2018-10-13 19:55:03 +01:00
Peter Boyle
6de9a45a09
NPR first cut by Julia Kettle
2018-10-12 11:00:58 +01:00
Peter Boyle
03c3d495a2
First cut (non functional NPR code) developed by Julia Kettle
2018-10-12 10:59:33 +01:00
Peter Boyle
49f25e08e8
PauliVillars based 4D -> 5D reconstruction with Fourier Accelerated PV inverse
...
by Christoph. Differs from the one by Rudy in BFM since it vectorises the twisted
4D solves in pairs.
2018-10-11 12:35:32 +01:00
efc0c65056
Hadrons: DiskVector Eigen specialisation with binary I/O and sha256 correctness check
2018-10-08 19:02:00 +01:00
936eaac8e1
function to get the sha256 string
2018-10-08 19:00:50 +01:00
fe6a372f75
Hadrons: fixes and cleaning in the scalar SU(N) part
2018-10-08 15:14:08 +01:00
fionnoh
dac9f8622e
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-10-08 10:12:11 +01:00
148fc052bd
Hadrons: Aslash field, tested
2018-10-05 21:04:10 +01:00
c073341a10
Hadrons: more cleaning
2018-10-05 19:50:41 +01:00
78299daaac
Hadrons: code cleaning
2018-10-05 16:47:52 +01:00
866449c804
Hadrons: integration of Peter's A2Autils
2018-10-05 16:42:44 +01:00
d69a52079f
Merge remote-tracking branch 'gh/feature/a2a-integration' into feature/aslashfield
2018-10-05 15:39:09 +01:00
9f4f8a14a3
Hadrons: code cleaning
2018-10-05 15:38:01 +01:00
f6593dc881
Hadrons: A2A block performance counter fix
2018-10-05 15:11:01 +01:00
Peter Boyle
b46d31d4b6
MKL enable on Eigen if Grid is configured to use MKL
2018-10-05 11:29:40 +01:00
58567fc650
Hadrons: big update abstracting the block meson field routine, tested & working, performance counters broken and code dirty
2018-10-04 20:01:49 +01:00
Peter Boyle
7c57cac670
Adding A2A utils class for containing kernels.
2018-10-04 18:57:41 +01:00
d0b21bf1ff
Merge branch 'feature/eigenpack-convert' into develop
2018-10-04 18:26:45 +01:00
a1825d1f59
Hadrons: final fix for multiprec eigenpacks
2018-10-04 18:25:26 +01:00
5a3e83ff7b
Hadrons: new layer in eigenpacks class hierarchy
2018-10-03 14:45:01 +01:00
52569d98d8
Hadrons: multiprec eigenpack I/O fix
2018-10-03 14:24:43 +01:00
b351103c29
Hadrons: eigenpack load module with 32bit I/O
2018-10-02 21:07:56 +01:00
118cca4681
Hadrons: linking fix
2018-10-02 20:08:49 +01:00
44de727cd2
Hadrons: eigenpack support for multiprecision I/O
2018-10-02 19:51:09 +01:00
888ebc3cf9
Hadrons: better name for the EP converter
2018-10-02 15:22:18 +01:00
6c031a1b81
Merge branch 'feature/eigenpack-convert' into develop
2018-10-02 14:57:30 +01:00
02aa4bd762
Hadrons: cleaner eigenpack convert log
2018-10-02 13:43:25 +01:00
9aafa8ee60
Hadrons: eigenpack converter generalised for RB/5d grids
2018-10-02 13:34:17 +01:00
430b98b354
fix previous commit
2018-10-02 13:12:46 +01:00
84189867ef
Hadrons: eigenpack converter with RB grids (to be generalised)
2018-10-02 13:05:05 +01:00
4ab8cfbe2a
Hadrons: more verbose eigenpack convert
2018-10-02 12:24:45 +01:00
aadd9f4468
Eigenpack converter, to be tested, HadronsXmlRun moved to Utilities directory
2018-10-02 00:02:34 +01:00
8fbb27ce13
Hadrons: less code duplication in eigenpack IO
2018-10-01 20:15:21 +01:00
21bba95909
Hadrons: eigenpack metadata is no ignored anymore when reading
2018-10-01 19:33:45 +01:00
6448fe7121
More flexible XML control in Lime files
2018-10-01 19:32:50 +01:00
2458a11d1d
Hadrons: precision cast module
2018-09-29 18:00:08 +01:00
d0ca7c3fe6
Hadrons: big update for getGrid, grids are now created automatically
2018-09-29 17:55:19 +01:00
57f899d79c
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-09-29 15:50:59 +01:00
e881a0c157
Merge commit 'beed527ea37c90fd5e19b82d326eb8adc8eba5ff' into develop
2018-09-29 15:50:21 +01:00
f411657118
JSON update
2018-09-29 15:48:05 +01:00
Peter Boyle
7458c6174b
Use operator() for indexing internal indices
2018-09-27 06:42:02 +01:00
Peter Boyle
21b269d0f9
Move the Grid.pdf out of a deep directory
2018-09-27 06:36:25 +01:00
Peter Boyle
083af92ac2
Update from chulwoo ; high level link for Grid.pdf in documentation
2018-09-27 06:30:40 +01:00
Peter Boyle
2c162577b5
HMC documentation
2018-09-25 23:28:17 +01:00
Peter Boyle
b1c4e96382
Updates to actions etc..
2018-09-24 22:10:30 +01:00
Peter Boyle
a55c6f34f3
Updated docs
2018-09-24 15:44:35 +01:00
Peter Boyle
beed527ea3
Carletons chapter
2018-09-24 15:09:51 +01:00
eaa633cf69
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-09-21 18:16:22 +01:00
c632455129
Hadrons: meson field IO fix
2018-09-21 18:16:01 +01:00
c012899ed5
Hadrons: big update after templating of get/createGrid
2018-09-21 18:15:33 +01:00
paboyle
8bab544c2f
Updated manual pdf
2018-09-20 18:51:11 +01:00
paboyle
76fc06a5dc
Updates with todo from Carleton
2018-09-20 18:50:11 +01:00
fionnoh
d9de8fd5c9
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-09-17 12:19:47 +01:00
fionnoh
7e3647246c
Merge branch 'master' of https://github.com/paboyle/Grid into develop
2018-09-17 12:19:20 +01:00
4af6c7e7aa
Hadrons: copyright update
2018-09-14 12:51:48 +01:00
f60fbcfc4d
Hadrons: mixed precision CG, to be tested
2018-09-14 12:47:55 +01:00
464c81706e
Hadrons: defaults Impls for different precisions
2018-09-14 12:46:43 +01:00
Peter Boyle
adbdc4e65b
Half comms not working on GPU yet, so disable.
2018-09-11 05:15:22 +01:00
Peter Boyle
e4deea4b94
Weird bug appears with Vector<Vector<>>.
...
"fix" with std::vector<Vector<>>
Lies in the face table code. But think there is some latent problem.
Possibly in my allocator since it is caching, but could simplify or eliminate the caching
option and retest. One to look at later.
2018-09-11 04:36:57 +01:00
Peter Boyle
94d721a20b
Comments on further topology discovery work
2018-09-11 04:20:04 +01:00
408130b808
Hadrons: header list fix
2018-09-10 17:38:54 +01:00
375edd1370
file forgotten in last commit
2018-09-10 17:37:29 +01:00
6d912f6c67
Hadrons: general guesser factory
2018-09-10 17:36:54 +01:00
6d1d28955e
Guesser class is redundant, switching to LinearFunction
2018-09-10 17:35:54 +01:00
920b471761
Hadrons tests update
2018-09-10 15:32:13 +01:00
63c21767ba
Hadrons: grids stored with hash of SIMD type (for mixed-precision setups)
2018-09-10 15:31:39 +01:00
7b6b712565
function to convert std::vector to string
2018-09-10 15:17:32 +01:00
35abd05ee9
mute Version.h cache creation
2018-09-10 15:16:59 +01:00
Peter Boyle
7bf82f5b37
Offload the face handling to GPU
2018-09-10 11:28:42 +01:00
Peter Boyle
f02c7ea534
Peer to peer on GPU's setup
2018-09-10 11:26:20 +01:00
Peter Boyle
bc503b60e6
Offloadable gather code
2018-09-10 11:21:25 +01:00
Peter Boyle
704ca162c1
Offloadable compression
2018-09-10 11:20:50 +01:00
Peter Boyle
b5329d8852
Protect against zero length loops giving a kernel call failure
2018-09-10 11:20:07 +01:00
Peter Boyle
f27b9347ff
Better unquiesce MPI coverage
2018-09-10 11:19:39 +01:00
Peter Boyle
b4967f0231
Verbose and error trapping cleaner
2018-09-09 14:28:02 +01:00
Peter Boyle
6d0f1aabb1
Fix the multi-node path
2018-09-09 14:27:37 +01:00
Peter Boyle
f4bfeb835d
Drop back to smaller Ls
2018-09-09 14:25:06 +01:00
Peter Boyle
394b7b6276
Verbose decrease
2018-09-09 14:24:46 +01:00
dd36e60f6a
compilation fix for hypercube optimal communicator
2018-09-08 18:07:29 +01:00
cb6c548e21
Hadrons: code cleaning
2018-09-07 20:40:55 +01:00
02c4ccf621
Hadrons: diskvector debug message for writes
2018-09-07 20:33:49 +01:00
fd24588212
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-09-07 20:25:11 +01:00
b800bb3ecb
Hadrons: disk vector cache policy to last touch
2018-09-07 20:24:48 +01:00
f8abd0978b
Hadrons copyright update
2018-09-07 20:10:07 +01:00
12c7c493bf
Hadrons: disk-based container
2018-09-07 20:04:54 +01:00
paboyle
c7c9072313
Documentation
2018-09-06 16:01:42 +01:00
2bf3be5fae
Hadrons: copyright and code cleaning
2018-09-04 18:25:10 +01:00
3a40e4fc69
Hadrons: scalar SU(N) 2-pt guard against negative momenta components
2018-09-04 18:24:07 +01:00
2e69e03f6f
Hadrons: CosmHol configs IO module
2018-09-04 18:23:28 +01:00
a09f9bb528
Hadrons: code cleaning
2018-09-04 18:22:21 +01:00
f0e341d726
Hadrons: module list generator fix
2018-09-04 18:22:04 +01:00
6f09df0daf
Hadrons: A2A matrix IO fix
2018-09-02 01:46:22 +01:00
26cee605b8
Hadrons: copyright update
2018-09-01 21:30:30 +01:00
b3fa18c229
copyright script never removes authorship
2018-09-01 21:29:58 +01:00
2940c9bcfd
Hadrons: dedicated IO class for A2A matrices
2018-09-01 21:09:01 +01:00
0bb532f72b
more explicit clean git tree message
2018-09-01 20:02:18 +01:00
fada2aa0f7
Hadrons: precision fix
2018-09-01 20:00:12 +01:00
c193e4e675
Aslash expression in Mathematica notebook
2018-09-01 19:59:58 +01:00
3ee682f676
more Version.h fine tuning
2018-09-01 19:58:16 +01:00
d85ec3bac2
build system minor fix
2018-09-01 19:54:21 +01:00
b52d8eb1e3
better Version.h implementation
2018-09-01 19:49:13 +01:00
ee630d2e8b
Hadrons: smearing plaquette output
2018-09-01 17:38:32 +01:00
2f0af79869
Hadrons: scalar SU(N) NPR update
2018-09-01 17:36:35 +01:00
1b7fb79ec0
CI fix
2018-08-28 18:26:37 +01:00
2db1a4628c
build system minor fix
2018-08-28 18:26:30 +01:00
6aa047d842
Hadrons module template fix
2018-08-28 17:17:00 +01:00
8779c32ae1
Merge branch 'feature/hadrons' into develop
2018-08-28 17:10:33 +01:00
c527dc3358
CI fix
2018-08-28 17:10:08 +01:00
6b42577b6b
gitignore update
2018-08-28 16:58:37 +01:00
fb3596f968
Hadrons: precision fixes
2018-08-28 16:58:23 +01:00
f3a0158213
code cleaning
2018-08-28 16:56:07 +01:00
0250aa9347
file committed in error
2018-08-28 16:55:48 +01:00
3df6743396
more build system cleaning and patch for bad include in Eigen
2018-08-28 16:54:57 +01:00
fb7d021b9d
Hadrons: moving Hadrons to root directory, build system improvements
2018-08-28 15:00:40 +01:00
5f206df775
Hadrons: meson field cache friendly cache copy
2018-08-15 17:29:44 +01:00
7727e81113
Hadrons: slight improvement on previous commit
2018-08-14 20:18:47 +01:00
c4115544a5
Hadrons: application option to save graph
2018-08-14 20:03:53 +01:00
08c47328ba
Hadrons: meson field kernel performance for each block
2018-08-14 17:35:42 +01:00
09001aedca
Hadrons: meson fields saved in single precision
2018-08-14 17:19:38 +01:00
2c67304716
Hadrons: meson field code cleaning
2018-08-14 17:00:05 +01:00
dc6d8686de
Hadrons: meson field chunked HDF5 IO
2018-08-14 16:40:29 +01:00
cc2780bea3
Hadrons: meson field parallel IO
2018-08-14 14:55:13 +01:00
6e5a2b7922
fix previous commit
2018-08-14 14:07:54 +01:00
f4878d3a13
Hadrons: meson field threaded cache copy
2018-08-14 14:02:37 +01:00
89d2fac92e
Hadrons: copyright update
2018-08-14 12:19:14 +01:00
f2d3e41cf2
Hadrons: meson field: HDF5 perf, gamma input and Eigen tensors allocated by Grid
2018-08-13 20:18:33 +01:00
3c27bb36d4
Hadrons: direct timer access
2018-08-13 20:17:45 +01:00
603d59f389
Hadrons: code cleaning
2018-08-13 20:17:24 +01:00
07a0ef3f95
Hadrons: global measurement time profile
2018-08-13 16:44:57 +01:00
503259f9c9
Hadrons: meson field HDF5 IO done and tested
2018-08-12 16:52:12 +01:00
5be6a51044
Hadrons: meson fields code cleaning and momentum phases
2018-08-11 15:13:43 +01:00
ac69f042b1
Hadrons: module RNG uniquely seeded with <run id> + <module name> + <trajectory>
2018-08-10 18:27:00 +01:00
133d5c2e34
Merge branch 'develop' into feature/hadrons
2018-08-10 16:36:40 +01:00
2a94244890
configure: --with-openssl option and LIME is now mandatory
2018-08-10 16:36:11 +01:00
a15a2dfd29
Merge branch 'develop' into feature/hadrons
2018-08-10 16:08:22 +01:00
093bb02633
Hadrons: execute message for time diluted noise
2018-08-10 16:07:48 +01:00
99a85116f8
Hadrons: module and VM instrumentation
2018-08-10 16:07:30 +01:00
paboyle
27cdb79063
Sha used to seed from a unique string
2018-08-10 15:11:01 +01:00
f4cbfd63ff
Hadrons: more meson field cleaning, needs IO now
2018-08-09 18:39:58 +01:00
2b794b6aa7
Hadrons: module generating random lattices for testing purposes
2018-08-09 17:16:42 +01:00
d0244a059f
Hadrons: cleaning cleaning...
2018-08-09 00:38:17 +01:00
dcdd891d7d
Hadrons: precision fix
2018-08-09 00:13:53 +01:00
6d2df9de79
Hadrons: even more cleaning
2018-08-08 23:15:55 +01:00
41d4e37bae
Hadrons: more cleaning
2018-08-08 19:04:44 +01:00
ee5c0cc9b6
Hadrons: code cleaning
2018-08-08 18:45:06 +01:00
0a4020eb4d
Hadrons: copyright fix
2018-08-07 18:42:52 +01:00
b2de26589b
Hadrons: code cleaning and copyright update
2018-08-07 18:40:48 +01:00
0677adb4dd
Hadrons: overhaul of A2A for production
2018-08-07 18:27:59 +01:00
231cc95be6
Hadrons: eigenvalues precision fix
2018-08-07 18:27:19 +01:00
639f9cab82
Hadrons: schedule loading fix
2018-08-07 18:26:49 +01:00
4eac4e575e
Hadrons: meson fields indentation fix
2018-08-06 12:42:25 +01:00
3f0f92cda6
Hadrons: first cleaning/integration of A2A/meson fields
2018-08-06 12:11:52 +01:00
d2650e89bd
Hadrons: VM exception for object type (solves infinite loop in scheduler)
2018-08-06 12:11:00 +01:00
2962123cba
Hadrons: diluted noise polish
2018-08-05 01:44:37 +01:00
830168ec37
Hadrons: first try at diluted noise class (tested)
2018-08-04 12:32:58 +01:00
584c921ca0
Eigen support fix (use of Grid as a library was broken)
2018-08-03 21:07:58 +01:00
81347b4d16
gitignore update
2018-08-03 19:58:52 +01:00
2cfa0b0e6b
Merge pull request #174 from fionnoh/a2a_basics
...
A2A basics
2018-08-03 16:32:14 +01:00
fionnoh
fa5dee76b1
Included Peter's A2AMeson field and Eigen changes
2018-08-03 15:15:54 +01:00
fionnoh
8d1679c6b8
Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into a2a_basics
2018-08-03 15:12:24 +01:00
Peter Boyle
3791a38f7c
Optimised the MesonField a bit more
2018-08-01 08:27:27 +01:00
Peter Boyle
142f7b0c86
Updated the A2A Meson Field module
2018-07-31 15:58:02 +01:00
fionnoh
891ad66eab
Included changes to Hadrons RBPrecCG solver needed for subtraction of guess
2018-07-31 11:26:07 +01:00
Peter Boyle
60c43151c5
Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into feature/hadrons-a2a
2018-07-31 01:09:02 +01:00
paboyle
e036800261
Eigen fix
2018-07-31 01:08:42 +01:00
Peter Boyle
62900def36
Merge branch 'feature/hadrons-a2a' of https://github.com/paboyle/Grid into feature/hadrons-a2a
2018-07-31 00:36:26 +01:00
paboyle
e3a309a73f
Eigen happiness
2018-07-31 00:35:17 +01:00
fionnoh
ad6c1c0c4e
The basics of what is needed in Grid and Hadrons for the A2A class and module, with none of the contraction or MF code.
2018-07-30 18:40:50 +01:00
Peter Boyle
00b92a91b5
Optimising
2018-07-28 23:46:22 +01:00
paboyle
65533741f7
7 moms
2018-07-28 16:17:47 +01:00
Peter Boyle
dc0259fbda
Merge pull request #173 from fionnoh/feature/hadrons-a2a
...
Changes to meson field benchmark. Now includes the gammas in the fina…
2018-07-27 23:03:56 +01:00
Peter Boyle
131a6785d4
Merge branch 'feature/hadrons-a2a' into feature/hadrons-a2a
2018-07-27 23:03:42 +01:00
paboyle
44f4f5c8e2
Momentum loop
2018-07-27 23:00:16 +01:00
fionnoh
2679df034f
Changes to meson field benchmark. Now includes the gammas in the final part of the naive method, both methods compute
...
lhs^dag*Gamma*rhs (previously Gamma*lhs^dag*rhs), and checks results.
2018-07-27 18:31:10 +01:00
bf71162b97
Hadrons: backtrace on abort
2018-07-26 19:20:12 +01:00
299e828d83
Merge branch 'develop' into feature/hadrons
2018-07-26 16:49:49 +01:00
ef5452cddf
Hadrons: smarter memory profiler
2018-07-26 16:47:45 +01:00
80de748737
Hadrons: new exceptions which can save a integer
2018-07-26 16:47:25 +01:00
paboyle
71e1006ba8
Updated meson field benchmark for dirac structures
2018-07-26 09:09:29 +01:00
00f31ae83f
Merge pull request #163 from goracle/unstaged
...
Add printing of whether there are unstaged changes in the git hash print
2018-07-25 19:00:00 +00:00
cce339deaf
Merge pull request #172 from fionnoh/feature/hadrons
...
feature/hadrons -> feature/hadrons-a2a
2018-07-25 17:20:19 +00:00
fionnoh
24128ff109
Changes needed for MF benchmark to work with comms correctly
2018-07-23 15:51:37 +01:00
Peter Boyle
da17a015c7
Pack the stencil smaller for 128 bit access
2018-07-23 06:12:45 -04:00
Peter Boyle
1fd08c21ac
make simd width configure time option for GPU
2018-07-23 06:10:55 -04:00
Peter Boyle
28db0631ff
Hack to force 128bit accesses
2018-07-23 06:10:27 -04:00
Peter Boyle
b35401b86b
Fix CUDA_ARCH. Need to simplify. See when new eigen release happens
2018-07-23 06:09:33 -04:00
Peter Boyle
a0714de8ec
Define vector length for GPU
2018-07-23 06:09:05 -04:00
Peter Boyle
21a1710b43
Verbose vector length
2018-07-23 06:08:39 -04:00
fionnoh
34e9d3f0ca
Moved the creation and resizing of the v and w high modes from the A2A class to the A2A module and made them an output of the module. This means that they have to be inputs of the contration modules and they will freed from memory if they are no longer needed.
2018-07-22 14:40:31 +01:00
fionnoh
c995788259
Added ImportUnphysicalFermion and included appropriate logic for 5d w vectors in A2A code
2018-07-21 00:08:11 +01:00
fionnoh
94c7198001
Added ZFIMPL to A2AMeson contraction
2018-07-20 23:08:22 +01:00
fionnoh
04d86fe9f3
Removed overly verbose print statement
2018-07-20 21:38:19 +01:00
fionnoh
b78074b6a0
Removed a Dminus from high mode v and removed duplication pf D_oo code
2018-07-20 16:55:24 +01:00
fionnoh
7dfd3cdae8
Inclusion of ExportPhysicalFermionSource that fixes a bug in the low mode w vectors
2018-07-20 15:45:43 +01:00
fionnoh
cecee1ef2c
Merge branch 'develop' of github.com:paboyle/Grid into feature/hadrons
2018-07-20 13:37:50 +01:00
fionnoh
355d4b58be
Merge branch 'feature/hadrons' of github.com:fionnoh/Grid into feature/hadrons
2018-07-19 16:07:54 +01:00
fionnoh
2c54a536f3
Moved the meson field inner product to its own header file
2018-07-19 15:56:52 +01:00
fionnoh
d868a45120
Cleaned up some stuff that was erroneously included in a previous "trash" commit. Leaving in the mySliceInnerProdct function for now as it speeds up mesonfield creation quite a lot for 24^3 tests
2018-07-16 16:19:59 +01:00
fionnoh
9deae8c962
A2A meson field contraction code
2018-07-16 14:18:45 +01:00
Peter Boyle
b2b5137d28
Finally starting to get decent performance on Volta
2018-07-13 12:06:18 -04:00
fionnoh
db86cdd7bd
Possible trash commit
2018-07-10 13:30:45 +01:00
paboyle
ec9939c1ba
Test for faster implementation of meson field inner loop
...
This should be possible to cache block at outer levels, global sum across nodes not performed
and deferred to caller to block them all into a big all reduce.
Nc=3 and Fermion is hard coded in an ugly way. We might think about benchmarking whether
a product without the conjugate should be made available by Grid.
It is not clear whether the explicit unroll, or the performing of conjugate on left once
was the real source of the speed up.
Gives 70-80 GF/s on my laptop (single) half that double, and 70GB/s to cache.
This is competitive with dslash and a reasonable stopping point for the optimisation. If necessary we can revisit.
2018-07-10 12:38:51 +01:00
Peter Boyle
2cc07450f4
Fastest option for the dslash
2018-07-05 09:57:55 -04:00
Peter Boyle
c0e8bc9da9
Current version gets 250 - 320 GF/s on Volta on the target 12^4 volume.
2018-07-05 07:10:25 -04:00
Peter Boyle
b1265ae867
Prettify code
2018-07-05 07:08:06 -04:00
Peter Boyle
32bb85ea4c
Standard extractLane is fast
2018-07-05 07:07:30 -04:00
Peter Boyle
ca0607b6ef
Clearer kernel call meaning
2018-07-05 07:06:15 -04:00
Peter Boyle
19b527e83f
Better extract merge for GPU. Let the SIMD header files define the pointer type for
...
access. GPU redirects through builtin float2, double2 for complex
2018-07-05 07:05:13 -04:00
Peter Boyle
4730d4692a
Fast lane extract, saturates bandwidth on Volta for SU3 benchmarks
2018-07-05 07:03:33 -04:00
Peter Boyle
1bb456c0c5
Minor GPU vector width changeÂ
2018-07-05 07:02:04 -04:00
Peter Boyle
4b04ae3611
Printing improvement
2018-07-05 06:59:38 -04:00
Peter Boyle
2f776d51c6
Gpu specific benchmark saturates memory. Can enhance Grid to do this for expressions,
...
but a bitof (known) work.
2018-07-05 06:58:37 -04:00
fionnoh
f74617c124
Added ZFIMPL to meson field module
2018-07-03 14:04:53 +01:00
fionnoh
8c6a3921ed
Merge remote-tracking branch 'upstream/feature/hadrons' into feature/hadrons
2018-07-03 11:35:14 +01:00
a8a15dd9d0
Hadrons: code cleaning
2018-07-02 17:52:39 +01:00
3ce68a751a
Hadrons: stout smearing module
2018-07-02 17:52:04 +01:00
fionnoh
daa0977d01
Included a print statement that indicates that the guess is being subtracted from the solve.
2018-06-28 16:34:56 +01:00
fionnoh
a2929f4384
Removed A2A contraction module and replaced it with the beginnings of a meson field module
2018-06-28 16:17:26 +01:00
fionnoh
7fe3974c0a
Included eigenPacks and action as references, not inputs, of A2A module. They now now longer need to be parameters in the meson field modules.
2018-06-28 16:14:49 +01:00
fionnoh
f7e86f81a0
Changes A2A class to make use of the new Solver class
2018-06-28 16:14:16 +01:00
fionnoh
fecec803d9
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons
2018-06-28 16:13:43 +01:00
fionnoh
8fe9a13cdd
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons
2018-06-28 16:13:07 +01:00
paboyle
3a50afe7e7
GPU dslash updates
2018-06-27 22:32:21 +01:00
paboyle
f8e880b445
Loop for s and xyzt offlow
2018-06-27 21:49:57 +01:00
paboyle
3e947527cb
Move looping over "s" and "site" into kernels for GPU optimisatoin
2018-06-27 21:29:43 +01:00
paboyle
31f65beac8
Move site and Ls looping into the kernels
2018-06-27 21:28:48 +01:00
paboyle
38e2a32ac9
Single SIMD lane operations for CUDA
2018-06-27 21:28:06 +01:00
paboyle
efa84ca50a
Keep Cuda 9.1 happy
2018-06-27 21:27:32 +01:00
paboyle
5e96d6d04c
Keep CUDA happy
2018-06-27 21:27:11 +01:00
paboyle
df30bdc599
CUDA happy
2018-06-27 21:26:49 +01:00
paboyle
7f45222924
Diagnostics on memory alloc fail
2018-06-27 21:26:20 +01:00
paboyle
dd891f5e3b
Use NVCC to suppress device Eigen
2018-06-27 21:25:17 +01:00
d2c42e6f42
Hadrons: scaled DWF action
2018-06-26 14:59:33 +01:00
Daniel Richtmann
2881b3e8e5
WilsonMG: Remove unnecessary static assertions
2018-06-26 14:42:30 +02:00
049cc518f4
Hadrons: introduction message 2
2018-06-25 19:08:39 +01:00
2e1c66897f
Hadrons: introduction message
2018-06-25 19:08:22 +01:00
adcef36189
Hadrons: Möbius DWF action
2018-06-25 15:58:35 +01:00
fionnoh
2f121c41c9
Commiting reation of meson field code before a merge with the upstream branch feature/hadrons
2018-06-25 12:20:46 +01:00
e0ed7e300f
Hadrons: spurious Dminus removed
2018-06-22 16:33:43 +02:00
485207901b
Merge branch 'develop' into feature/hadrons
2018-06-22 16:15:32 +02:00
c760f0a4c3
Hadrons: remove make_5D/4D functions and FreeProp fix
2018-06-22 16:12:46 +02:00
c84eeedec3
Hadrons: GaugeProp module for z-Wilson actions
2018-06-22 15:53:22 +02:00
fionnoh
1ac3526f33
Small changes to the A2A header and module
2018-06-22 12:29:42 +01:00
fionnoh
0de090ee74
Temporarily added in the contraction code that produced the working 2-pt function. This is commited for reference only and will be removed in the next push.
2018-06-22 12:28:41 +01:00
91405de3f7
Hadrons: new solver exposing fermion matrix and generic source/solve import/export
2018-06-22 12:14:37 +02:00
fionnoh
8fccda301a
Fixed a bug where the guess was always subtracted after the solve and included appropriate weights for the sources in the one case we're looking at now. More work needs to be done to make the 5d/4d source logic less brittle.
2018-06-21 16:36:59 +01:00
fionnoh
7a0abfac89
Restructured the class that computes and returns the A2A vectors.
2018-06-21 16:36:06 +01:00
fionnoh
ae37fda699
A more elegant way to subtract guesses from solve and a bool check before verifying residual
2018-06-20 16:07:40 +01:00
fionnoh
b5fc5e2030
All to all module update that hit a promising milestone. Commiting for a reference for future changes.
2018-06-20 10:59:07 +01:00
Daniel Richtmann
cc5d025ea4
WilsonMG: Adapt staggered GMRES/MR tests to "new" constructor
2018-06-18 16:20:20 +02:00
paboyle
6c97a6a071
Coalescing version of the kernel
2018-06-13 20:52:29 +01:00
paboyle
73bb2d5128
Ugly hack to speed up compile on GPU; we don't use the hand kernels on GPU anyway so why compile
2018-06-13 20:35:28 +01:00
paboyle
b710fec6ea
Gpu code first version of specialised kernel
2018-06-13 20:34:39 +01:00
paboyle
b2a8cd60f5
Doubled gauge field is useful
2018-06-13 20:27:47 +01:00
paboyle
867ee364ab
Explicit instantiation hooks
2018-06-13 20:27:12 +01:00
paboyle
25becc9324
GPU tweaks for benchmarking; really necessary?
2018-06-13 20:26:07 +01:00
paboyle
94d1ae4c82
Some prep work for GPU shared memory. Need to be careful, as will try GPU direct
...
RDMA and inter-GPU memory sharing on SUmmit later
2018-06-13 20:24:06 +01:00
paboyle
2075b177ef
CUDA_ARCH more carefule treatment
2018-06-13 20:22:34 +01:00
paboyle
847c761ccc
Move sfw IEEE fp16 into central location
2018-06-13 20:22:01 +01:00
paboyle
8287ed8383
New GPU vector targets
2018-06-13 20:21:35 +01:00
paboyle
e6be7416f4
Use managed memory
2018-06-13 20:14:00 +01:00
paboyle
26863b6d95
User Managed memory
2018-06-13 20:13:42 +01:00
paboyle
ebd730bd54
Adding 2D loops
2018-06-13 20:13:01 +01:00
paboyle
066be31a3b
Optional GPU target SIMD types; work in progress and trying experiments
2018-06-13 20:07:55 +01:00
paboyle
7a4c142955
Add GPU specific simd targets
2018-06-13 19:55:30 +01:00
Daniel Richtmann
ddcb53bce2
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
2018-06-13 09:50:37 +02:00
Daniel Richtmann
d1c80e1d46
WilsonMG: Correct years in copyright line
2018-06-13 09:44:09 +02:00
Daniel Richtmann
c73cc7d354
WilsonMG: Add tests with MG preconditioner running single precision, outer solver running in double
2018-06-12 16:10:48 +02:00
Daniel Richtmann
49fdc324a0
WilsonMG: Make MG correctness checks abort on failing tests
2018-06-12 16:10:48 +02:00
Daniel Richtmann
f32714a2d1
WilsonMG: Make running MG correctness checks optional via commandline
2018-06-12 16:10:48 +02:00
Daniel Richtmann
73a955be20
WilsonMG: Move tests for Wilson & WilsonClover into separate files
2018-06-12 16:10:48 +02:00
Daniel Richtmann
66b7a0f871
WilsonMG: Move multigrid class to separate file
2018-06-12 16:10:48 +02:00
Daniel Richtmann
2ab9d4bc56
WilsonMG: Fix random behavior in GMRES
...
From time to time I saw random since the basis vectors were not initialized
properly.
2018-06-12 15:01:31 +02:00
Daniel Richtmann
4f41cd114d
WilsonMG: Add a mixed precision version of FGMRES
...
This version does everything in double prec but accepts a preconditioner working
in single precision.
2018-06-12 15:01:31 +02:00
Daniel Richtmann
11c4f5e32c
WilsonMG: Provide command line switch for reading in input xml + move default params to constructor of MultiGridParams
2018-06-12 15:01:31 +02:00
Daniel Richtmann
e9b9550298
WilsonMG: Fix incompatibility with single prec MG in construction of simd layout on coarser grids
2018-06-12 15:01:31 +02:00
Daniel Richtmann
7564fedf68
WilsonMG: Set subspace to zero to avoid random behavior
2018-06-12 15:01:31 +02:00
8db0ef9736
Merge pull request #168 from jch1g10/feature/qed-fvol
...
Feature/qed fvol
2018-06-08 20:09:06 +02:00
Guido Cossu
95d4b46446
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-06-08 11:30:29 +01:00
paboyle
251b904a28
Merge branch 'release/ISC-freeze-2'
2018-06-04 21:09:48 +01:00
paboyle
5dfd216a34
Better thread safety
2018-06-04 21:08:44 +01:00
paboyle
5a112feac3
Merge branch 'release/ISC-freeze-1'
2018-06-04 18:49:40 +01:00
paboyle
c2e8d0aa88
Solve g++ problem on the lanczos test
2018-06-04 18:34:15 +01:00
James Harrison
0fe5aeffbb
Merge branch 'feature/hadrons' into feature/qed-fvol
2018-06-04 16:59:43 +01:00
James Harrison
7fbc469046
Merge branch 'develop' into feature/hadrons
2018-06-04 16:58:30 +01:00
paboyle
bf96a4bdbf
Merge branch 'master' into develop
2018-06-04 14:03:11 +01:00
paboyle
84685c9bc3
Overflow fix
2018-06-04 13:42:07 +01:00
fionnoh
a8d4156997
Added a Hadrons module that computes the all-to-all v and w vectors
2018-05-31 17:18:58 +01:00
fionnoh
c18074869b
Changes to Hadrons SchurRB solver to allow for a subtract_guess boolean to be passed
2018-05-31 17:17:16 +01:00
fionnoh
f4c6d39238
CHanges made to SchurRB solvers to allow for the subtraction of a guess after solve
2018-05-31 17:16:20 +01:00
200d35b38a
Merge branch 'develop' into feature/hadrons
2018-05-28 11:52:47 +02:00
eb52e84d09
Merge branch 'feature/hadrons' of github.com:paboyle/Grid into feature/hadrons
2018-05-28 11:50:27 +02:00
72abc34764
Merge pull request #166 from guelpers/feature/hadrons
...
Feature/hadrons
2018-05-28 11:49:46 +02:00
e3164d4c7b
Hadrons: env function to get volume in double
2018-05-28 11:39:17 +02:00
James Harrison
f5db386c55
Change MODULE_REGISTER_NS -> MODULE_REGISTER in UnitEM, ScalarVP and VPCounterTerms
2018-05-22 16:16:21 +01:00
James Harrison
294ee70a7a
Merge branch 'feature/hadrons' into feature/qed-fvol
...
# Conflicts:
# extras/Hadrons/modules.inc
# lib/qcd/action/gauge/Photon.h
2018-05-21 18:02:41 +01:00
Azusa Yamaguchi
013ea4e8d1
Merge branch 'feature/staggered-comms-compute' into develop
2018-05-21 13:11:56 +01:00
Azusa Yamaguchi
7fbbb31a50
Merge branch 'develop' into feature/staggered-comms-compute
...
Conflicts:
lib/qcd/action/fermion/ImprovedStaggeredFermion.cc
2018-05-21 13:07:29 +01:00
Azusa Yamaguchi
0e127b1fc7
New file single prec test
2018-05-21 12:57:13 +01:00
Azusa Yamaguchi
68c028b0a6
Comment
2018-05-21 12:54:25 +01:00
255d4992e1
Hadrons: stochastic scalar SU(N) free field fix
2018-05-18 20:49:55 +01:00
a0d399e5ce
Hadrons: yet other attempts at EMT NPR
2018-05-18 20:49:26 +01:00
fd3b2e945a
Hadrons: don't right result with empty stem
2018-05-18 20:48:24 +01:00
Daniel Richtmann
6c27c72585
WilsonMG: Provide more sensible default values for MG parameters
2018-05-16 17:26:09 +02:00
Daniel Richtmann
9c003d2d72
WilsonMG: Base wilson mg preconditioner entirely on existing infrastructure
2018-05-16 17:26:09 +02:00
Daniel Richtmann
4b8710970c
WilsonMG: Switch to Galerkin coarsening in CoarsenedMatrix
2018-05-16 17:26:09 +02:00
Daniel Richtmann
68d686ec38
WilsonMG: Add functionality for applying G5 on coarse grids
2018-05-16 16:17:14 +02:00
Daniel Richtmann
c48b69ca81
WilsonMG: Implement Mdir & Mdiag in CoarsenedMatrix
2018-05-16 16:08:05 +02:00
Daniel Richtmann
df8c208f5c
WilsonMG: Revert CoarsenedMatrix.h and Lattice_transfer.h back to state of develop branch
2018-05-16 16:02:54 +02:00
Daniel Richtmann
61812ab7f1
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
2018-05-15 14:57:18 +02:00
b999984501
Merge branch 'develop' into feature/hadrons
2018-05-15 13:53:57 +01:00
Guido Cossu
7836cc2d74
No checksum output on log for scidac
2018-05-15 10:10:08 +01:00
Peter Boyle
eb7d34a4cc
GPU version
2018-05-14 19:41:47 -04:00
Peter Boyle
aab27a655a
Start of GPU kernels
2018-05-14 19:41:17 -04:00
Peter Boyle
93280bae85
Gpu option
2018-05-14 19:40:58 -04:00
Peter Boyle
c5f93abcd7
GPU clean up
2018-05-14 19:40:33 -04:00
Peter Boyle
d5deef782d
Useful debug comments
2018-05-14 19:39:52 -04:00
Peter Boyle
5f50473c0d
Clean up
2018-05-14 19:39:11 -04:00
a61e0df54b
Travis fix for Lime
2018-05-14 19:56:12 +01:00
9d835afa35
Attempt at solving the FP exception in the QED code
2018-05-14 19:05:54 +01:00
5e3be47117
Hadrons: scalar SU(N) various fixes
2018-05-14 18:58:39 +01:00
Peter Boyle
13f50406e3
Suppress print statement
2018-05-12 18:00:00 -04:00
Peter Boyle
09cd46d337
Lane by Lane operation
2018-05-12 17:59:35 -04:00
Peter Boyle
d3f51065c2
Give command line control of blocks/threads split
2018-05-12 17:58:56 -04:00
Peter Boyle
925ac4173d
Thread count control for warp scheduler thingy doodaa thing
2018-05-12 17:58:22 -04:00
Peter Boyle
eb921041d0
Perf count control
2018-05-12 17:57:32 -04:00
48de706dd5
Merge branch 'develop' into feature/hadrons
2018-05-11 18:06:40 +01:00
f871fb0c6d
check file is opened correctly in the Lime reader
2018-05-11 18:06:28 +01:00
93771f3099
Hadrons: scalar SU(N) stochastic free field
2018-05-10 22:29:48 +01:00
8cb205725b
Merge branch 'develop' into feature/hadrons
2018-05-09 23:56:35 +01:00
9ad580d82f
Hadrons: format fix
2018-05-07 21:38:15 +01:00
899f961d0d
Hadrons: eigenvalue metadata saved with 16 significant digits
2018-05-07 21:37:03 +01:00
54d789204f
more general implementation of the precision interface for serialisers
2018-05-07 21:17:46 +01:00
25828746f3
XML precision scientific with 16 digits by default
2018-05-07 21:04:31 +01:00
f362c00739
Hadrons: better handling of automatic directory creation
2018-05-07 19:43:40 +01:00
Guido Cossu
25d1cadd3b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-05-07 18:55:09 +01:00
Guido Cossu
c24d53bbd1
Further debug of RNG I/O
2018-05-07 18:55:05 +01:00
2017e4e3b4
Hadrons: more verbose directory creation error
2018-05-07 18:12:22 +01:00
27a4d4c951
Hadrons: multi-file eigenpack in separate directory
2018-05-07 17:52:54 +01:00
2f92721249
Merge branch 'develop' into feature/hadrons
2018-05-07 17:26:47 +01:00
3c7a4106ed
Trap for deadly empty comm thread option
2018-05-07 17:26:39 +01:00
3252059daf
Hadrons: multi-file support for eigenpacks
2018-05-07 17:25:36 +01:00
paboyle
6eed167f0c
Merge branch 'release/0.8.1'
2018-05-04 17:34:11 +01:00
paboyle
4ad0df6fde
Bump volume for Gerardo
2018-05-04 17:33:23 +01:00
661381e881
Merge branch 'develop' into feature/hadrons
2018-05-04 14:52:17 +01:00
Peter Boyle
68a5079f33
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-05-04 14:13:54 +01:00
Peter Boyle
8634e19f1b
Update
2018-05-04 14:13:35 +01:00
Azusa Yamaguchi
9ada378e38
Add timing
2018-05-04 10:58:01 +01:00
Vera Guelpers
9d9692d439
Fix double vs float in boundary phases
2018-05-03 16:40:16 +01:00
0659ae4014
Merge branch 'develop' into feature/hadrons
2018-05-03 16:20:22 +01:00
bfbf2f1fa0
no threaded stencil benchmark if OpenMP is not supported
2018-05-03 16:20:01 +01:00
dd6b796a01
Hadrons: scalar SU(N) volume factor fix
2018-05-03 16:19:17 +01:00
Vera Guelpers
52a856b4a8
FreeProp module for Hadrons
2018-05-03 12:33:20 +01:00
Vera Guelpers
04190ee7f3
5D free propagator for DWF and boundary conditions for free propagators
2018-05-03 12:31:36 +01:00
Azusa Yamaguchi
587bfcc0f4
Add Timing
2018-05-03 12:10:31 +01:00
Vera Guelpers
2700992ef5
Merge remote-tracking branch 'upstream/feature/hadrons' into feature/hadrons
2018-05-03 10:01:52 +01:00
Peter Boyle
8c658de179
Compressor speed up (a little); streaming stores
2018-05-02 17:52:16 +01:00
Guido Cossu
ba37d51ee9
Debugging the RNG IO
2018-05-02 15:32:06 +01:00
Azusa Yamaguchi
4f4181c54a
Merge branch 'feature/staggered-comms-compute' of https://github.com/paboyle/Grid into feature/staggered-comms-compute
2018-05-02 14:59:13 +01:00
Guido Cossu
4d4ac2517b
Adding Scalar field theory example for Scidac format
2018-05-02 14:36:32 +01:00
Guido Cossu
e568c24d1d
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-05-02 14:29:25 +01:00
Guido Cossu
b458326744
Checkpointer module update
2018-05-02 14:29:22 +01:00
Guido Cossu
6e7d5e2243
HMC: added Scidac checkpointer and support for metadata
2018-05-02 14:28:59 +01:00
Azusa Yamaguchi
b35169f1dd
MultiShift for Staggered
2018-05-02 14:22:37 +01:00
Azusa Yamaguchi
441ad7498d
add Iterative counter
2018-05-02 14:21:30 +01:00
Peter Boyle
6f6c5c549a
Split off gparity
2018-05-02 14:11:23 +01:00
Peter Boyle
1584e17b54
Revert to fast versoin
2018-05-02 14:10:55 +01:00
Peter Boyle
12982a4455
Hypercube optimisation
2018-05-02 14:10:21 +01:00
Peter Boyle
172f412102
shmget reintroduce
2018-05-02 14:07:41 +01:00
Peter Boyle
a64497265d
TIming
2018-05-02 14:07:28 +01:00
ca639c195f
Merge branch 'develop' into feature/hadrons
2018-05-01 14:07:32 +01:00
edc28dcfbf
Hadrons: scalar SU(N) 2-pt fix
2018-05-01 14:02:31 +01:00
Peter Boyle
c45f24a1b5
Improvements for tesseract
2018-04-30 21:50:00 +01:00
Dr Peter Boyle
aaf37ee4d7
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-27 11:45:13 +01:00
Dr Peter Boyle
1dddd17e3c
Benchmark improvements from tesseract
2018-04-27 11:44:46 +01:00
paboyle
661f1d3e8e
Merge branch 'release/0.8.0' into develop
2018-04-27 11:22:33 +01:00
paboyle
edcf9b9293
Merge branch 'release/0.8.0'
2018-04-27 11:13:19 +01:00
paboyle
fe6860b4dd
Update with LIME library guard
2018-04-27 08:57:34 +01:00
paboyle
d6406b13e1
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-27 07:52:56 +01:00
paboyle
e369d7306d
Rename
2018-04-27 07:51:44 +01:00
paboyle
9f8d63e104
Roll over version
2018-04-27 07:51:12 +01:00
paboyle
9b0240d101
Hot start test
2018-04-27 07:50:51 +01:00
paboyle
b27f0e5a53
Control over IO
2018-04-27 07:50:15 +01:00
paboyle
75e4483407
Stronger convergence test
2018-04-27 07:49:57 +01:00
Guido Cossu
0734e9ddd4
Debugging Scatter_plane_simple
2018-04-27 14:39:01 +09:00
paboyle
809b1cdd58
Bug fix for MPI running ; introduced last night
2018-04-27 05:19:10 +01:00
paboyle
1be8089604
Clean compile
2018-04-26 23:42:45 +01:00
paboyle
3e0eff6468
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-26 23:00:46 +01:00
paboyle
7ecc47ac89
Quenched test compile
2018-04-26 23:00:28 +01:00
paboyle
e9f1ac09de
static
2018-04-26 23:00:08 +01:00
Peter Boyle
fa0d8feff4
Performance of CovariantCshift now non-embarrassing.
2018-04-26 17:56:27 +01:00
49b8501fd4
Merge branch 'develop' into feature/hadrons
2018-04-26 17:33:50 +01:00
d47484717e
Hadrons: scalar SU(N) result handling improvement
2018-04-26 17:32:37 +01:00
Peter Boyle
05b44aef6b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
...
Conflicts:
benchmarks/Benchmark_su3.cc
2018-04-26 15:38:49 +01:00
Peter Boyle
03e9832efa
Use macros for bare openmp
2018-04-26 14:50:02 +01:00
Peter Boyle
28a375d35d
Force static
2018-04-26 14:49:42 +01:00
Peter Boyle
3b06381745
Guard bare openmp statemetn with ifdef
2018-04-26 14:48:57 +01:00
Peter Boyle
91a0a3f820
Improvement
2018-04-26 14:48:35 +01:00
Peter Boyle
8f44c799a6
Saving the benchmarking tests for Cshift
2018-04-26 14:48:03 +01:00
Azusa Yamaguchi
96272f3841
Merge staggered fix linear operator and reduction
2018-04-26 10:33:19 +01:00
Azusa Yamaguchi
5c936d88a0
Merge branch 'feature/staggered-comms-compute' of https://github.com/paboyle/Grid into feature/staggered-comms-compute
2018-04-26 10:18:37 +01:00
Azusa Yamaguchi
1c64ee926e
Faster staggered operator with m^2 term trivial used
2018-04-26 10:17:49 +01:00
Azusa Yamaguchi
2cbb72a81c
Provide info if EE term is trivial (m^2 factor)
...
Better timing in staggered 4d case
2018-04-26 10:10:07 +01:00
Azusa Yamaguchi
31d83ee046
Enable special treatment of constEE cases
2018-04-26 10:08:46 +01:00
Azusa Yamaguchi
a9e8758a01
Improvements to staggered tests timings
2018-04-26 10:08:05 +01:00
Azusa Yamaguchi
3e125c5b61
Faster linalg on CG optimised against staggered
...
Sum overhead is bigger for staggered
2018-04-26 10:07:19 +01:00
Azusa Yamaguchi
eac6ec4b5e
Faster reductions, important on single node staggered
2018-04-26 10:03:57 +01:00
Azusa Yamaguchi
213f8db6a2
Microsecond resultion
2018-04-26 10:01:39 +01:00
Guido Cossu
6358f35b7e
Debug of previous commit
2018-04-26 14:18:11 +09:00
Guido Cossu
43f5a0df50
More timers in the integrator
2018-04-26 12:01:56 +09:00
Guido Cossu
c897878776
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-26 11:31:57 +09:00
cc6eb51e3e
Hadrons: macro refactoring for library portability
2018-04-25 16:49:14 +01:00
Vera Guelpers
507009089b
Merge remote-tracking branch 'upstream/feature/hadrons' into feature/hadrons
2018-04-25 09:36:39 +01:00
paboyle
2baf193031
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-25 00:14:03 +01:00
paboyle
362ba0443a
Cshift updates
2018-04-25 00:12:11 +01:00
paboyle
276a2353df
Move constructor
2018-04-25 00:11:07 +01:00
b234784c8e
Hadrons: scalar SU(N) takes operator pairs now
2018-04-24 19:52:12 +01:00
6ea2a8b7ca
Hadrons: scheduler shows starting value
2018-04-24 19:51:47 +01:00
c1d0359aaa
Hadrons: scalar SU(N) kinetic term saves trace
2018-04-24 19:51:22 +01:00
047ee4ad0b
Hadrons: scalar SU(N) cleanup
2018-04-24 19:50:58 +01:00
a13106da0c
Hadrons: scalar SU(N) gradient
2018-04-24 19:50:30 +01:00
75113e6523
Hadrons: Scalar SU(N) variable name update
2018-04-24 19:49:27 +01:00
325c73d051
Hadrons: module template update
2018-04-24 19:48:54 +01:00
b25a59e95e
Hadrons: mitigation of GCC/Intel compiler bug not generating defaulted destructors
2018-04-24 17:20:25 +01:00
Guido Cossu
c5b9147b53
Correction of a minor bug in the su3 benchmark
2018-04-24 08:03:57 -07:00
Guido Cossu
64ac815fd9
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-24 17:27:38 +09:00
Guido Cossu
a1be533329
Corrected Flop count in Benchmark su3 and expanded the Wilson flow output
2018-04-24 01:19:53 -07:00
7c4533797f
Hadrons: scalar SU(N) EMT improvement term optional
2018-04-23 22:46:39 +01:00
af84fd65bb
Hadrons: missing dependency message improvement
2018-04-23 22:46:17 +01:00
Dan H
1a2613086a
Fix print message.
2018-04-23 15:42:12 -04:00
Dan H
4f110c09a5
Add printing of whether there are unstaged changes in the git hash print.
2018-04-23 15:38:23 -04:00
6764362237
Hadrons: automatic directory creation fix
2018-04-23 18:45:39 +01:00
2fa2b0e0b1
Hadrons: Application header does not include all the modules
2018-04-23 17:57:17 +01:00
b61292f735
Hadrons: recursive mkdir function
2018-04-23 17:36:43 +01:00
ce7720e221
Hadrons: copyright update
2018-04-23 17:36:20 +01:00
853a5528dc
Hadrons: template modules compilation optimisation
2018-04-23 17:35:01 +01:00
169f405c9c
Hadrons: tests repaired
2018-04-23 12:48:34 +01:00
c6125b01ce
Hadrons: Error and Warning channels always on
2018-04-23 12:48:17 +01:00
b0b5b34bff
Hadrons: custom abort with module trace
2018-04-23 12:48:00 +01:00
1c9722357d
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/fermion/FermionOperator.h
2018-04-20 17:15:21 +01:00
141da3ae71
function to get tensor dimensions
2018-04-20 17:13:34 +01:00
94edf9cf8b
HDF5: direct access to group for custom operations
2018-04-20 17:13:21 +01:00
c11a3ca0a7
vectorise/unvectorise in reverse order
2018-04-20 17:13:04 +01:00
paboyle
870b1a85ae
Think I have the physical prop interface to CF and PF overlap right, but need a strong check/regression.
...
Only support Hw overlap, not Ht for now. Ht needs a new Dminus implemented.
2018-04-18 14:17:49 +01:00
paboyle
b5510427f9
physical fermion interface, cshift benchmark in SU3.
2018-04-18 01:43:29 +01:00
Guido Cossu
26ed65c8f8
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-17 12:03:32 +01:00
paboyle
f7f043d8cf
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-04-17 10:57:18 +01:00
paboyle
ddcaa6ad29
Master does header on Nersc
2018-04-17 10:48:33 +01:00
Peter Boyle
87c5c0271b
Ficxing eigen
2018-04-16 19:08:07 -04:00
Peter Boyle
a3f5a13591
Better Eigen handling
2018-04-16 18:02:55 -04:00
Peter Boyle
9fe28f00eb
Eigen sim link off head revision
2018-04-16 17:54:46 -04:00
334da7f452
Hadrons: can trace which module is throwing an error
2018-04-13 18:45:31 +02:00
4669ecd4ba
Hadrons: build improvement
2018-04-13 18:21:18 +02:00
4573b34cac
Hadrons: scalar SU(N) 2-pt functions with momentum
2018-04-13 18:21:00 +02:00
Peter Boyle
a8a0bb85cc
Control scalar execution or vector under generic. Disable Eigen vectorisation on powerpc / SUmmit
2018-04-12 12:32:57 -04:00
Peter Boyle
6411caad67
work distribution
2018-04-12 11:41:41 -04:00
Peter Boyle
7533035a99
Control Eigen vectorisatoin
2018-04-12 11:40:56 -04:00
17f57e85d1
Merge branch 'develop' into feature/hadrons
2018-04-06 22:53:11 +01:00
c8d4d184ee
XML push fragment fix
2018-04-06 22:53:01 +01:00
17f27b1ebd
Hadrons: eigenpack writer fix
2018-04-06 22:52:11 +01:00
a16bbecb8a
Hadrons: more feedback
2018-04-06 19:38:20 +01:00
7c9b0dd842
Hadrons: top level name for eigenpack metadata
2018-04-06 19:32:22 +01:00
6b7228b3e6
Hadrons: better metadata for eigenpack
2018-04-06 19:29:53 +01:00
f117552334
post-merge fix
2018-04-06 18:38:46 +01:00
a21a160029
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/serialisation/XmlIO.cc
2018-04-06 18:34:19 +01:00
1569a374a9
XML interface polish, XML fragments can be pushed into a writer
2018-04-06 18:32:14 +01:00
eddf023b8a
pugixml 1.9 update
2018-04-06 16:17:22 +01:00
6b8ffbe735
Hadrons: genetic minimum value type fix
2018-04-06 15:41:31 +01:00
81050535a5
Hadrons: truncate eigenvalues when loading partial eigenpack
2018-04-06 13:48:58 +01:00
7dcf5c90e3
Hadrons: eigenpack must be referred by solver when used
2018-04-06 13:16:28 +01:00
9ce00f26f9
not special characters in std::vector operator<<
2018-04-04 17:44:56 +01:00
85c253ed4a
Test_serialisation MPI fix
2018-04-04 17:19:34 +01:00
ccfc0a5a89
Hadrons: better string representation of module parameters
2018-04-04 17:19:22 +01:00
d3f857b1c9
Hadrons: proper metadata for eigenpacks
2018-04-04 16:36:37 +01:00
fb62035aa0
Hadrons: do not create RB coarse grids
2018-04-03 19:49:11 +01:00
0260bc7705
Hadrons: eigen pack writing only for boss node
2018-04-03 18:55:46 +01:00
68e6a58f12
Hadrons: several Lanczos fixes and improvements
2018-04-03 17:42:21 +01:00
Daniel Richtmann
73ced656eb
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
2018-04-03 17:51:11 +02:00
Daniel Richtmann
f69008edf1
WilsonMG: Add functionality to report timings to MG preconditioner
2018-04-03 17:26:49 +02:00
Daniel Richtmann
57a49ed22f
WilsonMG: Read in MG parameters from xml in test
2018-04-03 16:03:11 +02:00
Daniel Richtmann
ff6413a764
WilsonMG: Make number of levels chooseable at runtime
...
I don't like this solution though :(
2018-04-03 15:57:33 +02:00
Daniel Richtmann
2530bfed01
WilsonMG: Move params instance from global scope to test main function
2018-04-03 14:50:48 +02:00
640515e3d8
Merge branch 'develop' into feature/hadrons
2018-03-30 17:43:49 +01:00
paboyle
f089bf5629
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-03-30 16:17:26 +01:00
paboyle
276f113f28
IO uses master boss node for metadata.
2018-03-30 16:17:05 +01:00
97c579f637
Merge branch 'develop' into feature/hadrons
2018-03-30 16:04:44 +01:00
a13c109111
deterministic initialisation of field metadata
2018-03-30 16:03:01 +01:00
paboyle
ab6afd18ac
Still compile if no LIME
2018-03-30 13:39:20 +01:00
paboyle
5bde64d48b
Barrier required in parallel when we use ftell
2018-03-30 12:41:30 +01:00
paboyle
2f5add4d5f
Creation of file
2018-03-30 12:30:58 +01:00
c5a885dcd6
I/O benchmark
2018-03-29 19:57:41 +01:00
Daniel Richtmann
74f79c5ac7
Revert "Add function to return full type as std::string"
...
This reverts commit 1cb745c8dc .
2018-03-29 12:03:50 +02:00
Daniel Richtmann
58c30c0cb1
WilsonMG: Add conformability checks in MG preconditioner
2018-03-28 13:24:39 +02:00
Daniel Richtmann
917a92118a
WilsonMG: Move operator test to MG testing routine
2018-03-28 12:19:25 +02:00
a4d8512fb8
Revert "Lattice serialisation, just HDF5 for the moment"
...
This reverts commit 8a0cf0194f .
2018-03-27 17:55:42 +01:00
5ec903044d
Serial IO code cleaning for std:: convention
2018-03-27 17:11:50 +01:00
Daniel Richtmann
04f9cf088d
WilsonMG: Add more parameters to MultiGridParams struct
2018-03-27 17:13:11 +02:00
Daniel Richtmann
99107038f9
WilsonMG: Rationalize the level counting strategy
2018-03-27 17:06:33 +02:00
8a0cf0194f
Lattice serialisation, just HDF5 for the moment
2018-03-26 19:16:16 +01:00
Daniel Richtmann
b78456bdf4
WilsonMG: Get rid of explicit include of GCR header
2018-03-26 15:41:53 +02:00
Daniel Richtmann
08543b6b11
WilsonMG: Provide a switch between V- and K-cycle
2018-03-26 15:37:17 +02:00
Daniel Richtmann
63ba33371f
WilsonMG: Some minor refactoring
2018-03-26 15:34:53 +02:00
Daniel Richtmann
683a7d2ddd
WilsonMG: Move comment to make clang-format happy
2018-03-26 14:59:40 +02:00
1c680d4b7a
Merge branch 'develop' into feature/hadrons
2018-03-26 13:52:44 +01:00
Peter Boyle
b15db11c60
Kernels -> pure static object to enable device execution
2018-03-24 19:35:20 -04:00
Peter Boyle
f6077f9d48
Kernels -> not instantiaed otherwise object ref on GPU
2018-03-24 19:33:44 -04:00
Peter Boyle
572954ef12
Kernels not an instantiated object, just static
2018-03-24 19:33:13 -04:00
Peter Boyle
cedeaae7db
Lebesge -> StencilView if necessary
2018-03-24 19:32:41 -04:00
Peter Boyle
e6cf0b1e17
View typedefs go to OperatorImpl
2018-03-24 19:32:11 -04:00
Peter Boyle
5412628ea6
begin end lamda
2018-03-24 19:31:45 -04:00
Peter Boyle
1f70cedbab
Have to make all kernel called routines static since object reference will be a host pointer on GPU
2018-03-24 19:29:26 -04:00
Peter Boyle
b50f37cfb4
Remove overlap comms flag
2018-03-24 19:28:53 -04:00
Peter Boyle
cb0d2a1b03
threaded rng init; I thought this was on
2018-03-24 19:28:17 -04:00
Peter Boyle
6fe9b28a82
Cosmetic
2018-03-24 19:27:14 -04:00
Peter Boyle
b002587d7c
Simplify
2018-03-24 19:26:44 -04:00
Peter Boyle
6c08385782
Simplify
2018-03-24 19:26:19 -04:00
Daniel Richtmann
afdcbf79d1
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
2018-03-23 21:13:50 +01:00
Daniel Richtmann
3c3ec4e267
WilsonMG: Move tests for Wilson & WilsonClover into the same file
2018-03-23 21:12:27 +01:00
Daniel Richtmann
bbe1d5b49e
WilsonMG: Temporarily use GMRES in construction of basis vectors
...
This can go back to CG once Mdag in CoarsenedMatrix works.
2018-03-23 20:02:27 +01:00
Daniel Richtmann
0f6009a29f
WilsonMG: Huge refactor into something that could be considered an algorithm
2018-03-23 19:55:43 +01:00
Daniel Richtmann
1cfed3de7c
WilsonMG: Add new logger for MG
2018-03-23 19:55:16 +01:00
Guido Cossu
c9c073eee4
Changes in messages in test dwf mixedprec
2018-03-23 11:27:56 +00:00
Guido Cossu
f290b2e908
Fix to pass CI tests
2018-03-23 11:14:23 +00:00
Guido Cossu
5f8225461b
Fencing mixedcg test propagator write. LIME is still optional in Grid
2018-03-23 10:37:58 +00:00
Peter Boyle
4e1272fabf
Kernels need to be static to work on GPU. No reference to host resident data
2018-03-22 18:44:53 -04:00
Peter Boyle
607dc2d3c6
Remove lebesgue order
2018-03-22 18:23:09 -04:00
Peter Boyle
23c880b009
Remove lebesgue order; stick in stencil if need
2018-03-22 18:13:41 -04:00
Peter Boyle
334bb6792f
Lebesgue order removed. Stick in the stencil view
2018-03-22 18:12:12 -04:00
Peter Boyle
a3690071b4
Warm up GPu
2018-03-22 18:05:20 -04:00
Peter Boyle
299d119013
GPU work allocation improved
2018-03-22 18:04:24 -04:00
Peter Boyle
55be842d23
Dont force l1p.h so early
2018-03-22 18:01:43 -04:00
Daniel Richtmann
edbc0d49d7
WilsonMG: Get rid of explicit GridTypeMappers in CoarsenedMatrix
2018-03-22 16:38:24 +01:00
e9323460c7
Merge branch 'develop' into feature/hadrons
2018-03-22 10:48:37 +00:00
20e186a1e0
Merge pull request #158 from goracle/dev-pull
...
Make compilation faster by moving print of git hash.
2018-03-22 10:45:17 +00:00
Peter Boyle
6ef4af989b
Merge pull request #159 from goracle/dev-precsafe
...
Add dimension check to precisionChange.
2018-03-22 10:41:53 +00:00
Dan H
ccde8b817f
Add dimension check to precisionChange.
2018-03-21 20:58:04 -04:00
Dan H
68168bf72d
Revert "Add dimension match check to precisionChange."
...
This reverts commit 8f601d9b39 .
2018-03-21 20:51:38 -04:00
Dan H
e93d0feaa7
Merge branch 'dev-pull' of github.com:goracle/Grid into dev-pull
2018-03-21 20:39:30 -04:00
Dan H
8f601d9b39
Add dimension match check to precisionChange.
2018-03-21 20:38:19 -04:00
paboyle
5436308e4a
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-03-21 14:26:29 +00:00
paboyle
07fe7d0cbe
Save file in current dir; print checksums
2018-03-21 14:26:04 +00:00
Guido Cossu
60b57706c4
Small bug fix in the shm file names
2018-03-21 13:57:30 +00:00
James Harrison
58c2f60b69
Merge branch 'feature/hadrons' into feature/qed-fvol
2018-03-20 20:19:18 +00:00
James Harrison
bfa3a7b3b0
Merge branch 'feature/hadrons' into feature/qed-fvol
...
# Conflicts:
# extras/Hadrons/Modules.hpp
# extras/Hadrons/Modules/MGauge/StochEm.cc
# extras/Hadrons/modules.inc
2018-03-20 20:17:59 +00:00
paboyle
954e38bebe
Put a username in the path
2018-03-20 18:16:15 +00:00
paboyle
b1a38bde7a
Extra test for Gparity with plaquette action
2018-03-20 18:01:32 +00:00
Peter Boyle
9875c446c6
Clean up pragmas
2018-03-20 07:19:17 -04:00
Peter Boyle
9c25eb35ca
Eigen develop branch for now
2018-03-20 07:18:56 -04:00
Peter Boyle
5ac96dbdc6
Warm behaviour in SU3 benchmark
2018-03-20 07:18:31 -04:00
Peter Boyle
5cc9aca85d
Use 64bit index for looping
2018-03-20 06:34:52 -04:00
Peter Boyle
ac29ebcb95
Clean up debug prints
2018-03-20 06:33:59 -04:00
Guido Cossu
2581875edc
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-03-19 18:00:08 +00:00
Guido Cossu
f212b0a963
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons
2018-03-19 17:57:13 +00:00
Guido Cossu
62702dbcb8
Fixing bug in the Point sink causing NaNs
2018-03-19 17:56:53 +00:00
41d6cab033
Merge branch 'develop' into feature/hadrons
2018-03-19 13:30:21 +00:00
5a31e747c9
Merge commit 'd5ce66f6ab2c44a12def7b6d26df80d6e646b1fb' into feature/hadrons
2018-03-19 13:19:09 +00:00
cbc73a3fd1
Hadrons: CG guesser fix
2018-03-19 13:11:38 +00:00
Peter Boyle
a5cfb89304
Update eigen process direct from develop on github. Dangerous, but needed from GPU
2018-03-19 07:20:48 -04:00
Peter Boyle
f04a7251cc
Gpu welcome message and device info
2018-03-19 07:12:12 -04:00
Peter Boyle
d4ce7d9905
GPU friendly Stencil needs a view
2018-03-19 07:11:21 -04:00
Peter Boyle
8a1d303ab9
GPU friendly stencil improvements
2018-03-19 07:11:03 -04:00
Peter Boyle
bf0a4de919
GPU friendly params object
2018-03-19 07:10:12 -04:00
Peter Boyle
6fe5885fe4
Warning suppress
2018-03-19 07:09:49 -04:00
Peter Boyle
17ac309e84
Fix the compile
2018-03-19 07:08:59 -04:00
Peter Boyle
7467a1c027
Latest eigen needed for GPU
2018-03-19 07:08:10 -04:00
Peter Boyle
fdfb8a26a8
Disable eigen vectorisation on GPU because of Summit compile issues
2018-03-19 07:07:30 -04:00
paboyle
2df4e422ad
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
2018-03-18 14:45:41 +00:00
paboyle
3a3e3cac40
Pull the trigger on offload
2018-03-18 14:45:29 +00:00
paboyle
b1c02ec310
MallocManaged in GPU
2018-03-18 14:44:46 +00:00
paboyle
38eadee2c9
Prettier code
2018-03-18 14:44:22 +00:00
paboyle
42c70437be
Views
2018-03-18 14:43:47 +00:00
paboyle
65274b4d7f
Tidy up
2018-03-18 14:43:16 +00:00
Peter Boyle
6c6d43eb4e
Drop RB on coarse space ; that was a mistake
2018-03-17 09:35:01 +00:00
Peter Boyle
e1dcfd3553
typo fix
2018-03-16 23:10:47 +00:00
Peter Boyle
888838473a
4GB clean the offsets in parallel IO for multifile records
2018-03-16 21:54:56 +00:00
Peter Boyle
01568b0e62
Add a new SHM option
2018-03-16 21:54:28 +00:00
Peter Boyle
d5ce66f6ab
Extra SHM option
2018-03-16 21:37:03 +00:00
Guido Cossu
d86936a3de
Eliminating deprecated lex_sites
2018-03-16 12:26:39 +00:00
Daniel Richtmann
ee5cf6c8c5
WilsonMG: Some minor changes to GMRES implementations
2018-03-16 13:10:45 +01:00
d516938707
Hadrons: eigen packs I/O and deflation interface
2018-03-14 14:55:47 +00:00
Peter Boyle
7e8be32755
Typo fix
2018-03-13 19:22:31 -04:00
72344d1418
Hadrons: change default Schur convention to DiagTwo
2018-03-13 17:10:54 +00:00
7ecf6ab38b
Merge branch 'develop' into feature/hadrons
2018-03-13 16:11:59 +00:00
2d4d70d3ec
Hadrons: LCL fixes
2018-03-13 16:10:36 +00:00
78f8d47528
Hadrons: environment access to derived objects
2018-03-13 16:10:16 +00:00
b85f987b0b
Hadrons: error message channel verbose during profiling
2018-03-13 16:09:22 +00:00
f57afe2079
Hadrons: much cleaner eigenpack implementation, to be tested
2018-03-13 13:51:09 +00:00
Dan H
0fb84fa34b
Make compilation faster by moving print of git hash.
2018-03-12 17:03:48 -04:00
Vera Guelpers
8462bbfe63
Gamma input for meson contraction with round brackets
2018-03-12 18:02:12 +00:00
229977c955
Hadrons: minor memory fix for ShiftProbe module
2018-03-09 21:56:27 +00:00
e485a07133
Hadrons: garbage collector debug output
2018-03-09 21:56:01 +00:00
paboyle
0880747edb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-03-09 20:44:42 +00:00
paboyle
b801e1fcd6
fclose should be called through a call to close()
2018-03-09 20:44:10 +00:00
paboyle
ff761ea4e6
Bound check improvement
2018-03-09 20:00:46 +00:00
70ec2faa98
Hadrons: maximum iteration specified for tests and error if 0
2018-03-09 19:53:55 +00:00
paboyle
a31d3e60d8
Better bounds check
2018-03-09 18:10:21 +00:00
Daniel Richtmann
a66cecc509
WilsonMG: Fix invalid call to MR ctor
2018-03-09 17:34:29 +01:00
Daniel Richtmann
0f6cdf3d4b
WilsonMG: Implement missing parts of CoarsenedMatrix
2018-03-09 16:56:16 +01:00
Daniel Richtmann
1e63b73a14
WilsonMG: Some cleanup/formatting
2018-03-09 16:50:19 +01:00
2f849ee252
declaration fix
2018-03-08 23:34:00 +00:00
bb6ed44339
Merge branch 'develop' into feature/hadrons
2018-03-08 23:09:28 +00:00
paboyle
4d60b92b7f
Update oSites
2018-03-08 21:00:25 +00:00
360cface33
Grid tensor serialisation fully implemented and tested
2018-03-08 19:12:03 +00:00
Azusa Yamaguchi
80302e95a8
MILC Interface
2018-03-08 15:34:03 +00:00
paboyle
c159c70c84
View introduced
2018-03-08 14:58:04 +00:00
paboyle
28b5572755
Merge branch 'feature/gpu-port' of https://github.com/paboyle/Grid into feature/gpu-port
2018-03-08 13:01:42 +00:00
paboyle
5fac7080bc
Adding -fno-strict-aliasing by default
2018-03-08 13:01:24 +00:00
Peter Boyle
4548523ecc
This modification eliminates what looks like a compiler bug
...
on Intel 2017.
2018-03-08 04:41:16 -08:00
caf2f6b274
Merge branch 'develop' of github.com:paboyle/Grid into develop
2018-03-08 09:52:25 +00:00
c49be8988b
Grid tensor serialisation
2018-03-08 09:51:22 +00:00
971c2379bd
std::vector to tensor conversion + test units
2018-03-08 09:50:39 +00:00
Guido Cossu
94b0d66e4c
Merge pull request #157 from goracle/dev-pull
...
Add print of the current git hash on Grid init.
2018-03-08 16:09:28 +09:00
Dan H
5e8af396fd
Add print of the current git hash on Grid init.
2018-03-07 13:11:51 -05:00
paboyle
4154fc6f44
Revert a change
2018-03-07 16:54:11 +00:00
paboyle
4e3458516a
Reverting after fixing issue with extract merge
2018-03-07 16:50:13 +00:00
Peter Boyle
90a2efb9b3
Hit an annoying strict alias optimisation in GCC 4.9 through 6.3
...
Chris K was correct. It appears that an additional memcpy (UGHHH) is enough
to suppress the compiler
2018-03-07 07:27:26 -08:00
9942723189
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/serialisation/BaseIO.h
2018-03-07 15:22:16 +00:00
a7d19dbb64
Merge branch 'develop' of github.com:paboyle/Grid into develop
...
# Conflicts:
# lib/serialisation/BaseIO.h
2018-03-07 15:13:54 +00:00
90dbe03e17
Conversion of Grid tensors to std::vector made more elegant, also pair syntax changed to (x y) to avoid issues with JSON/XML
2018-03-07 15:12:32 +00:00
8b14096990
Conversion of Grid tensors to std::vector made more elegant, also pair syntax changed to (x y) to avoid issues with JSON/XML
2018-03-07 15:12:18 +00:00
Azusa Yamaguchi
b938202081
Overlapped Comm for Wilson DhopInternal
2018-03-07 14:08:43 +00:00
e79ef469ac
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/serialisation/BaseIO.h
2018-03-06 19:25:51 +00:00
485c5db0fe
conversion of Grid tensors to nested std::vector in preparation for tensor serialisation
2018-03-06 19:22:03 +00:00
James Harrison
c793947209
Add overloaded Photon constructors, with default parameters for IR improvements and infinite-volume G(x=0).
2018-03-06 16:27:26 +00:00
paboyle
40699221e2
Dont alias lhs and rhs in a where statement
2018-03-06 04:14:13 -08:00
paboyle
3cb1b545d0
Don't alias the variables with a where statement.
2018-03-06 04:13:26 -08:00
3e9ee053a1
Merge branch 'develop' into feature/hadrons
2018-03-05 20:01:38 +00:00
dda6c69d5b
Hadrons: scalar SU(N) shift probes
2018-03-05 20:00:29 +00:00
cd51b9af99
Torture yourself with namespace lookup 101
2018-03-05 19:58:13 +00:00
paboyle
e199ba7e88
Fix the Charge conjugate BC's
2018-03-05 13:59:02 +00:00
paboyle
c399c2b44d
Guido broke the charge conjugate plaquette action with premature optimisation.
...
This sector of the code does not matter for anything other than Guido's quenched HMC
studies, and any plaq specific optimisations should be retained in a private branch
instead of destroying the code simplicity.
2018-03-05 12:55:41 +00:00
paboyle
af7de7a294
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-03-05 12:22:41 +00:00
paboyle
1dc86efd26
Finalize protection
2018-03-05 12:22:18 +00:00
paboyle
4d53703c67
Scalar type differeing allowed, eg. precisoin change
2018-03-05 11:39:52 +00:00
paboyle
d506c59efa
Warnings disabled
2018-03-05 11:39:20 +00:00
paboyle
44188a5c6f
AVX512 fix
2018-03-05 00:32:24 +00:00
paboyle
2018077770
Make NVCC happy with the compile. This is warning free on 9.1 on my laptop (both make and make tests).
2018-03-05 00:28:24 +00:00
paboyle
984e06e2b5
Introduce view objects that can safely be copied to GPU for access
2018-03-04 16:40:11 +00:00
paboyle
aead94e9a7
View introduced
2018-03-04 16:39:29 +00:00
paboyle
3277bda130
View introduction to prepare for accelerator offload.
...
Probably same problem exists for stencil object
2018-03-04 16:38:08 +00:00
paboyle
442b0b406c
View related changes
2018-03-04 16:34:14 +00:00
paboyle
8824a54269
View related changes
2018-03-04 16:33:33 +00:00
paboyle
c03423250f
Indexable changes
2018-03-04 16:31:35 +00:00
paboyle
317fd0da44
Views introduced. Need to accelerator offload these routines.
2018-03-04 16:30:45 +00:00
paboyle
783795a44a
Views introduced
2018-03-04 16:12:49 +00:00
paboyle
0e6197fbed
Introduce accelerator friendly expression template rewrite.
...
Must obtain and access lattice indexing through a view object that is safe
to copy construct in copy to GPU (without copying the lattice).
2018-03-04 16:03:19 +00:00
paboyle
dad7862f91
Go through a view object that can be copied to GPU
2018-03-04 16:02:02 +00:00
paboyle
c89a883448
where was deprecated and integrated to ET engine a long time ago. Remove dead old original code
2018-03-04 15:58:02 +00:00
paboyle
c204288fbc
Remove a couple of print statements
2018-03-04 15:57:15 +00:00
paboyle
ad739f042a
Introduce views for passing lattice indexing to accelerators.
2018-03-04 15:56:14 +00:00
paboyle
db988301d0
Introduce view objects for indexing lattices. Used to pass the view to acccelerators
2018-03-04 15:55:16 +00:00
paboyle
9b1f29c4c2
Support a view for passing to accelerator
2018-03-04 15:54:35 +00:00
paboyle
e5ea04ee0c
Need to support precision change, and real replication in multiple simd lanes
2018-03-04 15:53:04 +00:00
paboyle
c92a3c6068
Need to support any vector type template and run on accelerator
2018-03-04 15:52:14 +00:00
paboyle
03f8da8fbc
enable-debug option for debug flags in compile
2018-03-04 15:51:47 +00:00
f32555dcc5
Merge branch 'develop' into feature/hadrons
2018-03-03 15:31:52 +00:00
30391cb2eb
Merge pull request #155 from fionnoh/develop
...
Some changes needed for deflation interface
2018-03-03 13:43:59 +00:00
e93c883470
Hadrons: basic GraphViz visualisation
2018-03-03 13:42:36 +00:00
Fionn O hOgain
2e88408f5c
Some changes needed for deflation interface
2018-03-02 22:27:41 +00:00
fcac5c0772
Hadrons: scalar SU(N) fixes
2018-03-02 19:20:23 +00:00
90f4000935
Hadrons: scheduler debug less verbose
2018-03-02 19:20:01 +00:00
480708b9a0
Hadrons: safer error handling for HadronsXmlRun
2018-03-02 19:19:37 +00:00
c4baf876d4
Hadrons: graph consistency check
2018-03-02 18:40:18 +00:00
2f4dac3531
Hadrons: legal update
2018-03-02 18:10:58 +00:00
3ec6890850
Merge branch 'feature/hadrons' of github.com:paboyle/Grid into feature/hadrons
2018-03-02 17:56:08 +00:00
018801d973
Hadrons: legal update
2018-03-02 17:56:00 +00:00
1d83521daa
Hadrons: scalar SU(N) EMT
2018-03-02 17:55:18 +00:00
fc5670c6a4
Merge pull request #151 from guelpers/feature/hadrons
...
Feature/hadrons
2018-03-02 17:54:43 +00:00
d9c435e282
Hadrons: Scalar SU(N) transverse projection module
2018-03-02 17:35:12 +00:00
614a0e8277
Hadrons: Scalar SU(N) utility functions
2018-03-02 17:34:23 +00:00
Vera Guelpers
aaf39222c3
update my fork and fixed conflicts
2018-03-02 17:08:08 +00:00
550142bd6a
Hadrons: more code cleaning
2018-03-02 14:30:45 +00:00
c0a929aef7
Hadrons: code cleaning
2018-03-02 14:29:54 +00:00
37fe944224
Hadrons: scalar kinetic term
2018-03-02 14:14:11 +00:00
Vera Guelpers
315a42843f
changes requested for the pull request
2018-03-02 11:47:38 +00:00
83a101db83
Hadrons: more LCL fixes
2018-03-02 11:05:02 +00:00
c4274e1660
Hadrons: LCL cleaning
2018-03-02 10:18:33 +00:00
ba6db55cb0
Hadrons: reverse last commit
2018-03-01 23:30:58 +00:00
e5ea84d531
Hadrons: LCL: orthogonalise coarse evec
2018-03-01 19:33:11 +00:00
15767a1491
Hadrons: LCL fine convergence test
2018-03-01 18:04:08 +00:00
4d2a32ae7a
Hadrons: z-Mobius message fix
2018-03-01 18:03:44 +00:00
5b937e3644
Hadrons: VM memory profiling fix
2018-03-01 17:28:38 +00:00
e418b044f7
Hadrons: code cleaning
2018-03-01 12:57:28 +00:00
b8b05f143f
Hadrons: Lanczos more conservative type names
2018-03-01 12:53:16 +00:00
6ec42b4b82
LCL: external storage fix
2018-03-01 12:27:29 +00:00
abb7d4d2f5
Hadrons: z-Mobius action
2018-02-27 19:32:19 +00:00
16ebbfff29
Hadrons: Schur convention globally defined through a macro
2018-02-27 18:45:23 +00:00
4828226095
Hadrons: prettier log
2018-02-27 14:43:51 +00:00
8a049f27b8
Hadrons: Lanczos code improvement
2018-02-27 13:46:59 +00:00
43578a3eb4
Hadrons: copyright update
2018-02-26 19:24:19 +00:00
fdbd42e542
Hadrons: first implementation of local coherence Lanczos
2018-02-26 19:22:43 +00:00
e7e4cee4f3
Merge branch 'develop' into feature/hadrons
2018-02-26 15:05:05 +00:00
paboyle
78a9e31ff0
options more obvious
2018-02-24 22:26:32 +00:00
paboyle
c1fc947bb8
Coordinate handling GPU friendly + some GPU merge/extract improvements
2018-02-24 22:26:10 +00:00
paboyle
ff7b19a71b
Coordinate handling GPU ready avoid malloc
2018-02-24 22:25:39 +00:00
paboyle
1c16ffa1c1
Coordinate GPU ready. No malloc
2018-02-24 22:25:09 +00:00
paboyle
4962f59477
Eliminate both GPU issue and threading bottle neck by avoiding malloc in coordinate handling
2018-02-24 22:24:37 +00:00
paboyle
e158b60bce
GPU friendly coords
2018-02-24 22:23:47 +00:00
paboyle
34820bec27
Coordinate handling GPU ready. No malloc
2018-02-24 22:23:18 +00:00
paboyle
eed9aa9f0c
Extract merge gpu ready
2018-02-24 22:23:01 +00:00
paboyle
8792ff6439
Coordinate handling gpu ready
2018-02-24 22:22:43 +00:00
paboyle
078901278c
Coordinate handling gpu friendly
2018-02-24 22:22:02 +00:00
paboyle
bf5fb89aff
Coordinate handling GPU friendly
2018-02-24 22:21:36 +00:00
paboyle
7574c18cef
Massive clean up extract merge.
...
Simpler and GPU friendly
2018-02-24 22:21:08 +00:00
paboyle
36ea5f6b77
gpu friendly coordinates ; no std::vector on GPU
2018-02-24 22:20:14 +00:00
paboyle
285deab432
Coordinate handling GPU friendly. Avoid std::vector
2018-02-24 22:19:28 +00:00
paboyle
bb7d87d0a0
Coordinate handling gpu friendly
2018-02-24 22:18:33 +00:00
James Harrison
ec3954ff5f
QedFVol: Add input parameter G(x=0) for infinite-volume photon
2018-02-23 14:53:05 +00:00
Azusa Yamaguchi
0f468e2179
OverlappedComm for Staggered 5D and 4D.
2018-02-22 12:50:09 +00:00
James Harrison
8e61286741
Merge branch 'develop' into feature/qed-fvol
2018-02-20 15:33:35 +00:00
paboyle
4790e99817
Extra communicator free that I had missed.
...
Hard to audit them all as this is complex
2018-02-20 15:12:31 +00:00
paboyle
2dd63aa7a4
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-02-20 14:29:26 +00:00
paboyle
559a501140
Deflation interface for solvers
2018-02-20 14:29:08 +00:00
paboyle
945684c470
updates for deflation in the RB solver
2018-02-20 14:28:38 +00:00
Christopher Kelly
e30a80a234
Relaxed constraints on MPI thread mode when not using multiple comms threads
2018-02-15 17:13:36 +00:00
James Harrison
69e4ecc1d2
QedFVol: Fix single precision build error
2018-02-14 17:37:18 +00:00
James Harrison
5f483df16b
Merge branch 'develop' into feature/qed-fvol
2018-02-14 16:35:04 +00:00
James Harrison
4680a977c3
QedFVol: set infinite-volume photon propagator to 1 at x=0,
...
so that momentum-spage photon propagator is non-negative.
Need to check whether this is sufficient for all volumes.
2018-02-14 16:30:09 +00:00
Vera Guelpers
de42456171
updated my fork and conflicts fixed
2018-02-14 13:57:56 +00:00
Vera Guelpers
d55212c998
restructure SeqConservedCurrent for DWF to need less memory
2018-02-14 10:45:18 +00:00
paboyle
c96483e3bd
Whitespace only change
2018-02-13 11:39:07 +00:00
Vera Guelpers
c6e1f64573
Test for QED
2018-02-13 09:30:23 +00:00
paboyle
ae31a6a760
Move deflate to right class
2018-02-13 02:11:37 +00:00
paboyle
dd8f2a64fe
INterface to suit hadrons on Lanczos
2018-02-13 02:08:49 +00:00
James Harrison
724cf02d4a
QedFVol: Implement infinite-volume photon
2018-02-12 17:18:10 +00:00
paboyle
7b8b2731e7
Conj error for complex coeffs
2018-02-12 16:06:31 +00:00
paboyle
237a8ec918
Communicator leak fixed (I think)
2018-02-12 13:27:20 +00:00
Vera Guelpers
49a0ae73eb
Insertion of photon field in seqential conserved current
2018-02-12 09:36:08 +00:00
Daniel Richtmann
6ab60c5b70
Merge remote-tracking branch 'upstream/develop' into feature/wilsonmg
2018-02-08 23:59:07 +01:00
Daniel Richtmann
8c692b7ffd
WilsonMG: Comment assertion on hermiticity of coarse operator for now
...
TODO: Think of a way to not break dwf_hdcr by doing that. It's only an assertion
but it still interferes with it.
2018-02-08 23:55:05 +01:00
Daniel Richtmann
2976132bdd
Add first version of multigrid for wilson clover analogous to wilson one
...
Just like the wilson one, this algorithm
• is currently only a 2-level method since I don't have correct implementations
for Mdir and Mdiag in CoarsenedMatrix yet (needed for further coarsening)
• needs levelization and refactoring into a proper algorithm
2018-02-08 23:52:10 +01:00
Daniel Richtmann
48177f2f2d
Add tests for all MR|GMRES solvers with wilson clover action
2018-02-08 23:52:09 +01:00
Daniel Richtmann
c4ce70a821
WilsonMG: Major cleanup
2018-02-08 23:52:08 +01:00
James Harrison
315f1146cd
QedFVol: Fix output of VPCounterTerms module.
2018-02-08 20:40:45 +00:00
Daniel Richtmann
a3e009ba54
Add tests for CAGMRES solvers with staggered action
2018-02-08 17:46:28 +01:00
Daniel Richtmann
eb7cf239d9
Print warning messages in CAGMRES solvers
...
Currently, the implementation of these algorithms doesn't differ from their non
communication-avoiding versions.
2018-02-08 17:43:47 +01:00
Daniel Richtmann
13ae371ef8
Make solver parameters match in all MR|GMRES solver tests
2018-02-08 17:33:10 +01:00
Daniel Richtmann
9f79a87102
Fix bugs in Flexible GMRES solvers
...
Somehow I got the left and right-preconditioned versions of GMRES mixed up. As
of now this is right-preconditioned version, which is what we want.
2018-02-08 16:00:31 +01:00
Daniel Richtmann
4ded1ceeb0
Make GMRES solvers perform no more than MaxIterations steps
...
I noticed that it was possible to overrun this number.
2018-02-08 15:29:44 +01:00
James Harrison
9f202782c5
QedFVol: Change format of scalar VP output files, and save diagrams without charge factors for consistency with ChargedProp module.
2018-02-07 20:31:50 +00:00
Daniel Richtmann
8bc12e0ce1
Remove superfluous comments in MR solver
2018-02-07 18:09:09 +01:00
Daniel Richtmann
cc2f00f827
Remove test for MR solver with dwf action as it doesn't converge
2018-02-07 18:09:08 +01:00
Daniel Richtmann
cd61e2e6d6
Increase max iterations in test of MR solver with staggered action
2018-02-07 18:09:07 +01:00
Daniel Richtmann
323ed1a588
Add an overrelaxation parameter to the MR solver
2018-02-07 18:09:06 +01:00
Daniel Richtmann
68c66d2e4b
Remove empty line in output of *Residual* solvers
2018-02-07 18:08:56 +01:00
Daniel Richtmann
1671adfd49
WilsonMG: Add some tests for linear operators
2018-02-07 17:15:22 +01:00
James Harrison
594a262dcc
QedFVol: Remove redundant file Communicator_mpi.cc
2018-02-07 11:37:01 +00:00
James Harrison
7f8ca54285
Merge branch 'develop' into feature/qed-fvol
2018-02-07 10:11:00 +00:00
James Harrison
c5b23c367e
QedFVol: Fix segmentation fault when multiple propagator modules are used.
2018-02-05 11:46:33 +00:00
Vera Guelpers
b6fe03eb26
BugFix: Now the stochatic EM potential weight is generated when calling for the first time
2018-02-02 15:29:38 +00:00
James Harrison
f37ed4958b
Implement IR improvement, with coefficients set in input file.
2018-02-02 11:56:51 +00:00
paboyle
b9b5bdfc3a
Proper offload (accelerator access) will require a mutable copy lambda.
2018-02-02 11:38:19 +00:00
paboyle
51eb2c5dfc
Make referencign the stencil and all info required to evaluate the kernel
...
accelerator marked up
2018-02-02 11:37:13 +00:00
paboyle
ede0dff794
Mark up as an accelerator function
2018-02-02 11:36:44 +00:00
paboyle
aa6de818e2
Copy data needed by Kernels out of the grid object to avoid host reference
2018-02-02 11:36:11 +00:00
paboyle
dcf6517a93
Accelerator offload and copy Opt into the kernel for GPU host var safety
2018-02-02 11:35:35 +00:00
paboyle
a308dff410
accelerator loop, copy Opt into the GPU
2018-02-02 11:34:37 +00:00
paboyle
14ba20898a
Accelerator loop the key kernel call
2018-02-02 11:30:07 +00:00
paboyle
a53d3ee19a
Add Opt to the lambda capture to get it into the GPU
2018-02-02 11:28:39 +00:00
paboyle
5df435319d
Use constexpr
2018-02-02 11:27:56 +00:00
paboyle
0da2d3e222
accelerator off load some more stuff
2018-02-02 11:27:35 +00:00
paboyle
9c9dfbfa78
Force accelerator
2018-02-02 11:25:09 +00:00
paboyle
e4df025d01
Accelerator related
2018-02-01 23:20:05 +00:00
paboyle
cfeda9d536
constexpr on const ints
2018-02-01 22:59:12 +00:00
paboyle
4450b1993a
Offload
2018-02-01 22:45:47 +00:00
paboyle
d03ce5c2a4
Provide a way to get around std::vector for a known type on device.
...
Use template specialisation to access a private member in the Clang++ STL implementation
2018-02-01 22:44:25 +00:00
paboyle
7d6522c1ef
Accelerator inline
2018-02-01 22:43:56 +00:00
paboyle
b96832a922
Accelerator inline
2018-02-01 22:43:26 +00:00
paboyle
5d7af47b05
accelerator_inline
2018-02-01 22:42:54 +00:00
paboyle
053ef25c90
constexpr makes GPU happy
2018-02-01 22:42:29 +00:00
paboyle
8ae77d3706
Small simplification of FermionOperatorImpl towards GPU but not there yet
2018-02-01 22:41:54 +00:00
Peter Boyle
896f3a8002
Fix to MPI for Hokusai system
2018-02-01 18:51:51 +00:00
James Harrison
5f85473d6b
QedFVol: Move Projection class into Result class
2018-02-01 16:16:13 +00:00
Daniel Richtmann
871649238c
WilsonMG: Stricter naming for linear operators
2018-02-01 14:43:08 +01:00
James Harrison
ac3b0ebc58
QedFVol: New structure for ChargedProp output files
2018-02-01 12:31:32 +00:00
Daniel Richtmann
7c86d2085b
WilsonMG: Some minor cleanup
2018-02-01 12:24:16 +01:00
Daniel Richtmann
9292be0b69
WilsonMG: Add check for Mdiag + Σ Mdir == M
...
Need to test my implementations of CoarsenedMatrix::Mdiag &
CoarsenedMatrix::Mdir.
2018-01-31 14:03:30 +01:00
Guido Cossu
f0fcdf75b5
Update README.md
2018-01-30 12:44:20 +01:00
Guido Cossu
53bffb83d4
Updating README with new SKL target
2018-01-30 12:42:36 +01:00
Daniel Richtmann
10141f90c9
WilsonMG: Rename test file
2018-01-30 10:25:09 +01:00
Guido Cossu
cd44e851f1
Fixing compilation error in FundtoHirep
2018-01-30 06:04:30 +01:00
Daniel Richtmann
a414430817
Merge remote-tracking branch 'upstream/develop' into feature/ddalphaamg
2018-01-29 18:32:31 +01:00
Daniel Richtmann
f20728baa9
WilsonMG: Some further steps towards a three level method
...
Currently this is very "manual" as we are still testing stuff. Will refactor
and make it an algorithm once everything works.
What currently does work:
- All tests in MultiGridPreconditioner::runChecks for the first coarse grid
- The tests for the intergrid operators going from the first to the second
coarse grid
- (1 - P R) v == 0
- (1 - R P) v_c == 0
- A full solve with VPGCR and a two-level MG preconditioner
What hinders the rest of the tests from passing with a three-level method is the
absence of implementations of CoarsenedMatrix::Mdir and CoarsenedMatrix::Mdiag.
2018-01-29 18:29:49 +01:00
Daniel Richtmann
d2e68c4355
WilsonMG: Perform some minor cleanup
2018-01-29 18:07:10 +01:00
Daniel Richtmann
1cb745c8dc
Add function to return full type as std::string
...
Also works for nested templates. I find it useful for debugging.
Possible usage:
std::cout << "getTypename<AType>() = " << getTypename<Atype>() << std::endl;
std::cout << "getTypename<decltype(AnInstance)>() = " << getTypename<decltype(AnInstance)>() << std::endl;
2018-01-29 17:39:19 +01:00
Daniel Richtmann
faf4278019
Use 2 passes of GS in coarse operator construction
2018-01-29 17:21:42 +01:00
Daniel Richtmann
194e4b94bb
Make MG checking function work level-wise
2018-01-29 17:18:20 +01:00
Daniel Richtmann
bfc1411c1f
Use more iterations in subspace creation
2018-01-29 17:11:29 +01:00
Daniel Richtmann
161637e573
Turn on orthogonality checking temporarily
2018-01-29 17:10:05 +01:00
paboyle
79b50feacf
fixme updates
2018-01-29 16:00:40 +00:00
Guido Cossu
fb24e3a7d2
Adding utilities for perf profiling
2018-01-29 11:11:45 +01:00
Guido Cossu
655a69259a
Added support for GCC compilation for Skylake AVX512
2018-01-28 17:02:46 +01:00
paboyle
c67c1544cd
abs no compile on travis fix attempt
2018-01-28 10:26:04 +00:00
paboyle
e657f9a344
OMP collapse changes to make NVCC happy
2018-01-28 01:21:53 +00:00
paboyle
b6ebf35af5
Intel compiler doesn't like Nvidia error disable pragmas
2018-01-28 01:03:10 +00:00
paboyle
604c05f4b8
parallel_for elimination -> thread_loop
2018-01-28 01:01:36 +00:00
paboyle
70e276e1ab
parallel_for elimination -> thread_loop
2018-01-28 01:01:14 +00:00
paboyle
9472b02771
Parallel_for elimination -> thread_loop.
2018-01-28 01:00:55 +00:00
paboyle
9597ab94eb
Zero changes, swap on lattice type.
2018-01-27 23:51:40 +00:00
paboyle
ce4da83bc2
Zero changes, literally
2018-01-27 23:51:10 +00:00
paboyle
d557f3ef77
Zero changes (literally) and also a warning elimination
2018-01-27 23:50:43 +00:00
paboyle
f574c20118
Zero changes, __VA_ARGS__ and swap
2018-01-27 23:50:17 +00:00
paboyle
f102897385
VA_ARGS to make comma safe automatic
2018-01-27 23:49:47 +00:00
paboyle
d6fce3e498
Zero changes, literally
2018-01-27 23:48:01 +00:00
paboyle
2d0bcc2606
Zero changes, acceleartor on kernels and some thread loop changes
2018-01-27 23:47:38 +00:00
paboyle
45df59720e
Zero changes and VA_ARGS changes
2018-01-27 23:46:58 +00:00
paboyle
44ef5bc207
Zero changes (literally speaking).
2018-01-27 23:46:28 +00:00
paboyle
98af36217a
Zero changes. (I mean literally)
2018-01-27 23:46:02 +00:00
James Harrison
4e0cf0cc28
QedFVol: Fix bug in ScalarVP.cc due to double use of temporary object. Still getting mpi3 errors when configured with enable-comms=mpi[-auto].
2018-01-27 15:15:25 +00:00
Guido Cossu
507c4e9efc
Correcting an missing semicolumn in avx512
2018-01-27 10:59:55 +01:00
paboyle
be7b37b9c9
Mistake on openmp
2018-01-27 00:05:11 +00:00
paboyle
c4f82e072b
_grid becomes private ; use Grid()§
2018-01-27 00:04:12 +00:00
paboyle
3f9654e397
Hiding internals
2018-01-26 23:09:03 +00:00
paboyle
912b50f6fa
Hiding lattice internals
2018-01-26 23:08:45 +00:00
paboyle
2a4a0e43c1
Hide internals
2018-01-26 23:08:27 +00:00
paboyle
32523a229c
Hide internals
2018-01-26 23:08:02 +00:00
paboyle
1ebd56c3fb
Hide internal data
2018-01-26 23:07:34 +00:00
paboyle
8dccffdfd5
Hide internal data
2018-01-26 23:06:51 +00:00
paboyle
5642ea270f
Hide internal data
2018-01-26 23:06:28 +00:00
paboyle
43cea62855
Hide internal data
2018-01-26 23:06:03 +00:00
paboyle
2b4067bb71
Hide internal data
2018-01-26 23:05:32 +00:00
paboyle
85771e97e9
Hide internal data
2018-01-26 23:04:46 +00:00
paboyle
8b371ffa94
Hide internal data
2018-01-26 23:03:54 +00:00
paboyle
bf659dfd92
Hide the ._odata
2018-01-26 22:27:47 +00:00
James Harrison
cdf550845f
QedFVol: Fix bugs in StochEm.cc and ChargedProp.cc (still only works without MPI).
2018-01-26 21:25:20 +00:00
James Harrison
3db7a5387b
BROKEN: Adapted scalarVP, UnitEm and VPCounterTerms modules to new Hadrons. Currently getting an assertion error from Communicator_mpi3.cc when I try to run.
2018-01-26 16:33:48 +00:00
paboyle
76a4dd36d9
Fix no compile of test serialisation
2018-01-26 00:13:21 +00:00
paboyle
f4010023ca
Warning fixes
2018-01-25 23:46:47 +00:00
paboyle
24a4589def
Changes to interface a little
2018-01-25 23:37:34 +00:00
paboyle
c904822e74
Warning removal
2018-01-25 23:37:15 +00:00
paboyle
40ee1e1957
Zero()
2018-01-25 23:36:58 +00:00
paboyle
461df78a3f
Better to use Zero(), and not zero static data
2018-01-25 23:36:22 +00:00
paboyle
db9c9475d4
const
2018-01-25 23:36:06 +00:00
paboyle
214f7a6f13
Drop std::vector container for the lattice data
2018-01-25 23:35:04 +00:00
paboyle
c844cfcda8
Remove commAllocator; make more simple; option to switch off the pointer caceh
2018-01-25 23:33:57 +00:00
paboyle
a3e3034e6f
Host compile
2018-01-25 23:33:00 +00:00
paboyle
e7cba358c2
Temporary update to reflect the new dropping of std::vector in Lattice
...
Will update again to hide the internals in an interface
2018-01-25 23:31:41 +00:00
Guido Cossu
f8a5194c70
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-01-25 13:46:37 +01:00
Guido Cossu
cff3bae155
Adding support for general Nc in the benchmark outputs
2018-01-25 13:46:31 +01:00
James Harrison
90dffc73c8
Merge branch 'feature/hadrons' into feature/qed-fvol
...
# Conflicts:
# extras/Hadrons/Modules.hpp
# extras/Hadrons/Modules/MGauge/StochEm.cc
# extras/Hadrons/Modules/MScalar/ChargedProp.cc
# extras/Hadrons/Modules/MScalar/ChargedProp.hpp
# extras/Hadrons/modules.inc
# lib/communicator/Communicator_mpi.cc
2018-01-24 16:41:44 +00:00
paboyle
99329197ee
Rename header to .h
2018-01-24 14:10:09 +00:00
paboyle
421401af55
Remove IMCI as really don't support
2018-01-24 13:53:21 +00:00
paboyle
0626c1e39e
Accelerator flaggina dn thrust complex for NVCC
2018-01-24 13:50:41 +00:00
paboyle
725f03e2e2
Accelerator markup and thrust complex on nvcc
2018-01-24 13:50:10 +00:00
paboyle
65f77112e0
Thread loops done properly
2018-01-24 13:49:39 +00:00
paboyle
408b868475
Generic for GPU needs accelerator markup of functions
2018-01-24 13:49:12 +00:00
paboyle
1c797deb04
Accelerator tweaks
2018-01-24 13:43:43 +00:00
paboyle
b9d5a42b57
Should be able to eliminate the COMMA_SAFE with VA_ARGS trick ; revisit this file
2018-01-24 13:42:06 +00:00
paboyle
e737591918
Accelerator loops
2018-01-24 13:41:12 +00:00
paboyle
ba5ea5830b
Acceleartor loops
2018-01-24 13:40:56 +00:00
paboyle
43f244badf
Thread loops for now; figure out what can be GPU accelerated later here
2018-01-24 13:40:30 +00:00
paboyle
e9c8ba5ef7
Accelerator loosp
2018-01-24 13:39:54 +00:00
paboyle
d70709a8e8
Thread construct changes
2018-01-24 13:39:06 +00:00
paboyle
733f8ff0b2
Still using parallel_for -- don't know how to implement reduction on GPU yet. Look at some sample code is best.
2018-01-24 13:38:13 +00:00
paboyle
0bfa5bb213
Accelerator loosp
2018-01-24 13:37:26 +00:00
paboyle
1f26a234f9
CPU loops explicit for peek poke
2018-01-24 13:36:31 +00:00
paboyle
13f0116425
Accelerator loops
2018-01-24 13:35:55 +00:00
paboyle
25f589b064
Accelerator loops
2018-01-24 13:35:36 +00:00
paboyle
210c50a278
Accelerator prep work
2018-01-24 13:35:13 +00:00
paboyle
549a143e78
Accelerator related
2018-01-24 13:34:46 +00:00
paboyle
277301486d
Simple warning elimination
2018-01-24 13:34:15 +00:00
paboyle
c851b39a49
Nicer way of including aggregate
2018-01-24 13:33:34 +00:00
paboyle
15cc12eb6c
Delete the old non ET file
2018-01-24 13:33:07 +00:00
paboyle
ae4f1f8c12
New file, split out two from Lattice_reduction
2018-01-24 13:32:43 +00:00
paboyle
5609624b44
Threading constructs replaced
2018-01-24 13:32:24 +00:00
paboyle
b5a947dd79
Change to make NVCC happy
2018-01-24 13:32:02 +00:00
paboyle
ee16f62322
stray semicolon elimination. NVCC is picky, but eventually picked up these diags
...
with a pragma to suppress
2018-01-24 13:31:17 +00:00
paboyle
3318de27d6
Thread macro changes
2018-01-24 13:30:23 +00:00
paboyle
ac56965306
GPU changes and threading macros replaced
2018-01-24 13:28:30 +00:00
paboyle
8e99264f40
Accelerator mark up of entire tensore space for offload
2018-01-24 13:27:30 +00:00
paboyle
69327db9a9
Improviements for NVCC. Eigen is not compat with CUDA 9 and must hack to disable device
...
compilation
2018-01-24 13:25:07 +00:00
paboyle
7331ee2d80
Warnings control to overpower the NVCC compiler
2018-01-24 13:24:36 +00:00
paboyle
918c105c57
NVCC warning elimination
2018-01-24 13:23:59 +00:00
paboyle
be1511d469
Remove old macros for threading
2018-01-24 13:23:24 +00:00
paboyle
f1c31df9d2
updated Eigen version. Still didn't fix CUDA 9 no compile.
...
Worked around by switching off __NVCC__ during the include of Eigen and switching it
back on after. No Eigen code can be offloaded, note as a rsult of this. No harm done.
2018-01-24 13:19:29 +00:00
paboyle
ff7b587fad
Ugly... nvcc needs -x cu to compile .cc as cuda.
...
Since CXXFLAGS is Also passed to linker, and -x cu breaks link phase must replace
CXX and CXXLD with nvcc -x cu and nvcc -link respectively.
2018-01-24 13:18:19 +00:00
paboyle
4e1135b214
Updated pugixml to v1.8; still didn't fix no compile under nvcc.
...
Turns out nvcc was right; must to an explicit template instantiation that was missing
but left gcc, icpc and clang happy for some reason.
Fix this.
2018-01-24 13:17:10 +00:00
paboyle
acd4955a18
remove rdtsc on __NVCC__ as may be device called
2018-01-24 13:16:18 +00:00
paboyle
bd08dc4f45
Pragma use for nvcc, warning elimination.
2018-01-24 13:15:43 +00:00
paboyle
22d137d4e5
Namespace, nvcc warning elimination.
2018-01-24 13:14:43 +00:00
paboyle
87ee592176
Pragma changes and layout and warning elimination for nvcc
2018-01-24 13:14:09 +00:00
paboyle
063603b1ea
Warning elimination
2018-01-24 13:12:14 +00:00
paboyle
f292106db6
Split out pragms from threads.h;
...
More work needed; renam threads directory to "parallelism" or something like that
2018-01-24 13:11:04 +00:00
paboyle
9d08aebea9
Compile through nvcc ; warning elimination fixes
2018-01-24 13:09:53 +00:00
paboyle
4e30739093
First compile OK through nvcc on host
2018-01-24 13:08:47 +00:00
a1151fc734
Hadrons: MPI-safe serial IO
2018-01-23 17:26:50 +00:00
James Harrison
ab3baeb38f
Implement contractions and data output in functions; calculate diagrams S, X and 4C separately; output 2E and 2T instead of sunset_shifted, sunset_unshifted, tadpole_shifted, tadpole_unshifted; add comments.
2018-01-23 17:07:45 +00:00
Vera Guelpers
389731d373
changed SeqConservedSummed.hpp to work with new hadrons interface
2018-01-23 10:11:33 +00:00
6e3ce7423e
Hadrons: don't display module list at startup (too long)
2018-01-22 20:04:05 +00:00
15f15a7cfd
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# extras/Hadrons/Modules.hpp
# extras/Hadrons/modules.inc
2018-01-22 20:03:36 +00:00
0e5f626226
Hadrons: module for scalar operator divergence
2018-01-22 19:38:19 +00:00
Daniel Richtmann
04f92ccddf
WilsonMG: Provide a fix for the previous commit; compiles and runs successfully now
...
I don't like the solution with the temporary very much though ...
2018-01-22 14:56:48 +01:00
Daniel Richtmann
3b2d805398
WilsonMG: Some first steps towards coarse spin dofs; not compiling yet
...
A failing conversion from the innermost type (Grid::Simd<...>) to a coarse
scalar (triple iScalar) in blockPromote prohibits this commit from working.
2018-01-22 12:45:51 +01:00
Azusa Yamaguchi
97b9c6f03d
No option for interior/exterior split of asm kernels since different directions get interleaved
2018-01-22 11:04:19 +00:00
Azusa Yamaguchi
63982819c6
No option to overlap comms and compute for asm implementation since different directions are interleaved
...
in the kernels, introducing if else structure would be too painful
2018-01-22 11:03:39 +00:00
Vera Guelpers
6fec507bef
merged new hadrons interface
2018-01-22 10:09:20 +00:00
James Harrison
219b3bd34f
Remove freeVpTensor object
2018-01-19 17:14:11 +00:00
Daniel Richtmann
9dc885d297
Fix a bug in Wilson MG
...
The calculation of the lattice size of a second coarse level was incorrect.
2018-01-18 17:02:04 +01:00
Daniel Richtmann
a70c1feecc
Remove some unnecessary stuff in Wilson MG
2018-01-18 15:48:28 +01:00
Daniel Richtmann
38328100c9
Implement correctness checks for Wilson MG
2018-01-18 15:43:15 +01:00
Daniel Richtmann
9732519c41
Apply clang-format to Wilson MG
...
I can provide the configuration file I used if people want that.
2018-01-18 15:14:37 +01:00
Daniel Richtmann
fa4eeb28c4
Save current state in Wilson MG test file
2018-01-17 17:56:34 +01:00
Guido Cossu
b00d2d2c39
Correction of Representations compilation and small compilation error for Intel 17
2018-01-17 13:46:12 +00:00
Guido Cossu
f1b3e21830
Merge branch 'feature/clover' into develop
2018-01-17 10:07:42 +00:00
paboyle
90ea472411
Auto emacs format C++ no namespace indent
2018-01-15 11:44:54 +00:00
paboyle
56999474e2
Indent
2018-01-15 11:44:45 +00:00
paboyle
d74c21a386
GLobal edit for QCD namespace removal & NAMESPACE macros
2018-01-15 09:37:58 +00:00
paboyle
ca6bdd7302
Useful drive to emacs C++ mode
2018-01-15 00:24:41 +00:00
paboyle
6f20f1d224
Namespace
2018-01-15 00:24:20 +00:00
paboyle
d0e357ef89
CLeanup and no QCD namespace
2018-01-15 00:23:51 +00:00
paboyle
21251f2e1b
Namespace and formatting changes
2018-01-15 00:21:27 +00:00
paboyle
fcf1ccf669
Namespace, indent, badly formatted
2018-01-15 00:17:58 +00:00
paboyle
49cce514f1
Namespace
2018-01-15 00:17:11 +00:00
paboyle
695af98a1d
Namespace, indent, tidy
2018-01-15 00:16:13 +00:00
paboyle
f8cb46d360
Namspace, indent, badly formatted code fixed
2018-01-15 00:14:47 +00:00
paboyle
0da64dea90
Namespace, indent
2018-01-15 00:13:32 +00:00
paboyle
2cceebbf12
Namespace, indent
2018-01-15 00:12:20 +00:00
paboyle
40232dcefe
Namespce
2018-01-15 00:11:19 +00:00
paboyle
dbd86bb95b
CLeanup, namespace, indent
2018-01-15 00:10:11 +00:00
paboyle
b8fd2c161f
Indent, namespace
2018-01-15 00:09:33 +00:00
paboyle
df9b979583
Indent, namespace
2018-01-15 00:08:40 +00:00
paboyle
23ef0e3e19
Namespace and indentation
2018-01-15 00:07:46 +00:00
paboyle
ae9175735a
Indentation, Namespace
2018-01-15 00:07:10 +00:00
paboyle
2d13ea1a22
Namespace and indentation emacs choices
2018-01-15 00:05:55 +00:00
paboyle
8c675064bd
Namespace and indentation
2018-01-15 00:04:43 +00:00
paboyle
550b905bb8
Namespace nd indentation
2018-01-15 00:03:49 +00:00
paboyle
edb79dc088
Namespce,and indent
2018-01-15 00:02:33 +00:00
paboyle
88e635c5d1
Namepscae, format
2018-01-15 00:02:01 +00:00
paboyle
ecb4a24de8
Namespace
2018-01-15 00:01:25 +00:00
paboyle
c8c1d36710
Namespace, indent
2018-01-15 00:00:52 +00:00
paboyle
b4bb428d9b
Namespace, indent
2018-01-14 23:59:57 +00:00
paboyle
e9ef7e3852
Namespace, indent
2018-01-14 23:59:23 +00:00
paboyle
31cbbfc07e
Namespace, indent
2018-01-14 23:58:44 +00:00
paboyle
4eb0552d1d
Namespace, indnet
2018-01-14 23:58:03 +00:00
paboyle
08f2a4564f
Namespace, formatting
2018-01-14 23:56:33 +00:00
paboyle
7e00f643f8
Namespace indent
2018-01-14 23:55:44 +00:00
paboyle
c19ccdad7c
Namespace, indent
2018-01-14 23:55:07 +00:00
paboyle
8aed4181e1
Namespace, indent
2018-01-14 23:54:25 +00:00
paboyle
06ab7f5661
Namespace
2018-01-14 23:53:31 +00:00
paboyle
645ec8eba0
Namespace
2018-01-14 23:52:26 +00:00
paboyle
72ffa8a88e
Namespace
2018-01-14 23:51:38 +00:00
paboyle
4c829b410e
Namespace
2018-01-14 23:50:20 +00:00
paboyle
eda4fd9912
Namespace
2018-01-14 23:49:11 +00:00
paboyle
041d9137c0
Namespace
2018-01-14 23:48:27 +00:00
paboyle
eeacdfe031
Namespace
2018-01-14 23:47:37 +00:00
paboyle
e5535f4d72
Namespace, indent
2018-01-14 23:46:51 +00:00
paboyle
044a292281
Namespace, indnet
2018-01-14 23:46:07 +00:00
paboyle
fe0467df1e
Namespace, indenting
2018-01-14 23:45:19 +00:00
paboyle
19234fb40e
Namespace, format
2018-01-14 23:44:16 +00:00
paboyle
f445257d28
Namespace, indenting
2018-01-14 23:43:36 +00:00
paboyle
bdc2a987aa
Namespace, indent
2018-01-14 23:42:47 +00:00
paboyle
72acb0e48f
Namespace, indent
2018-01-14 23:41:59 +00:00
paboyle
b4e9211df7
Namespace, indent
2018-01-14 23:40:38 +00:00
paboyle
97019d2997
Namespace, format
2018-01-14 23:39:57 +00:00
paboyle
83c5f05094
Namespace, indent
2018-01-14 23:39:13 +00:00
paboyle
1619e42d90
Indent and Namespace changes
2018-01-14 23:38:25 +00:00
paboyle
9f6cebe5ff
Namespace and format changes
2018-01-14 23:37:40 +00:00
paboyle
a84ebe5624
Namespace, format change
2018-01-14 23:36:45 +00:00
paboyle
c527e39881
Namespace, format indent change
2018-01-14 23:36:07 +00:00
paboyle
a0f4687887
Namespace, formatting indent changes
2018-01-14 23:35:16 +00:00
paboyle
3ef7b2389e
Format eamcs style after NAMESPCCE change
2018-01-14 23:34:08 +00:00
paboyle
7dfa3d0b50
Namespace, format
2018-01-14 23:33:16 +00:00
paboyle
bf629dddce
Namespace, format improved
2018-01-14 23:32:19 +00:00
paboyle
7747b95430
Namespace, formatting emacs style
2018-01-14 23:31:28 +00:00
paboyle
ccd75c039a
Namespace, fmt
2018-01-14 23:30:34 +00:00
paboyle
493ea80208
Namespace
2018-01-14 23:29:53 +00:00
paboyle
229baf3aba
Namespace, emacs fmt
2018-01-14 23:29:02 +00:00
paboyle
0ce4ecfc84
Emacs format indent
2018-01-14 23:28:12 +00:00
paboyle
ddfaae8ea6
Namespace
2018-01-14 23:27:49 +00:00
paboyle
70c5b781e5
Namespace, clean up
2018-01-14 23:26:41 +00:00
paboyle
901e359d28
Namespace changes; need to simplify the EOFA as too many cases and duplicated from Mobius
2018-01-14 23:25:51 +00:00
paboyle
e857d4d4c8
Namespace, indent
2018-01-14 23:24:51 +00:00
paboyle
e5b77c7fd8
Namespace, indent
2018-01-14 23:24:06 +00:00
paboyle
3b5d629048
Namespace, format
2018-01-14 23:23:26 +00:00
paboyle
08772d5e0c
Namespace, indent
2018-01-14 23:22:42 +00:00
paboyle
017dcd69a6
Namespace, indent
2018-01-14 23:21:40 +00:00
paboyle
8178a17b88
Namespace, indent
2018-01-14 23:20:55 +00:00
paboyle
c5c1b53e54
Namespace, indent
2018-01-14 23:20:08 +00:00
paboyle
440f9e2013
Namespace, indent
2018-01-14 23:19:22 +00:00
paboyle
c98657d588
Namespace
2018-01-14 23:18:46 +00:00
paboyle
f450857716
Namespce, indent
2018-01-14 23:17:33 +00:00
paboyle
9ec238df9e
Namespace, indent
2018-01-14 23:16:49 +00:00
paboyle
3ba8eb1500
Namespace, indent
2018-01-14 23:16:08 +00:00
paboyle
8da49c5a34
Namespace
2018-01-14 23:15:26 +00:00
paboyle
e04f61b1fa
Namespace
2018-01-14 23:14:46 +00:00
paboyle
115e13b227
Namespace
2018-01-14 23:13:49 +00:00
paboyle
75f3062a80
Think this should move to the algorithms directory
2018-01-14 23:12:14 +00:00
paboyle
b460cd3ef1
Namespace, format
2018-01-14 23:11:24 +00:00
paboyle
0e6727a33b
Namespace, format; possibly some conflict with Azusa beware
2018-01-14 23:10:21 +00:00
paboyle
4c6745cb4c
Namespace
2018-01-14 23:09:44 +00:00
paboyle
efdd0e572c
Namespace
2018-01-14 23:09:10 +00:00
paboyle
ca60a218ac
Namespace
2018-01-14 23:08:35 +00:00
paboyle
03633d709e
Namespace
2018-01-14 23:07:36 +00:00
paboyle
4de58c4aab
Namespace
2018-01-14 23:06:47 +00:00
paboyle
4f8b1c1940
Namespace
2018-01-14 23:05:23 +00:00
paboyle
dec39b313d
Namespace and format
2018-01-14 23:04:37 +00:00
paboyle
dc835ad1cb
Namespace
2018-01-14 23:03:49 +00:00
paboyle
71c8c9e4fb
Pretty
2018-01-14 23:03:01 +00:00
paboyle
a935ef7b39
Namespace
2018-01-14 23:01:07 +00:00
paboyle
a97ad1a51d
Namespce
2018-01-14 23:01:01 +00:00
paboyle
5ab9129db3
Namespace
2018-01-14 22:58:42 +00:00
paboyle
634943c11f
Namepsace
2018-01-14 22:57:59 +00:00
paboyle
e598e65f69
Namespace
2018-01-14 22:57:10 +00:00
paboyle
291407dc7f
Namespace
2018-01-14 22:54:42 +00:00
paboyle
641a28aa1d
Namespace
2018-01-14 22:53:50 +00:00
paboyle
75207fa010
FOrmat
2018-01-14 22:53:13 +00:00
paboyle
c2b0e0269a
Namespace
2018-01-14 22:52:22 +00:00
paboyle
7828887604
Namespace, indent
2018-01-14 22:51:18 +00:00
paboyle
e6efc93a7c
Namespace
2018-01-14 22:50:35 +00:00
paboyle
ff7e773d5e
Namesapce
2018-01-14 22:49:48 +00:00
paboyle
a0380fad72
Namespace
2018-01-14 22:48:57 +00:00
paboyle
61e9a33777
Namesapce
2018-01-14 22:48:08 +00:00
paboyle
3e139b52d3
Namespace
2018-01-14 22:47:24 +00:00
paboyle
fd6031b005
Namespace
2018-01-14 22:46:17 +00:00
paboyle
fe44fc50d9
Namespace
2018-01-14 22:45:29 +00:00
paboyle
2dd88cf3f8
Namespace
2018-01-14 22:44:41 +00:00
paboyle
6b7e82f1a9
Namespace, indentation
2018-01-14 22:44:06 +00:00
paboyle
be612b3931
Namespace, indentation
2018-01-14 22:43:27 +00:00
paboyle
f5e74033f9
Namespace
2018-01-14 22:42:31 +00:00
paboyle
8d52e0a349
Namespace
2018-01-14 22:41:23 +00:00
paboyle
a60f6d353e
Namespace
2018-01-14 22:40:29 +00:00
paboyle
5d3b574325
Missing banner; should recreate globally
2018-01-14 22:39:24 +00:00
paboyle
6ee5ea6b32
Namespace QCD gone
2018-01-14 22:38:22 +00:00
paboyle
cc349c6512
Namespace
2018-01-14 22:36:59 +00:00
paboyle
fde2e07bf4
Namespace
2018-01-14 22:36:15 +00:00
paboyle
2f38fe8d45
Namespace
2018-01-14 22:35:24 +00:00
paboyle
813af84ae8
Format emacs C++ mode
2018-01-14 22:34:12 +00:00
paboyle
cfe6c6838f
Namespace
2018-01-14 22:33:18 +00:00
paboyle
12a7216dfe
Namespace
2018-01-14 22:32:29 +00:00
paboyle
71ebd61327
Namespace
2018-01-14 22:31:39 +00:00
paboyle
2c2da60cc2
Namespace
2018-01-14 22:30:54 +00:00
paboyle
7631ed9c56
Namespace
2018-01-14 22:30:09 +00:00
paboyle
65669b116e
Namespace
2018-01-14 22:29:18 +00:00
paboyle
ae2a6cfc6e
Namespace
2018-01-14 22:27:32 +00:00
paboyle
c36223055e
Namespace
2018-01-14 22:26:55 +00:00
paboyle
e42de105c5
Namespace
2018-01-14 22:26:11 +00:00
paboyle
b08dae0809
Namespace
2018-01-14 22:25:29 +00:00
paboyle
3bf8fddbb5
Namespace
2018-01-14 22:24:47 +00:00
paboyle
d29fa23ebc
Namespace
2018-01-14 22:23:49 +00:00
paboyle
c978c88521
Namespace
2018-01-14 22:22:27 +00:00
paboyle
93f09818da
Namespace
2018-01-14 22:21:40 +00:00
paboyle
54a8ea93ec
Namespace QCD gone
2018-01-14 22:20:42 +00:00
paboyle
56e87d6e55
Namespace
2018-01-14 22:19:25 +00:00
paboyle
df29cc19ab
Namespace
2018-01-14 22:18:27 +00:00
paboyle
e61189db3f
Namespace
2018-01-14 22:17:43 +00:00
paboyle
361ce948c3
Namespace
2018-01-14 22:16:33 +00:00
paboyle
049b4a4631
Namespace
2018-01-14 22:15:55 +00:00
paboyle
9f2f294a27
Namespace
2018-01-14 22:14:58 +00:00
paboyle
81dcd0e6ea
Namespace
2018-01-14 22:13:46 +00:00
paboyle
34a788331f
Namespace
2018-01-14 22:13:02 +00:00
paboyle
e2c39945b3
Namespace
2018-01-14 22:11:03 +00:00
paboyle
1591d391b9
Namespace
2018-01-14 22:09:42 +00:00
paboyle
f4c06ed8c0
Namespace
2018-01-14 22:08:25 +00:00
paboyle
1f49f781bf
Namespace
2018-01-14 22:07:27 +00:00
paboyle
3a9f746421
Namespace
2018-01-14 22:06:01 +00:00
paboyle
4491d87766
Namespace
2018-01-14 22:04:21 +00:00
paboyle
0e080a7abc
Namespace
2018-01-14 22:03:14 +00:00
paboyle
8bf78846ee
Namespace
2018-01-14 22:02:09 +00:00
paboyle
9aa34dc803
Namespace
2018-01-14 22:01:17 +00:00
paboyle
fdcbe0a0d1
Namespace
2018-01-14 22:00:29 +00:00
paboyle
6a62a9c6a5
Namespace
2018-01-14 21:59:48 +00:00
paboyle
b331ecea78
Namespace
2018-01-14 21:58:47 +00:00
paboyle
66f8a2f082
Namespace
2018-01-14 21:57:46 +00:00
paboyle
d58b7cf9b9
Namespace changes
2018-01-14 21:56:55 +00:00
paboyle
0d749becff
Namespace
2018-01-14 21:55:47 +00:00
paboyle
1dbea9aa69
Namespace
2018-01-14 21:54:28 +00:00
paboyle
c1438cbbe3
Namespace
2018-01-14 21:53:39 +00:00
paboyle
f4623fd551
Namespace
2018-01-14 21:53:05 +00:00
paboyle
59ba9ff3bb
NAMESPACE & format
2018-01-14 21:52:27 +00:00
paboyle
1fbab4032b
Namespace changes
2018-01-14 21:51:19 +00:00
paboyle
c037244874
Tensor reformatted with NAMESPACE too
2018-01-13 00:31:02 +00:00
paboyle
f4272aa6fd
Clean up
2018-01-13 00:19:19 +00:00
paboyle
8cb7a1a887
Format
2018-01-13 00:17:16 +00:00
paboyle
b45bd8e097
NAMESPACE
2018-01-13 00:16:34 +00:00
paboyle
5e48b701ec
FOrmatting
2018-01-13 00:11:53 +00:00
paboyle
7f6bffe5ad
NAMESPACE
2018-01-13 00:11:30 +00:00
paboyle
6bf5fb1924
Clean up and format NAMESPACE
2018-01-13 00:08:25 +00:00
paboyle
086db7bd19
NAMESPACE and reformat
2018-01-13 00:05:33 +00:00
paboyle
c0a9b38c02
C++ NAMESPACE format emacs happy
2018-01-13 00:03:57 +00:00
paboyle
6d7bdfb5f5
Emacs happy
2018-01-13 00:02:53 +00:00
paboyle
be5d70ae6e
C++ happy
2018-01-13 00:02:10 +00:00
paboyle
ab1068044e
C++ emacs happy
2018-01-13 00:01:58 +00:00
paboyle
dda151250f
Emacs format
2018-01-12 23:59:58 +00:00
paboyle
18daf85069
Emacs format
2018-01-12 23:58:23 +00:00
paboyle
81cc28f6ca
Format
2018-01-12 23:57:22 +00:00
paboyle
c01a1e02fe
Namespace, format
2018-01-12 23:55:38 +00:00
paboyle
7e70f4ed9c
Format, NAMESPACE
2018-01-12 23:55:03 +00:00
paboyle
1056e36f11
Format, NAMESPACE
2018-01-12 23:49:46 +00:00
paboyle
0b8a88978b
Format, NAMESPACE
2018-01-12 23:47:24 +00:00
paboyle
59b31b6bb8
Format, NAMESPACE
2018-01-12 23:43:44 +00:00
paboyle
69496482fc
Format, NAMESPACE
2018-01-12 23:42:22 +00:00
paboyle
4be31ad1f6
C++ indentation
2018-01-12 23:39:49 +00:00
paboyle
176a021ce9
Formatting, NAMESPACE§
2018-01-12 23:38:15 +00:00
paboyle
b673174b71
FOrmat, NAMESPACE
2018-01-12 23:29:22 +00:00
paboyle
e6f7a5a818
Namespace
2018-01-12 23:28:01 +00:00
paboyle
68b69a2ac0
Namespace management
2018-01-12 23:26:14 +00:00
paboyle
bd15c38ae8
Formatting emacs compliant
2018-01-12 23:25:02 +00:00
paboyle
b815f5f764
Formatting
2018-01-12 23:23:21 +00:00
paboyle
4da437431e
Reformat
2018-01-12 23:22:46 +00:00
paboyle
3c7bf211a9
Reformat
2018-01-12 23:22:18 +00:00
paboyle
347d5404dd
format
2018-01-12 23:21:25 +00:00
paboyle
5e2cd0d07c
Format
2018-01-12 23:18:22 +00:00
paboyle
62fcee72c5
Format, NAMESPACE
2018-01-12 23:16:37 +00:00
paboyle
0a6168eef0
Format emacs style
2018-01-12 23:11:22 +00:00
paboyle
63865e4232
format
2018-01-12 23:10:48 +00:00
paboyle
c64deedf74
Format
2018-01-12 23:09:35 +00:00
paboyle
3281559ec3
Format
2018-01-12 23:09:01 +00:00
paboyle
6a2eca2ec2
NAMESAPCE
2018-01-12 23:00:03 +00:00
paboyle
d8ff895e74
NAMESPACE and format
2018-01-12 18:27:22 +00:00
paboyle
00c49d4c17
Format
2018-01-12 18:25:39 +00:00
paboyle
ec89714cce
NAMESPACE
2018-01-12 18:24:16 +00:00
paboyle
6ab744c720
NAMESPACE and formatting
2018-01-12 18:11:04 +00:00
paboyle
bbb657da5c
NAMESPACE and formatting
2018-01-12 18:10:11 +00:00
paboyle
fbc2380cb8
NAMESPACE & format
2018-01-12 18:05:36 +00:00
paboyle
08682c5461
NAMESPACE and format to my liking
2018-01-12 18:03:57 +00:00
paboyle
13bce2a6bf
NAMESPACE
2018-01-12 17:58:53 +00:00
paboyle
70e689900b
NAMESPACE
2018-01-12 17:58:13 +00:00
Guido Cossu
b7f8c5b823
Modify test to merge with the new Lanczos interface
2018-01-12 14:38:27 +00:00
Guido Cossu
3923683e9b
Updating the feature/clover branch with the newest Hadron package
2018-01-12 13:35:51 +00:00
Guido Cossu
e199fda9dc
Merge pull request #136 from pretidav/feature/clover
...
Feature/clover
2018-01-12 11:57:08 +00:00
7bb405e790
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/communicator/Communicator_mpi3_leader.cc
# lib/communicator/Communicator_shmem.cc
2018-01-11 18:50:15 +00:00
Daniel Richtmann
10f7a17ae4
Make timing in VPGCR more detailed
2018-01-11 13:42:18 +01:00
Daniel Richtmann
26f14d7dd7
Adapt output format of non-herm solvers to the one of VPGCR
2018-01-11 13:36:30 +01:00
ec16eacc6a
Hadrons: scalar SU(N) 2-pt function
2018-01-10 22:12:21 +00:00
pretidav
cf858deb16
Lanczos with 2 reps fixed (tobe tested)
2018-01-10 18:43:02 +01:00
David Preti
a3affac963
SU3 restored + output filename for mesons and baryons fixed.
2018-01-10 14:56:54 +01:00
d9d1f43ba2
Hadrons: code cleaning
2018-01-10 11:31:24 +00:00
b7cd721308
Hadrons: scalar SU(N) tr(mag^n)
2018-01-10 11:25:59 +00:00
29f026c375
Hadrons: scalar SU(N) tr(phi^n) 1-pt function
2018-01-10 11:01:03 +00:00
58c7a13d54
Hadrons: result file macro with trajectory number
2018-01-10 10:59:58 +00:00
Azusa Yamaguchi
24162c9ead
Staggered overlap comms comput
2018-01-09 13:02:52 +00:00
Daniel Richtmann
73434db636
Merge remote-tracking branch 'upstream/develop' into feature/ddalphaamg
2018-01-09 10:43:33 +01:00
paboyle
e564d11687
Allow resize of the shared memory buffers
2018-01-08 15:20:26 +00:00
paboyle
0b2162f375
Clean up
2018-01-08 14:06:53 +00:00
paboyle
5610570182
Synthetic test of lanczos
2018-01-08 11:36:39 +00:00
paboyle
44f65526e0
Simplify communicators
2018-01-08 11:35:43 +00:00
paboyle
43e48542ab
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2018-01-08 11:34:45 +00:00
paboyle
0b85f1bfc8
Simplify the communicator proliferation: mpi and none.
2018-01-08 11:33:47 +00:00
paboyle
9947cfbf14
Simplify number of communicator cases
2018-01-08 11:33:01 +00:00
paboyle
357badce5e
Simplify communicator case proliferation
2018-01-08 11:32:16 +00:00
paboyle
0091eec23a
Simplify communicator cases
2018-01-08 11:31:32 +00:00
paboyle
9e9c2962df
Simplify comms layer proliferation
2018-01-08 11:30:22 +00:00
paboyle
bda97212a9
Simplify proliferation of comms layers
2018-01-08 11:29:20 +00:00
paboyle
b91282ad46
Simplify comms layer proliferation
2018-01-08 11:28:52 +00:00
paboyle
0a68470f9a
Simplify comms layers
2018-01-08 11:28:30 +00:00
paboyle
6ecf280723
Simplify comms layer proliferation
2018-01-08 11:28:04 +00:00
paboyle
7eeab7f995
Simplify comms layers
2018-01-08 11:27:43 +00:00
paboyle
9b32d51cd1
Simplify comms layer proliferatoin
2018-01-08 11:27:14 +00:00
paboyle
7b3ed160aa
Rationalise MPI options
2018-01-08 11:26:48 +00:00
paboyle
1a0163f45c
Updated to do list
2018-01-08 11:26:11 +00:00
Daniel Richtmann
c6411f8514
Merge remote-tracking branch 'upstream/develop' into feature/ddalphaamg
2018-01-08 10:37:10 +01:00
David Preti
9028e278e4
Trying to fix a bug with SU4 mesons (still under investigation)
2018-01-06 15:57:38 +01:00
dd62f2f371
Hadrons: log message fix
2017-12-29 16:58:44 +01:00
0d612039ed
Hadrons: prettier Grid logging (non-intrusive)
2017-12-29 16:58:23 +01:00
e8ac75055c
Hadrons: binary configuration loader
2017-12-27 14:24:29 +01:00
8b30c5956c
Hadrons: copyright update
2017-12-26 14:16:47 +01:00
185da83454
Hadrons: new MIO module namespace, NERSC loader moved there
2017-12-26 14:05:17 +01:00
6718fa8c4f
Merge branch 'feature/scalar_adjointFT' into feature/hadrons
2017-12-26 12:59:33 +01:00
pretidav
4ce63af7d5
Working on Hadrons with Hirep. (QCD is set for SU4)
2017-12-22 19:02:07 +01:00
Daniel Richtmann
6cf635d61c
Remove some old code in Wilson MG
2017-12-22 13:20:09 +01:00
Daniel Richtmann
39558cce52
Multiply TVs in Wilson MG with G5 instead of G5R5
2017-12-22 13:07:56 +01:00
Vera Guelpers
935cd1e173
conserved current insertion summed over Lorentzindex
2017-12-22 11:38:45 +00:00
Vera Guelpers
55e39df30f
tadpole insertion for DWF
2017-12-22 11:36:31 +00:00
67c3fa0f5f
Hadrons: all modules are now ported, more tests need to be done
2017-12-21 11:39:07 +00:00
65d4f17976
Hadrons: no errors when trying to recreate a cache
2017-12-19 20:28:32 +00:00
e2fe97277b
Hadrons: getReference use is rare, empty by default
2017-12-19 20:28:04 +00:00
Guido Cossu
84f9c37ed4
Merge branch 'feature/scalar_adjointFT' of https://github.com/paboyle/Grid into feature/scalar_adjointFT
2017-12-19 15:43:55 +00:00
bcf6f3890c
Hadrons: more fixes after test
2017-12-14 21:14:10 +00:00
591a38c487
Hadrons: VM fixes
2017-12-14 19:42:16 +00:00
James Harrison
581be32ed2
Implement infrared improvement for v=0 on-shell self-energy
2017-12-14 13:42:41 +00:00
842754bea9
Hadrons: most modules ported to the new interface, compiles but untested
2017-12-13 19:41:41 +00:00
James Harrison
6bc136b1d0
Add module for calculating diagrams required for HVP counter-terms
2017-12-13 17:31:01 +00:00
0887566134
Hadrons: scheduler back!
2017-12-13 16:36:15 +00:00
61fc50d616
Hadrons: better organisation of the VM
2017-12-13 13:44:23 +00:00
a9c8d7dad0
Hadrons: code cleaning
2017-12-13 12:13:40 +00:00
259d504ef0
Hadrons: first full implementation of the module memory profiler
2017-12-12 19:32:58 +00:00
f3a77f4b7f
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-12-12 14:05:23 +00:00
26d7b829a0
Hadrons: error managed through expections
2017-12-12 14:04:28 +00:00
64161a8743
Hadrons: much simpler reference dependency
2017-12-12 13:08:01 +00:00
2401360784
Merge pull request #138 from guelpers/feature/hadrons
...
bug fix in sequential insertion of conserved vector current
2017-12-11 18:53:41 +01:00
Vera Guelpers
2cfb50cbe5
bug fix in sequential insertion of conserved vector current
2017-12-08 11:13:39 +00:00
f9aa39e1c4
global memory debug through command line flag
2017-12-07 14:40:58 +01:00
Daniel Richtmann
df152648d6
Fix error in MR code when compiling for single precision
2017-12-06 18:00:58 +01:00
0fbf445edd
Hadrons: object creation that get properly captured by the memory profiler
2017-12-06 16:51:48 +01:00
e78794688a
memory profiler improvement
2017-12-06 16:50:25 +01:00
9e31307963
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-12-06 16:49:32 +01:00
29e2eddea8
Merge branch 'develop' into feature/hadrons-new-memory-model
2017-12-06 16:49:21 +01:00
0a038ea15a
Merge branch 'develop' into feature/hadrons
2017-12-06 16:49:10 +01:00
62eb1f0e59
FermionOperator virtual destructor needed for polymorphism
2017-12-06 16:48:17 +01:00
5422251959
Hadrons: execution part moved in a new virtual machine class
2017-12-05 15:31:59 +01:00
paboyle
9579c9c327
Threading improvement
2017-12-05 14:12:22 +00:00
paboyle
3729c7a7a6
Clean up of test
2017-12-05 13:07:31 +00:00
paboyle
c24d4c8d0e
Improved parallel RNG init
2017-12-05 13:01:10 +00:00
paboyle
a14038051f
Improved AllToAll asserts
2017-12-05 11:43:25 +00:00
paboyle
3e560b9462
Faster RNG init
2017-12-05 11:42:05 +00:00
paboyle
d93c6760ec
Faster code for split unsplit
2017-12-05 11:39:26 +00:00
paboyle
ae3b7713a9
Cold start doesnt need RNG
2017-12-05 11:36:31 +00:00
cbd8fbe771
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-12-03 19:48:56 +01:00
d391f05cb7
Merge branch 'develop' into feature/hadrons
2017-12-03 19:48:46 +01:00
3127b52c90
bootstrap script does not destroy Eigen is working offline
2017-12-03 19:48:34 +01:00
01f00385a4
Hadrons: genetic pair selection based on exponential probability
2017-12-03 19:47:40 +01:00
59aae5f5ec
Hadrons: garbage collector clean temporaries
2017-12-03 19:47:11 +01:00
624246409c
Hadrons: module setup/execute protected to forbid user to bypass execution control
2017-12-03 19:46:18 +01:00
2a9ebddad5
Hadrons: scheduler offline, minimal code working again
2017-12-03 19:45:15 +01:00
ff7afe6e17
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-12-01 19:45:44 +00:00
33cb509d4b
Merge branch 'develop' into feature/hadrons
2017-12-01 19:45:32 +00:00
456c78c233
Merge branch 'develop' into feature/hadrons-new-memory-model
2017-12-01 19:45:12 +00:00
2fd4989029
Merge branch 'develop' of github.com:paboyle/Grid into develop
2017-12-01 19:44:31 +00:00
2427a21428
minor serial IO fixes, XML now issues warning when trying to read absent nodes, these becomes
2017-12-01 19:44:07 +00:00
514993ed17
Hadrons: progress on the interface, genetic algorithm freezing
2017-12-01 19:38:23 +00:00
David Murphy
7a0c9e84f8
Fix HDF5 src
2017-11-29 18:03:03 -05:00
David Murphy
caf1a3c85d
Also add HDF5 src
2017-11-29 17:58:02 -05:00
David Murphy
21ca730a49
Also import some dependcies
2017-11-29 17:34:40 -05:00
David Murphy
c6cd27e9b2
Also import Eigen.inc
2017-11-29 17:26:20 -05:00
David Murphy
6068411d61
Remove Eigen from gitignore
2017-11-29 17:24:40 -05:00
Daniel Richtmann
4e965c168e
Implement analogon to test vector analysis in WMG codebase
2017-11-29 15:05:27 +01:00
Daniel Richtmann
f260af546e
Save current state
2017-11-28 15:03:02 +01:00
paboyle
28ceacec45
Split/Unsplit working
2017-11-27 15:13:29 +00:00
paboyle
e6a3e375cf
Debug
2017-11-27 15:10:22 +00:00
paboyle
4987edbd44
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-11-27 12:34:56 +00:00
paboyle
ad140bb6e7
Clean on multinode target after split 1 1 2 4 -> 1 1 2 2
2017-11-27 12:34:25 +00:00
paboyle
1f04e56038
Believe split/unsplit works, but need to make pretty
2017-11-27 12:33:08 +00:00
paboyle
4bfc8c85c3
Clean up verbose communicator create
2017-11-27 12:32:37 +00:00
azusayamaguchi
e55397bc13
Staggerd cg
2017-11-24 14:18:30 +00:00
Daniel Richtmann
649b8c9aca
Save current state
2017-11-24 10:46:20 +01:00
Daniel Richtmann
0afa22747d
Merge remote-tracking branch 'upstream/develop' into feature/new-solver-algorithms
2017-11-24 10:11:42 +01:00
a3fe874a5b
Hadrons: everything is broken, repairing while implementing the new memory model
2017-11-22 23:27:19 +00:00
f403ab0133
gitignore update
2017-11-22 17:13:09 +00:00
paboyle
94b8fb5686
Debug in progress
2017-11-19 01:39:04 +00:00
Guido Cossu
1f1d77b01a
Performance metrics for the Scalar Action force term
2017-11-14 10:01:48 +00:00
pretidav
6a15e2e8ef
Added WilsonTwoIndexAntiSymmImpl instantiation in WilsonKernelsHand.cc (shoud not be necessary)
2017-11-12 14:16:19 +01:00
074d17429f
Merge branch 'develop' into feature/scalar_adjointFT
...
# Conflicts:
# lib/communicator/Communicator_mpi3.cc
2017-11-11 18:09:55 +00:00
Daniel Richtmann
fa43206c79
Remove some empty lines
2017-11-10 13:48:38 +01:00
Peter Boyle
25f73018f4
Merge pull request #135 from fionnoh/develop
...
Declaring virtual functions as pure virtual functions.
2017-11-09 23:19:08 +00:00
fionnoh
1d7ccc6b2c
Declaring virtual functions as pure virtual functions.
2017-11-09 19:46:57 +00:00
Daniel Richtmann
a367835bf2
Set everything up for the implementation of FCAGMRES
...
The current implementation is the exact same code as normal FGMRES. This commit
only sets up the "framework" for the implementation of FCAGMRES, i.e., a test
and an include in the algorithms header file.
2017-11-09 17:30:41 +01:00
Daniel Richtmann
d7743591ea
Fix some minor formatting errors
2017-11-09 17:28:19 +01:00
Daniel Richtmann
c6cbe533ea
Set everything up for the implementation of CAGMRES
...
The current implementation is the exact same code as normal GMRES. This commit
only sets up the "framework" for the implementation of CAGMRES, i.e., a test and
an include in the algorithms header file.
2017-11-09 17:14:44 +01:00
Daniel Richtmann
8402ab6cf9
Some minor formatting improvements
2017-11-09 12:52:04 +01:00
Daniel Richtmann
c63095345e
Remove some superfluous comments
2017-11-09 12:47:20 +01:00
pretidav
59d9ccf70c
restored WilsonKernelsHand.cc and added Qtop to production codes
2017-11-08 22:02:32 +01:00
Daniel Richtmann
a7ae46b61e
Remove some comments
2017-11-08 16:58:20 +01:00
Daniel Richtmann
cd63052205
Remove everything preconditioner-related in GMRES code
2017-11-08 16:57:40 +01:00
Daniel Richtmann
699d537cd6
Add FGMRES test with staggered fermions
2017-11-08 16:56:42 +01:00
Daniel Richtmann
9031f0ed95
Fix a filename in a file header
2017-11-08 16:42:26 +01:00
Daniel Richtmann
26b3d441bb
Check in forgotten FGMRES test with wilson Fermions
2017-11-08 16:39:11 +01:00
Daniel Richtmann
99bc4cde56
Fix an implementation error in FGMRES
2017-11-08 16:38:34 +01:00
Daniel Richtmann
e843d83d9d
Make z in FGMRES a single Field
2017-11-08 16:38:16 +01:00
Daniel Richtmann
0f75ea52b7
First version of FGMRES; not working yet
2017-11-08 16:17:18 +01:00
Daniel Richtmann
8107b785cc
Rename misunderstood "rsd_sq" to "rsq" in GMRES code
2017-11-08 14:40:03 +01:00
Daniel Richtmann
37b777d801
Add test for GMRES solver with staggered fermions
2017-11-08 14:28:48 +01:00
Daniel Richtmann
7382787856
Some minor changes
2017-11-08 14:23:55 +01:00
Daniel Richtmann
781c611ca0
Perform minor code style fix
2017-11-08 14:22:38 +01:00
Daniel Richtmann
b069090b52
Remove a superfluous comment
2017-11-08 13:58:02 +01:00
Daniel Richtmann
0c1c1d9900
Set precision and formatting only once in MR code
2017-11-08 13:57:06 +01:00
Daniel Richtmann
7f4ed6c2e5
First working version of GMRES + a test for Wilson fermions
2017-11-08 13:56:41 +01:00
Daniel Richtmann
56d32a4afb
Rename misunderstood "rsd_sq" to "rsq" in MR code
2017-11-08 13:51:08 +01:00
Daniel Richtmann
b8ee496ed6
Print some info at start of GMRES
2017-11-08 13:23:41 +01:00
Azusa Yamaguchi
1860b1698c
Fixed the bag on MPI_T at Cam
2017-11-08 09:03:01 +00:00
Azusa Yamaguchi
9b8d1cc3da
Staggered Schur decomposed matrix norm changed to not be the Schur anymore :(
...
Carleton wanted this for multimass / multishift
2017-11-07 14:48:45 +00:00
James Harrison
0c668bf46a
QedFVol: Write to output files from one process only.
2017-11-07 14:46:39 +00:00
Guido Cossu
149c3f9e9c
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-11-07 14:01:13 +00:00
Daniel Richtmann
b87416dac4
Fix error with conformable
2017-11-07 15:00:08 +01:00
Daniel Richtmann
176bf37372
Remove some commented stuff
2017-11-07 14:57:36 +01:00
Guido Cossu
c519aab19d
Fixing the MPI memory leak in the communicators
2017-11-07 13:55:37 +00:00
Daniel Richtmann
b3d342ca22
Remove old implementation of GMRES operator
2017-11-07 10:24:49 +01:00
Daniel Richtmann
e1f928398d
Save current state
2017-11-07 10:22:41 +01:00
paboyle
69929f20bb
Destructor fix. Split Grid and MPI3 will not yet work without more effort from me.
2017-11-06 23:45:00 +00:00
Daniel Richtmann
8c579d2d4a
Save current state
2017-11-06 18:09:48 +01:00
James Harrison
840814c776
QedFVol: Patch to fix MPI communicators error
2017-11-06 16:34:55 +00:00
Daniel Richtmann
fc7d07ade0
Correct function signature of body of GMRES outer loop
2017-11-06 17:12:38 +01:00
Daniel Richtmann
b3be9195b4
Save one lattice fermion in GMRES code
2017-11-06 17:12:23 +01:00
Daniel Richtmann
9e3c187a4d
Save current state
2017-11-06 17:05:25 +01:00
Daniel Richtmann
8363edfcdb
Perform some minor changes to GMRES code
2017-11-06 16:17:44 +01:00
Daniel Richtmann
74af31564f
Adapt style of wilson GMRES test to style of wilson MR test
2017-11-06 14:06:45 +01:00
Daniel Richtmann
e0819d395f
Merge remote-tracking branch 'upstream/develop' into feature/new-solver-algorithms
2017-11-06 13:09:36 +01:00
pretidav
a493429218
added Production tests for MixedRep, Adj, 2S, 2AS. Still missing QObs. The HMC is not printing correctly all the actions and forces.
2017-11-04 18:16:54 +01:00
pretidav
915f610da0
clover 2indexSymm hmc production test created. clover 2indexAsymm and clover mixed to be filled.
2017-11-04 01:17:06 +01:00
pretidav
c79606a5dc
Test production code wilson clover. Still missing QObs measurement on-the-fly.
2017-11-03 22:46:32 +01:00
James Harrison
95af55128e
QedFVol: Redo optimisation of scalar VP (extra memory requirements were not the problem), and undo optimisation of charged propagator (which seemed to be causing HDF5 errors, although I don’t know why).
2017-11-03 18:46:16 +00:00
James Harrison
9f2a57e334
QedFVol: Undo optimisation of scalar VP, to reduce memory requirements
2017-11-03 13:10:11 +00:00
James Harrison
c645d33db5
QedFVol: Redo optimisation of charged propagator, and fix I/O bug
2017-11-03 10:59:26 +00:00
James Harrison
e0f1349524
QedFVol: Undo optimisation of charged propagator
2017-11-03 09:22:41 +00:00
paboyle
360efd0088
Improved treatment of reverse asked for by chris.
...
Truncate the basis.
Power method renormalises
2017-11-02 22:05:31 +00:00
pretidav
7b42ac9982
added polyakov loop observable to the hmc
2017-11-02 21:58:16 +01:00
paboyle
c5c647e35e
Merge branch 'feature/lanczos-reorg' into develop
2017-11-02 15:23:11 +00:00
a4e5fd1000
Merge branch 'feature/hadrons' into feature/hadrons-new-memory-model
2017-11-01 19:24:51 +00:00
682e7d7839
Merge branch 'develop' into feature/hadrons
2017-11-01 19:24:38 +00:00
Guido Cossu
8e057721a9
Anisotropic Clover term written and tested
2017-11-01 12:50:54 +00:00
Guido Cossu
fa5e4add47
Added support for anisotropy to the WilsonFermion class
2017-10-31 18:20:38 +00:00
Daniel Richtmann
6f81906b00
Add test for the MR solver with staggered fermions; does not converge atm
...
TODO: Is this a property of staggered or did I do something wrong?
2017-10-30 16:57:55 +01:00
James Harrison
79b761f923
Merge branch 'develop' into feature/qed-fvol
...
# Conflicts:
# lib/communicator/Communicator_base.cc
2017-10-30 15:53:18 +00:00
James Harrison
0d4e31ca58
QedFVol: Calculate phase factors for momentum projections once per configuration only.
2017-10-30 15:46:50 +00:00
Daniel Richtmann
a2d83d4f3d
Add test for the MR solver with DW fermions; does not converge atm
...
TODO: Is this a property of DWF or did I do something wrong?
2017-10-30 16:39:30 +01:00
Daniel Richtmann
89bacb0470
Fix path in MR solver header commentary
2017-10-30 16:16:55 +01:00
James Harrison
b07a354a33
QedFVol: output scalar propagator before FFT in spatial directions.
2017-10-30 14:20:44 +00:00
Daniel Richtmann
19010ff66a
Merge remote-tracking branch 'upstream/develop' into feature/new-solver-algorithms
2017-10-30 13:16:46 +01:00
paboyle
27ea2afe86
No compile on comms == none fix
2017-10-30 01:14:11 +00:00
paboyle
78e8704eac
Shaking out
2017-10-30 00:25:31 +00:00
paboyle
67131d82f2
Get subrank info from communicator constructor
2017-10-30 00:24:11 +00:00
paboyle
615a9448b9
Extended sub comm supported
2017-10-30 00:23:34 +00:00
paboyle
00164f5ce5
:
2017-10-30 00:22:52 +00:00
paboyle
a7f72eb994
SHaking out
2017-10-30 00:22:06 +00:00
paboyle
501fa1614a
Communicator updates for split grid
2017-10-30 00:16:12 +00:00
paboyle
5bf42e1e15
Update
2017-10-30 00:05:21 +00:00
paboyle
fe4d9b003c
More digits
2017-10-30 00:04:47 +00:00
paboyle
4a699b4da3
New rank can be found out
2017-10-30 00:04:14 +00:00
paboyle
689323f4ee
Reverse dim ordering lexico support
2017-10-30 00:03:15 +00:00
Guido Cossu
749189fd72
Full clover force correct
2017-10-29 12:03:08 +00:00
Guido Cossu
f941c4ee18
Clover term force ok
2017-10-29 11:43:33 +00:00
paboyle
84b441800f
Merge branch 'develop' into feature/lanczos-reorg
2017-10-27 14:21:38 +01:00
paboyle
1ef424b139
Split grid Y2K bug fix attempt
2017-10-27 14:20:35 +01:00
Daniel Richtmann
5a477ed29e
Perform minor style correction
2017-10-27 14:46:18 +02:00
Daniel Richtmann
54128d579a
Make MR a bit more verbose
2017-10-27 14:45:29 +02:00
Daniel Richtmann
e7b1933e88
Add a test for the MR solver
2017-10-27 14:38:57 +02:00
Daniel Richtmann
1bad64ac6a
Some formatting
2017-10-27 14:35:04 +02:00
Daniel Richtmann
15dfa9f663
Change stopping criterion implementation in MR solver + some cleanup
2017-10-27 14:33:25 +02:00
Daniel Richtmann
2185b0d651
Correct author in the file
2017-10-27 14:32:38 +02:00
Daniel Richtmann
f61c0b5d03
Very early version of MR solver
2017-10-27 14:09:02 +02:00
Daniel Richtmann
074db32e54
Fix build of gmres test
2017-10-27 14:08:48 +02:00
paboyle
aa66f41c69
Bug fix in the coarse restore...
...
Think this is nearly there
2017-10-27 10:29:34 +01:00
paboyle
f96c800d25
Passes reload of coarse basis
2017-10-27 09:43:22 +01:00
paboyle
32a52d7583
Move the local coherence lanczos into algorithms.
...
Keep the I/O in the tester. Other people can copy this method to write other I/O formats.
2017-10-27 09:04:31 +01:00
paboyle
fa04b6d3c2
Finished ? Verifying coarse evec restore
2017-10-27 08:18:29 +01:00
paboyle
7fab183c0e
Better read test
2017-10-27 08:17:49 +01:00
paboyle
9ec9850bdb
64bit ftello update
2017-10-26 23:34:31 +01:00
paboyle
0c4ddaea0b
Cleaning up
2017-10-26 23:31:46 +01:00
paboyle
00ebc150ad
Mistake in string parse; interface is ambiguous and must fix. Is char * a file, or a XML buffer ?
2017-10-26 23:30:37 +01:00
paboyle
0f3e9ae57d
Gsites error. Only appeared (so far) in I/O code for even odd fields
2017-10-26 23:29:59 +01:00
Azusa Yamaguchi
034de160bf
Staggered updates : Schur fixed and added a unit test for Test_staggered_cg_schur.cc giving stronger check
2017-10-26 20:58:46 +01:00
Guido Cossu
76bcf6cd8c
Deleting vscode settings file
2017-10-26 18:45:41 +01:00
Guido Cossu
91b8bf0613
Debugging force term
2017-10-26 18:23:55 +01:00
paboyle
14507fd6e4
Final? candidate for push back on the lanczos reorg feature
2017-10-26 16:25:01 +01:00
paboyle
2db05ac214
Test for split/unsplit in isolation
2017-10-26 07:48:03 +01:00
paboyle
31f99574fa
Moving these out of algorithms
2017-10-26 07:47:42 +01:00
paboyle
a34c8a2961
Update to IRL; getting close to the structure I would like.
2017-10-26 07:45:56 +01:00
paboyle
ccd20df827
Better IRL interface
2017-10-26 01:59:59 +01:00
paboyle
e9be293444
Better messaging
2017-10-26 01:59:30 +01:00
paboyle
d577211cc3
Relax stoppign condition
2017-10-25 23:57:54 +01:00
paboyle
f4336e480a
Faster converge time
2017-10-25 23:53:44 +01:00
paboyle
e4d461cb03
Messagign
2017-10-25 23:53:19 +01:00
paboyle
3d63b4894e
Use existing functionality where possible
2017-10-25 23:52:47 +01:00
paboyle
08583afaff
Red black friendly coarsening
2017-10-25 23:51:18 +01:00
paboyle
b395a312af
Better error messaging
2017-10-25 23:50:37 +01:00
paboyle
66295b99aa
Bit less verbose SciDAC IO
2017-10-25 23:50:05 +01:00
paboyle
b8654be0ef
64 bit safe offsets
2017-10-25 23:49:23 +01:00
paboyle
a479325349
Rewrite of local coherence lanczos
2017-10-25 23:48:47 +01:00
paboyle
f6c3f6bf2d
XML serialisation of parms and initialise from parms object
2017-10-25 23:47:59 +01:00
paboyle
d83868fdbb
Identity linear op added -- useful in circumstances where a linear op may or may not be needed.
...
Supply a trivial one if not needed
2017-10-25 23:47:10 +01:00
paboyle
303e0b927d
Improvements for coarse grid compressed lanczos
2017-10-25 23:46:33 +01:00
paboyle
28ba8a0f48
Force spacing more nicely
2017-10-25 23:45:57 +01:00
Azusa Yamaguchi
f9e28577f3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-25 21:07:56 +01:00
Guido Cossu
e0cae833da
Merge branch 'develop' into feature/scalar_adjointFT
2017-10-25 10:49:50 +01:00
Guido Cossu
8a3aae98f6
Solving minor bug in compilation
2017-10-25 10:34:49 +01:00
Guido Cossu
8309f2364b
Solving again the MPI comm bug with FFTs
2017-10-25 10:24:14 +01:00
Daniel Richtmann
d5f661ba70
Save intermediate state
2017-10-25 10:38:26 +02:00
Azusa Yamaguchi
cac1750078
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-24 23:30:36 +01:00
Guido Cossu
e17cd35151
Merge branch 'develop' into feature/scalar_adjointFT
2017-10-24 17:31:22 +01:00
Guido Cossu
ccdec7a7ab
Merge branch 'develop' into feature/clover
2017-10-24 16:51:14 +01:00
Guido Cossu
93642d813d
Merging
2017-10-24 16:48:05 +01:00
Daniel Richtmann
1ab8d5cc13
Save two more files
2017-10-24 16:58:05 +02:00
Daniel Richtmann
789e892865
Save current state
2017-10-24 16:58:04 +02:00
Daniel Richtmann
53cfa44d7a
Save current state
2017-10-24 16:58:03 +02:00
Guido Cossu
0bc381f982
Merge pull request #133 from pretidav/feature/clover
...
Feature/clover
2017-10-24 15:15:21 +01:00
Guido Cossu
2986aa76f8
Restoring Perfcounts
2017-10-24 13:32:02 +01:00
Guido Cossu
657779374b
Adding vscode to gitignore
2017-10-24 13:27:17 +01:00
Guido Cossu
ec8cd11c1f
Cleanup and prepare for pull request
2017-10-24 13:21:17 +01:00
Guido Cossu
cbda4f66e0
Debug of the field strength
2017-10-24 10:20:13 +01:00
Guido Cossu
6579dd30ff
More debug test
2017-10-23 18:47:00 +01:00
Guido Cossu
031c94e02e
Debugging process for the clover term
2017-10-23 18:27:34 +01:00
Guido Cossu
6391b2a1d0
Added test for Wilson and Clover fermions
2017-10-23 14:42:35 +01:00
Guido Cossu
2e50b55ae4
Changes in the Makefile to compile against Chroma on Linux
2017-10-23 13:32:26 +01:00
James Harrison
c433939795
QedFVol: Temporarily remove incomplete implementation of infinite-volume photon
2017-10-20 16:27:58 +01:00
James Harrison
b6a4c31b48
Merge branch 'feature/qed-fvol' of https://github.com/jch1g10/Grid into feature/qed-fvol
2017-10-20 16:25:07 +01:00
James Harrison
98b1439ff9
QedFVol: pass arbitrary input values to photon constructor in UnitEm
2017-10-20 16:24:09 +01:00
Guido Cossu
27936900e6
Putting the FG verbosity in the Integrator level
2017-10-18 13:08:09 +01:00
James Harrison
564738b1ff
Add module for unit EM field
2017-10-17 14:03:57 +01:00
Guido Cossu
cd3e810d25
Merge branch 'develop' into feature/scalar_adjointFT
2017-10-17 11:31:14 +01:00
pretidav
317ddfedee
updated test clover + first attempt derivative clove term (still missing spin part)
2017-10-16 02:47:33 +02:00
paboyle
e325929851
ALl codes compile against the new Lanczos call signature
2017-10-13 14:02:43 +01:00
paboyle
47af3565f4
Logging improvement; reunified the Lanczos codes
2017-10-13 13:23:07 +01:00
paboyle
4b4d187935
Reunified the Lanczos implementations
2017-10-13 13:22:44 +01:00
paboyle
9aff354ab5
Final version prior to reunification
2017-10-13 13:22:26 +01:00
paboyle
cb9ff20249
Approx tests and lanczos improvement
2017-10-13 11:30:50 +01:00
James Harrison
a80e43dbcf
Added infinite-volume photon in Photon.h (not checked yet)
2017-10-11 16:44:51 -04:00
paboyle
9fe6ac71ea
Starting reorg of Blocked lanczos
2017-10-11 10:12:07 +01:00
5c392a6ecc
Merge commit 'bf58557fb1ec710c766e19c9a8809b0a352de239' into feature/scalar_adjointFT
2017-10-10 17:14:56 +01:00
Azusa Yamaguchi
f1fa00b71b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-10 14:26:44 +01:00
paboyle
bf58557fb1
Block compressed Lanczos
2017-10-10 14:15:11 +01:00
paboyle
10cb37f504
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-10 14:09:44 +01:00
Azusa Yamaguchi
1374c943d4
Correct Schur operator called
2017-10-10 13:59:50 +01:00
paboyle
a1d80282ec
cb factorise
2017-10-10 13:49:31 +01:00
paboyle
4eb8bbbebe
Christop mods
2017-10-10 13:48:51 +01:00
paboyle
d1c6288c5f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-10-10 13:38:40 +01:00
Azusa Yamaguchi
dd949bc428
Merge branch 'feature/staggering' into develop
2017-10-10 13:02:51 +01:00
Azusa Yamaguchi
bb7378cfc3
Schur for staggered
2017-10-10 12:02:18 +01:00
Azusa Yamaguchi
f0e084a88c
Schur staggered
2017-10-10 10:00:43 +01:00
paboyle
153672d8ec
Split CG testing
2017-10-09 23:20:58 +01:00
paboyle
08ca338875
Split grid communication
2017-10-09 23:19:45 +01:00
paboyle
f7cbf82c04
Better stdout/err debug
2017-10-09 23:18:48 +01:00
paboyle
07009c569a
Comms splitting improvements
2017-10-09 23:16:51 +01:00
Guido Cossu
15d690e9b9
Adding the cartesian communicator destructor
2017-10-09 09:59:58 +01:00
63b2bc1936
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/fermion/FermionOperatorImpl.h
2017-10-05 14:16:23 +01:00
David Preti
d810e8c8fb
first attempt to write C terms in clover derivative. Some shifts to be fixed
2017-10-05 10:13:53 +02:00
Azusa Yamaguchi
09f4cdb11e
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2017-10-04 10:51:16 +01:00
Azusa Yamaguchi
1e54882f71
Stagger
2017-10-04 10:51:06 +01:00
Guido Cossu
27caff92c6
Merge branch 'feature/scalar_adjointFT' of https://github.com/paboyle/Grid into feature/scalar_adjointFT
2017-10-04 09:44:27 +01:00
d38cee73bf
Scalar: easier Fourier acceleration parametrisation through -D flags
2017-10-03 17:29:34 +01:00
8784f2a88d
post-merge fix
2017-10-03 14:38:10 +01:00
c497864b5d
Merge commit 'd54807b8c0cd1a7658ff8563bb00d1137b987e3e' into feature/scalar_adjointFT
...
# Conflicts:
# lib/communicator/Communicator_base.h
# lib/communicator/Communicator_mpi.cc
# lib/communicator/Communicator_mpit.cc
2017-10-03 14:27:54 +01:00
05c1c88440
Scalar: more action generalisation
2017-10-03 14:26:20 +01:00
paboyle
d54807b8c0
MPIT works with split grid now
2017-10-02 23:14:56 +01:00
Guido Cossu
f6ba2b95ce
Merge branch 'develop' into feature/scalar_adjointFT
2017-10-02 15:19:20 +01:00
paboyle
5625b47c7d
Merge branch 'feature/dwf-multirhs' into develop
2017-10-02 12:42:32 +01:00
paboyle
1edcf902b7
Macos ANON
2017-10-02 12:41:02 +01:00
paboyle
e5c19e1fd7
RB constructor change
2017-10-02 12:25:52 +01:00
paboyle
a11d0a33d1
Merge branch 'feature/dwf-multirhs' of https://github.com/paboyle/Grid into feature/dwf-multirhs
2017-10-02 11:42:07 +01:00
paboyle
4f8b6f26b4
Merge branch 'develop' into feature/dwf-multirhs
2017-10-02 11:41:49 +01:00
paboyle
073525c5b3
Small patch from cori
2017-10-02 03:38:21 -07:00
Azusa Yamaguchi
eb6153080a
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2017-10-02 08:56:33 +01:00
Guido Cossu
f7072d1ac2
Solving an annoying compilation error in json
2017-10-02 07:13:40 +01:00
a021933002
Scalar: SU(N) action change to t'Hooft scaling
2017-09-29 16:09:34 +01:00
James Harrison
b99622d9fb
QedFVol: fix problem with JSON wanting gcc 4.9
2017-09-28 13:34:33 -04:00
937c77ead2
Merge branch 'develop' into feature/qed-fvol
2017-09-28 16:25:20 +01:00
95e5a2ade3
Merge pull request #116 from jch1g10/feature/qed-fvol
...
Feature/qed fvol
2017-09-25 15:08:33 +01:00
David Preti
56478d63a5
clover + test (valence)
2017-09-24 19:32:15 +02:00
df21668f2c
memory profiler update
2017-09-22 14:21:18 +01:00
Guido Cossu
482368e9de
Merge branch 'develop' into feature/scalar_adjointFT
2017-09-21 13:44:08 +01:00
paboyle
fddeb29d6b
Bug fix with spreadout FFT
2017-09-21 11:10:08 +01:00
paboyle
a9ec5cf564
Christoph bug report integrate
2017-09-21 10:32:41 +01:00
Peter Boyle
946a8671b9
Merge pull request #129 from djm2131/feature/eofa
...
Add support for DWF with the exact one flavor algorithm
2017-09-21 10:15:21 +01:00
Azusa Yamaguchi
a6eeea777b
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-09-21 10:12:41 +01:00
Peter Boyle
771a1b8e79
Merge pull request #128 from paboyle/feature/CG-reliable-update
...
Feature/cg reliable update
2017-09-21 10:12:03 +01:00
Peter Boyle
bfb68e6f02
Merge pull request #130 from giltirn/gparity-handunroll
...
Gparity handunroll
2017-09-21 10:11:00 +01:00
Azusa Yamaguchi
77f7737ccc
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-09-19 14:28:01 +01:00
Guido Cossu
9a827d0242
Fixing a compilation error
2017-09-18 14:55:51 +01:00
Guido Cossu
999c623590
Solving a memory leak in Communicator_mpi
2017-09-18 14:39:04 +01:00
paboyle
18c335198a
Merge branch 'hotfix/dirac-ITT-fix1' into develop
2017-09-16 18:19:02 +01:00
paboyle
f9df685cde
Merge branch 'hotfix/dirac-ITT-fix1'
2017-09-16 18:18:48 +01:00
paboyle
17c5b0f152
Patching comparison point
2017-09-16 18:18:07 +01:00
paboyle
5918769f97
Subtle Naik term bug updated in Stencil; less on logical && with a function call on right
2017-09-16 12:51:26 +01:00
Guido Cossu
b542d349b8
Minor cosmetic changes
2017-09-15 11:48:36 +01:00
Guido Cossu
91eaace19d
Added support for FFT accelerated updates
2017-09-15 11:33:45 +01:00
Guido Cossu
bbaf1ada91
Merge branch 'feature/json-fix' into develop
2017-09-08 16:02:08 +01:00
Guido Cossu
1950ac9294
Fixed the Intel compiler problem with the JSON classes
2017-09-08 15:18:59 +01:00
Guido Cossu
13fa70ac1a
Merge branch 'develop' into feature/json-fix
2017-09-08 13:42:20 +01:00
Guido Cossu
7cb2b11f26
Fixing Intel compiler error for the JSON parser
2017-09-08 13:41:53 +01:00
Guido Cossu
1184ed29ae
Merge pull request #124 from nmeyer-ur/feature/arm-neon
...
Added integer reduce functionality
2017-09-08 10:54:35 +02:00
paboyle
203c7bf6fa
Merge branch 'hotfix/dirac-ITT-fix' into develop
2017-09-05 15:08:51 +01:00
paboyle
c709883f3f
Merge branch 'hotfix/dirac-ITT-fix'
2017-09-05 15:08:16 +01:00
paboyle
aed5de4d50
Patching macos compile
2017-09-05 15:07:07 +01:00
paboyle
ba27cc6571
Mac os happiness
2017-09-05 15:00:16 +01:00
paboyle
d856327250
Merge branch 'release/dirac-ITT' into develop
2017-09-05 14:56:12 +01:00
paboyle
d75369cb56
Merge branch 'release/dirac-ITT'
2017-09-05 14:55:54 +01:00
Peter Boyle
bf973d0d56
SHM complete
2017-09-05 14:30:29 +01:00
Peter Boyle
837bf8a5be
Updating to control the SHM allocation scheme under configure time options
2017-09-05 12:51:02 +01:00
Peter Boyle
c05b2199f6
Improvements to huge memory
2017-09-04 10:41:21 -04:00
Azusa Yamaguchi
a5fe07c077
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-09-04 14:10:15 +01:00
Azusa Yamaguchi
b83b2b1415
Stability improvement to BCG. Force m_rr hermitian beyond rounding.
2017-09-04 14:09:47 +01:00
James Harrison
91676d1dda
Fix “MAP_ANONYMOUS undefined” error on OSX.
2017-09-01 15:48:30 +01:00
Peter Boyle
b331be9101
Better reporting
2017-08-31 11:32:57 +01:00
Peter Boyle
49c20a9fa8
Patch to reporting
2017-08-31 11:32:21 +01:00
paboyle
7359df3501
Full reporting for benchmark; save robustness factor
2017-08-31 10:42:35 +01:00
Christopher Kelly
59bd1fe21b
Fix for 'perm' and 'local' not being set for hand-unrolled external-site Dslash, which caused incorrect behavior of G-parity kernel
2017-08-29 13:07:37 -07:00
a56e3b40c4
Merge branch 'develop' into feature/hadrons
2017-08-29 11:03:53 -06:00
Nils Meyer
4e907fef2c
Merge remote-tracking branch 'grid/develop' into feature/arm-neon
2017-08-29 17:47:36 +02:00
Christopher Kelly
67888b657f
Merge branch 'gparity-handunroll' of https://github.com/giltirn/Grid into gparity-handunroll
2017-08-29 09:52:05 -04:00
Christopher Kelly
74af885d4e
Removed some no-longer-needed associated with G-parity hand unrolled kernel
2017-08-29 09:50:37 -04:00
James Harrison
ac3611bb19
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/qed-fvol
2017-08-29 11:53:37 +01:00
Christopher Kelly
d36d2fb40d
Added ability to override default Ls in Benchmark_dwf
2017-08-28 06:53:56 -07:00
Peter Boyle
5b9267e88d
Cleaner comms benchmark treatment for one node runs
2017-08-27 18:24:48 -04:00
paboyle
15fd4003ef
Improving presentation of results
2017-08-27 13:46:02 +01:00
paboyle
4b4c2a715b
fcntl.h needed
2017-08-26 11:38:04 +01:00
paboyle
54a5e6c1d0
Check if we get huge pages on linux. Larry Meadows piece of magic.
2017-08-25 22:36:08 +01:00
paboyle
73aeca7dea
Merge branch 'feature/multi-communicator' into develop
2017-08-25 21:55:09 +01:00
paboyle
ad89abb018
Fix
2017-08-25 20:43:37 +01:00
paboyle
80c5bce5bb
Merge branch 'develop' into feature/multi-communicator
2017-08-25 20:21:26 +01:00
paboyle
f68b5de9c8
No compile fix on Clang
2017-08-25 19:35:21 +01:00
Peter Boyle
d0f3d525d5
Optimal block size for KNL
2017-08-25 19:33:54 +01:00
Christopher Kelly
f365a83fae
In G-parity unrolled kernel, replaced calls to permute and exchange with run-time-evaluated permute type with explicit calls to appropriate underlying functions
2017-08-25 14:24:11 -04:00
Peter Boyle
3a58217405
Updated
2017-08-25 14:29:53 +01:00
Peter Boyle
c289699d9a
updated from cambridge mpi3 shakeout
2017-08-25 11:41:01 +01:00
Peter Boyle
c3b1263e75
Benchmark prep
2017-08-25 09:25:54 +01:00
Christopher Kelly
34a9aeb331
Reduced number of if-statement evaluations in G-parity unrolled kernel
2017-08-24 13:53:50 -07:00
5846566728
Merge branch 'develop' into feature/hadrons
2017-08-24 18:20:52 +01:00
102ea9ae66
CI update
2017-08-24 18:17:09 +01:00
James Harrison
cc4afb978d
Fix bug in non-zero momentum projection
2017-08-24 17:31:44 +01:00
21b02760c3
Merge branch 'develop' into feature/hadrons
2017-08-24 17:05:45 +01:00
Peter Boyle
2bcb704af2
Merge pull request #121 from Lanny91/feature/hadrons
...
Feature/hadrons
2017-08-24 12:59:08 +01:00
paboyle
5fa386ddc9
FFT test compile fixed
2017-08-24 10:17:52 +01:00
Christopher Kelly
edabb3577f
Imported Benchmark_gparity
2017-08-23 16:54:06 -04:00
Christopher Kelly
ce5df177ee
Removed superfluous implementation of G-parity twist for hand-unrolled kernel from GparityWilsonImpl
2017-08-23 15:05:22 -04:00
Christopher Kelly
a0bb8e5b46
Added hand-unrolled kernel implementations of all the other dslash precision / comms precision combinations with G-parity
2017-08-23 14:44:40 -04:00
Christopher Kelly
46f88e6d72
G-parity hand-unrolled intrinsics twist now uses one less permute and one less temporary
2017-08-23 13:21:10 -04:00
David Murphy
dd8f1ea189
Vectorized Mobius EOFA Dperp + shift operation
2017-08-23 13:17:26 -04:00
Christopher Kelly
b61835c1a5
Added inplace version of intrinsic G-parity twist to hand-unrolled kernel
2017-08-23 12:33:48 -04:00
Azusa Yamaguchi
d9cd4f0273
Staggered multinode block cg debugged. Missing global sum.
...
Code stalls and resumes on KNL at cambridge. Curious.
CG iterations 23ms each, then 3200 ms pauses. Mean bandwidth reports
as 200MB/s. Comms dominant in the report. However, the time behaviour suggests it
is *bursty*.... Could be swap to disk?
2017-08-23 15:07:18 +01:00
David Murphy
459f70e8d4
Check-in of working Mobius EOFA class and tests
2017-08-22 22:38:30 -04:00
Christopher Kelly
061e48fd73
Replaced slow unpack-repack in G-parity BC twist with intrinsics version
2017-08-22 18:12:12 -04:00
Christopher Kelly
ab50145001
Implemented first, unoptimized version of hand-unrolled G-parity kernels
...
Improved Test_gparity
2017-08-22 17:12:25 -04:00
paboyle
b49bec0cec
MAP_HUGETLB portability fix
2017-08-20 03:08:54 +01:00
paboyle
ae56e556c6
finalise issue on new OPA revert
2017-08-20 02:53:12 +01:00
paboyle
1cdf999668
Moving multicommunicator into mpi3 also for threading
2017-08-20 02:39:10 +01:00
paboyle
11062fb686
Comms none fail fix
2017-08-20 01:37:07 +01:00
paboyle
383ca7d392
Switch off comms for now until feature/multi-communicator is merged
2017-08-20 01:27:48 +01:00
paboyle
a446d95c33
Trying to pass TeamCity and Travis
2017-08-20 01:10:50 +01:00
paboyle
be66e7dd95
Merge branch 'develop' into feature/multi-communicator
2017-08-19 23:12:38 +01:00
paboyle
6d0d064a6c
Update TODO
2017-08-19 23:11:30 +01:00
paboyle
bfef525ed2
New benchmark prep
2017-08-19 23:10:12 +01:00
Peter Boyle
0b0cf62193
Fix mpi 3 interface change
2017-08-19 13:18:50 -04:00
Peter Boyle
7d88198387
Merge branch 'develop' into feature/multi-communicator
2017-08-19 13:03:35 -04:00
Peter Boyle
2f619482b8
Enable blocking stencil send
2017-08-19 12:53:59 -04:00
Peter Boyle
d6472eda8d
Use mmap
2017-08-19 12:53:18 -04:00
Peter Boyle
9e658de238
Use Vector
2017-08-19 12:52:44 -04:00
Peter Boyle
bcefdd7c4e
Align both allocator calls to 2MB
2017-08-19 12:49:02 -04:00
David Murphy
9d45fca8bc
Implement MobiusEOFAFermioncache.cc
2017-08-17 23:45:36 -04:00
David Murphy
ac9e6b63c0
More re-import of Mobius EOFA
2017-08-17 19:28:53 -04:00
David Murphy
e140b3f802
Beginning to re-import Mobius EOFA
2017-08-16 23:36:23 -04:00
David Murphy
d9d3d30cc7
Minor clean-up
2017-08-16 20:57:51 -04:00
David Murphy
47a12ec7b5
Implement EOFA pseudofermion force and Shamir tests for G-parity and non G-parity cases
2017-08-16 19:50:08 -04:00
David Murphy
ec1e2f7a40
Add (mostly implemented) ExactOneFlavourRatio pseudofermion class and tests of Shamir heatbath and action
2017-08-16 12:38:59 -04:00
David Murphy
41f73ec083
Add ChronoForecast class for forecasting solutions across poles in the EOFA heatbath
2017-08-16 12:37:38 -04:00
Guido Cossu
fd367d8bfd
Debugging the PointerCache
2017-08-16 09:42:57 +01:00
David Murphy
6d0786ff9d
Typo fixes and check-in of G-parity action test for DWF
2017-08-15 22:47:00 -04:00
David Murphy
b7f93aeb4d
Change CayleyFermion5D::SetCoefficientsInternal to virtual to allow overriding in derived EOFA classes
2017-08-15 14:18:51 -04:00
David Murphy
202a7fe900
Re-import DWF and abstract base EOFA fermion classes and tests
2017-08-15 13:36:08 -04:00
Guido Cossu
8d168ded4a
Correction of the dagger version of the Clover
2017-08-15 10:50:44 +01:00
Guido Cossu
8a3fe60a27
Added more asserts at grid creation time
2017-08-08 11:36:20 +01:00
Guido Cossu
44051aecd1
Checking for integer divisions in cartesian full
2017-08-08 10:31:12 +01:00
Guido Cossu
06e6f8de00
Check that the reduced dim is an integer
2017-08-08 10:22:12 +01:00
Guido Cossu
dbe4d7850c
Make a test file compatible with all architectures
2017-08-06 10:49:45 +01:00
Guido Cossu
4fe182e5a7
Added high level HMC support for overriding default SIMD lane decomposition
2017-08-06 10:46:19 +01:00
Guido Cossu
75ee6cfc86
Debugging the Clover term
2017-08-04 16:08:07 +01:00
Guido Cossu
fde71c3c52
Merge branch 'develop' into feature/clover
2017-08-04 12:19:57 +01:00
Guido Cossu
175f393f9d
Binary IO error checking
2017-08-04 12:14:10 +01:00
Christopher Kelly
7d867a8134
Merge branch 'develop' into feature/CG-reliable-update
2017-08-02 09:48:04 -04:00
Christopher Kelly
9939b267d2
Added switching to fallback linear operator in reliable update CG, and added recalculation of b parameter on update.
2017-07-31 13:39:44 -04:00
Lanny91
323e9c439a
Hadrons: Legal banner fixes
2017-07-31 12:26:34 +01:00
Lanny91
28396f1048
Merge branch 'feature/rare_kaon' of https://github.com/Lanny91/Grid into feature/hadrons
2017-07-31 12:19:54 +01:00
Lanny91
67b34e5789
Modified conserved current 5th dimension loop for compatibility with 5D vectorisation.
2017-07-31 11:35:01 +01:00
Peter Boyle
14d53e1c9e
Threaded MPI calls patches
2017-07-29 13:08:10 -04:00
Guido Cossu
8bd869da37
Correcting a bug in the IO routines
2017-07-27 15:12:50 +01:00
Guido Cossu
c7036f6717
Adding checks for libm and libstdc++
2017-07-27 11:15:09 +01:00
Guido Cossu
c0485d799d
Explicit parameter declaration in the WilsonGauge test
2017-07-26 16:26:04 +01:00
Guido Cossu
7abc5613bd
Added smearing to the topological charge observable
2017-07-26 16:21:17 +01:00
Guido Cossu
237cfd11ab
Solving the spurious O2 flags
2017-07-26 12:08:51 +01:00
Guido Cossu
a4b7dddb67
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-07-26 12:07:38 +01:00
Guido Cossu
5696781862
Debug error in Tensor mult
2017-07-26 12:07:34 +01:00
Christopher Kelly
8f4b3049cd
Merge branch 'feature/CG-reliable-update' into ckelly_develop
2017-07-25 11:55:26 -04:00
Christopher Kelly
2a6e673a91
Merge branch 'develop' into feature/CG-reliable-update
2017-07-25 11:54:43 -04:00
Christopher Kelly
9b6cde173f
Merge branch 'feature/CG-reliable-update' into ckelly_develop
2017-07-25 11:51:08 -04:00
Christopher Kelly
9f280b82c4
Added mixed-precision CG with reliable updates
2017-07-25 11:30:41 -04:00
c3f0889eda
Merge pull request #123 from giltirn/develop
...
Fix for 'using namespace' in lib/qcd/utils/GaugeFix.h
2017-07-25 11:32:02 -03:00
Nils Meyer
7a53dc3715
Added integer reduce functionality
2017-07-24 11:12:59 +02:00
Christopher Kelly
0f214ad427
Moved FourierAcceleratedGaugeFixer into Grid::QCD namespace and removed 'using namespace' directives from header
2017-07-21 11:13:51 -04:00
Peter Boyle
fe4912880d
Update README.md
2017-07-17 09:53:07 +01:00
Lanny91
875e1a841f
Hadrons: updated Quark -> MFermion/GaugeProp module name in test.
2017-07-16 13:47:00 +01:00
Lanny91
0366288b1c
Hadrons: added tests for 3pt contractions.
2017-07-16 13:45:55 +01:00
Lanny91
6293d438cd
Hadrons: sink smearing compatibility for 3pt contraction modules.
2017-07-16 13:43:25 +01:00
Lanny91
852ade029a
Hadrons: Added module to sink a propagator
2017-07-16 13:41:47 +01:00
Peter Boyle
f038c6babe
Update README.md
2017-07-14 22:59:16 +01:00
Peter Boyle
169f4b2711
Update README.md
2017-07-14 22:56:06 +01:00
Peter Boyle
2d8aff36fe
Update README.md
2017-07-14 22:52:16 +01:00
Guido Cossu
9fa07eecde
Merge branch 'develop' into feature/json-fix
2017-07-12 15:47:22 +01:00
azusayamaguchi
659d7d1a40
For test/solver
...
Fixed
2017-07-12 15:01:48 +01:00
Guido Cossu
f64fb7bd77
Fix gcc error on JSON compilation
2017-07-12 14:55:42 +01:00
Guido Cossu
2a35449b91
Merge branch 'develop' into feature/json-fix
2017-07-12 14:47:00 +01:00
Guido Cossu
184af5bd05
Added support for std::pair in the JSON serialiser
2017-07-12 14:44:53 +01:00
Guido Cossu
097c9637ee
Fixed the JSON parsing error
2017-07-11 14:31:57 +01:00
azusayamaguchi
dc6f078246
fixed the header file for mpi3
2017-07-11 14:15:08 +01:00
Peter Boyle
8a4714a4a6
Update README.md
2017-07-09 00:11:54 +01:00
Peter Boyle
40e119c61c
NUMA improvements worth preserving from AMD EPYC tests
2017-07-08 22:27:11 -04:00
Guido Cossu
d9593c4b81
Merge branch 'develop' into feature/json-fix
2017-07-07 14:17:50 +01:00
paboyle
ac740f73ce
Works on Cori
2017-07-02 16:47:58 -07:00
paboyle
75dc7794b9
Working on Cori
2017-07-02 16:47:42 -07:00
paboyle
dee68fc728
IO working multiple nodes again. Strategy of all nodes writing metadata is unsafe.
...
Only one rank should do this. must identify this rank. Means pass communicator to the
Objects.
2017-07-02 23:33:48 +01:00
paboyle
a2d3643634
Merge branch 'feature/dwf-multirhs' of https://github.com/paboyle/Grid into feature/dwf-multirhs
2017-07-02 14:59:22 -07:00
paboyle
57002924bc
NERSC shakeout of this
2017-07-02 14:58:30 -07:00
Peter Boyle
7b0237b081
Update README.md
2017-07-01 10:24:41 +01:00
Peter Boyle
b68ad0cc0b
Update README.md
2017-07-01 10:20:07 +01:00
Peter Boyle
37263fd9b1
Update README.md
2017-07-01 10:06:24 +01:00
Peter Boyle
3d09e3e9e0
Update README.md
2017-07-01 10:05:46 +01:00
Peter Boyle
1354b46338
Update README.md
2017-07-01 10:04:32 +01:00
Peter Boyle
251a97fe1b
Update README.md
2017-07-01 09:55:36 +01:00
Peter Boyle
e18929eaa0
Update README.md
2017-07-01 09:53:15 +01:00
Peter Boyle
f3b0a92e71
Update README.md
2017-07-01 09:48:00 +01:00
Peter Boyle
a0be3f7330
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-30 10:53:50 +01:00
Peter Boyle
b5a6e4f1fd
Best option for Xeon cache blocking set
2017-06-30 10:53:22 +01:00
Peter Boyle
7a788db3dc
Guard first touch
2017-06-30 10:49:08 +01:00
Peter Boyle
f20eceb6cd
First touch once per page in a threaded loop
2017-06-30 10:48:27 +01:00
Peter Boyle
38325ebbc6
Interleave code path; not enabled
2017-06-30 10:23:51 +01:00
Peter Boyle
b73bd151bb
Switch off counters by default
2017-06-30 10:16:35 +01:00
Peter Boyle
694b305cab
Update to reporting
2017-06-30 10:16:13 +01:00
Peter Boyle
2d3737a133
O3, KNL
2017-06-30 10:15:59 +01:00
Peter Boyle
ac1f1838bc
KNL only
2017-06-30 10:15:32 +01:00
Guido Cossu
09d09d0fe5
Update README.md
2017-06-29 11:48:11 +01:00
Guido Cossu
bf630a6821
README file update
2017-06-29 11:42:25 +01:00
Guido Cossu
8859a151cc
Small corrections to the NEON port
2017-06-29 11:30:29 +01:00
Guido Cossu
688a39cfd9
Merge pull request #114 from nmeyer-ur/feature/arm-neon
...
ARM neon intrinsics support
Guido: checked and approved
2017-06-29 09:57:17 +01:00
paboyle
6f5a5cd9b3
Improved threaded comms benchmark
2017-06-28 23:27:02 +01:00
Nils Meyer
0933aeefd4
corrected Grid_neon.h
2017-06-28 20:22:22 +02:00
Peter Boyle
322f61acee
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-28 15:30:35 +01:00
Peter Boyle
08e04b9676
Better benchmarks
2017-06-28 15:30:06 +01:00
feaa2ac947
Merge branch 'feature/scalar-hmc-update' into develop
2017-06-28 12:46:18 +01:00
07de925127
minor scalar action fixes
2017-06-28 12:45:44 +01:00
Nils Meyer
a9c816a268
moved file to correct folder
2017-06-27 21:39:15 +02:00
Nils Meyer
e43a8b6b8a
removed comments
2017-06-27 20:58:48 +02:00
Nils Meyer
bf729766dd
removed collision with QPX implementation
2017-06-27 20:32:24 +02:00
Guido Cossu
dafb351d38
Merge pull request #120 from paboyle/feature/scalar-hmc-update
...
Scalar HMC update.
I agree with the changes.
2017-06-27 16:23:14 +01:00
0b707b861c
Merge branch 'develop' into feature/scalar-hmc-update
2017-06-27 14:40:05 +01:00
15e87a4607
HDF5 IO fix
2017-06-27 14:39:27 +01:00
7d7220cbd7
scalar: lambda/4! convention
2017-06-27 14:38:45 +01:00
Lanny91
7d2d5e8d3d
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/hadrons
2017-06-26 15:19:46 +01:00
paboyle
54e94360ad
Experimental: Multiple communicators to see if we can avoid thread locks in --enable-comms=mpit
2017-06-24 23:10:24 +01:00
0af740dc15
minor scalar HMC code improvement
2017-06-24 23:04:05 +01:00
d2e8372df3
SU(N) algebra fix (was not working)
2017-06-24 23:03:39 +01:00
paboyle
869b99ec1e
Threaded calls to multiple communicators
2017-06-24 10:55:54 +01:00
paboyle
4a29ab0d0a
Merge branch 'feature/dwf-multirhs' of https://github.com/paboyle/Grid into feature/dwf-multirhs
2017-06-23 23:10:43 +01:00
paboyle
0165bcb58e
Added an update to TODO list
2017-06-23 23:10:24 +01:00
Lanny91
deca1ecc50
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rare_kaon
2017-06-23 19:35:19 +02:00
4372d04ad4
Merge pull request #118 from Lanny91/hotfix/bgq
...
Hotfix/bgq
2017-06-23 16:59:27 +01:00
paboyle
349d75e483
Precision fix
2017-06-23 02:57:59 -07:00
Lanny91
56abbdf4c2
AVX512 integer reduce fix (for non-intel compiler)
2017-06-23 11:09:14 +02:00
Lanny91
af71c63f4c
AVX2 fix
2017-06-23 11:03:12 +02:00
paboyle
e51475703a
Ticking off lots on the TODO list
2017-06-23 09:42:21 +01:00
paboyle
1feddf4ba6
const fixes
2017-06-22 19:32:41 +01:00
paboyle
600d7ddc2e
Proof of concept : Multi RHS solver, running independent solves on different ranks
2017-06-22 18:54:34 +01:00
paboyle
e504260f3d
Able to run a test job splitting into multiple MPI subdomains.
2017-06-22 18:53:11 +01:00
Lanny91
0440d4ce66
Merge branch 'develop' of https://github.com/paboyle/Grid into hotfix/bgq
2017-06-22 17:09:42 +02:00
Lanny91
08b0e472aa
Fixed hadrons tests after merge
2017-06-22 16:34:33 +02:00
Lanny91
c11d69787e
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
...
# Conflicts:
# extras/Hadrons/Modules.hpp
# extras/Hadrons/Modules/MFermion/GaugeProp.hpp
# extras/Hadrons/modules.inc
# tests/hadrons/Test_hadrons.hpp
# tests/hadrons/Test_hadrons_meson_3pt.cc
2017-06-22 16:26:31 +02:00
Lanny91
dc6b2d30d2
Documentation fix
2017-06-22 16:09:45 +02:00
Lanny91
7a3bd5c66c
Hadrons: new conserved current contraction test (for regression testing)
2017-06-22 16:06:15 +02:00
Lanny91
18211eb5b1
Hadrons: Fixed test to use new implementation of meson module.
2017-06-22 16:03:59 +02:00
Lanny91
863bb2ad10
Moving overly-specialised code out of Grid
2017-06-22 16:02:15 +02:00
paboyle
5e4bea8f20
Benchmark DWF works
2017-06-22 08:38:54 +01:00
paboyle
6ebf9f15b7
Splitting communicators first cut
2017-06-22 08:14:34 +01:00
paboyle
1d7aa673a4
Include BlockCG by default
2017-06-21 21:08:53 +01:00
paboyle
b9104f3072
Block CG
2017-06-21 21:08:03 +01:00
b22eab8c8b
Merge commit 'a7d56523abee6c9030fdd9303c79954897b1086f' into feature/hadrons
2017-06-21 18:32:48 +01:00
paboyle
a7d56523ab
Merge branch 'feature/lanczos-simplify' into develop
2017-06-21 14:03:20 +01:00
paboyle
9e56c65730
Updated TODO list
2017-06-21 14:02:58 +01:00
paboyle
ef4f2b8c41
todo update
2017-06-21 09:22:20 +01:00
paboyle
e8b95bd35b
Clean up finished. Could shrink Lanczos to around 400 lines at a push
2017-06-21 02:50:09 +01:00
paboyle
7e35286860
Simplified lanczos, added Eigen diagonalisation.
...
Curious if we can deprecate dependencly on BLAS.
Will see when we get 48^3 running on our BG/Q port
2017-06-21 02:26:03 +01:00
paboyle
0486ff8e79
Improved the lancos
2017-06-20 18:46:01 +01:00
1e8a2e1621
various compatibility fixes after merge
2017-06-20 17:24:55 +01:00
7587df831a
Merge branch 'develop' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/scalar/ScalarImpl.h
2017-06-20 15:50:39 +01:00
Azusa Yamaguchi
e9cc21900f
Block solver complete for staggered. Now stable on mass 0.003 and
...
gives 8x (!) speed up on Haswell laptop vs. standard CG for 8 RHS solves.
166 iterations vs. 537 iterations so algorithmic gain + 2x in flop rate gain.
Better than a slap in the face with a wet kipper.
2017-06-20 12:37:41 +01:00
Azusa Yamaguchi
0a8faac271
Fix make tests compile
2017-06-19 22:54:18 +01:00
Azusa Yamaguchi
abc4de0fd2
No compile make tests fix
2017-06-19 22:03:03 +01:00
b672717096
Test_serialiation update for JSON
2017-06-19 14:38:39 +01:00
284ee194b1
JSON update
2017-06-19 14:38:15 +01:00
Azusa Yamaguchi
cfe3cd76d1
Block solver improvements
2017-06-19 14:04:21 +01:00
Azusa Yamaguchi
3fa5e3109f
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-19 14:01:44 +01:00
paboyle
8b7049f737
Improved detectino of usqcdInfo for plaq/linktr
2017-06-19 08:46:07 +01:00
paboyle
c85024683e
Merge branch 'feature/parallelio' into develop
2017-06-19 01:39:48 +01:00
paboyle
1300b0b04b
Update to enable multiple records per file more consistent with SciDAC.
...
open, close, write records...
2017-06-19 01:01:48 +01:00
paboyle
e6d984b484
ILDG tests
2017-06-18 00:13:22 +01:00
paboyle
1d18d95d4f
Class name return
2017-06-18 00:13:03 +01:00
paboyle
ae39ec85a3
ComplexField defined
2017-06-18 00:12:48 +01:00
paboyle
b96daf53a0
Query tensor structures
2017-06-18 00:12:15 +01:00
paboyle
46879e1658
Complex defined in Impl even for gauge.
2017-06-18 00:11:45 +01:00
paboyle
ae4de94798
SciDAC I/O support
2017-06-18 00:11:23 +01:00
paboyle
0ab555b4f5
SciDAC I/O and ILDG improvements
2017-06-18 00:11:02 +01:00
paboyle
8e9be9f84f
Updates for SciDAC IO
2017-06-18 00:10:42 +01:00
paboyle
d572170170
Update for SciDAC
2017-06-18 00:10:20 +01:00
81b18f843a
Merge branch 'feature/scalar_adjointFT' into feature/hadrons
...
# Conflicts:
# lib/qcd/action/scalar/ScalarImpl.h
2017-06-16 17:59:55 +01:00
Lanny91
1bd311ba9c
Faster sequential conserved current implementation, now compatible with 5D vectorisation & G-parity.
2017-06-16 16:43:15 +01:00
Lanny91
41af8c12d7
Code cleaning for conserved current contractions. Will now be easier to implement mobius conserved current.
2017-06-16 16:38:59 +01:00
Lanny91
a833f88c32
Added missing SIMD integer reduction implementation for AVX, AVX-512, SSE4, IMCI
2017-06-16 15:58:47 +01:00
Lanny91
07b2c1b253
Placeholder precision change functions to allow Grid to compile with QPX (warning: no actual functionality)
2017-06-16 15:04:26 +01:00
Lanny91
735cbdb983
QPX Integer reduction (+ integer reduction test)
2017-06-14 10:55:10 +01:00
Lanny91
2ad54c5a02
QPX exchange support
2017-06-14 10:53:39 +01:00
paboyle
12ccc73cf5
Serialisation no compile fix
2017-06-14 05:19:17 +01:00
Nils Meyer
3d04dc33c6
ARM neon intrinsics support
2017-06-13 13:26:59 +02:00
paboyle
e7564f8330
Starting a test for reading an ILDG file.
2017-06-13 12:22:50 +01:00
paboyle
91199a8ea0
openmpi is not const safe
2017-06-13 12:21:29 +01:00
paboyle
0494feec98
Libz dependency
2017-06-13 12:00:23 +01:00
paboyle
a16b1e134e
gcc 4.9 fix
2017-06-13 10:48:43 +01:00
James Harrison
20e92a7009
QedVFol: Allow output of scalar propagator and vacuum polarisation projected to arbitrary lattice momentum, not just zero-momentum.
2017-06-12 18:27:32 +01:00
Lanny91
5633a2db20
Faster implementation of conserved current site contraction. Added 5D vectorised support, but not G-parity.
2017-06-12 10:41:02 +01:00
Lanny91
2d433ba307
Changed header include guards to match new convention
2017-06-12 10:32:14 +01:00
paboyle
769ad578f5
Odd new error on G++ 49 on travis
2017-06-12 00:41:21 +01:00
paboyle
eaac0044b5
Compile fixes
2017-06-12 00:20:49 +01:00
paboyle
56042f002c
New files
2017-06-11 23:19:20 +01:00
paboyle
3bfd1f13e6
I/O improvements
2017-06-11 23:14:10 +01:00
James Harrison
42f0afcbfa
QedFVol: Output all scalar VP diagrams separately
2017-06-09 18:08:40 +01:00
Azusa Yamaguchi
70ab598c96
Move gfix into utils
2017-06-08 22:22:23 +01:00
Azusa Yamaguchi
1d0ca65e28
Move Gfix into utils
2017-06-08 22:21:50 +01:00
Azusa Yamaguchi
2bc4d0a20e
Move code into utils
2017-06-08 22:21:25 +01:00
James Harrison
20ac13fdf3
QedFVol: add ChargedProp as an input to ScalarVP module, instead of calculating scalar propagator within ScalarVP.
2017-06-08 17:43:39 +01:00
2490816297
Hadrons: rare kaon program removed
2017-06-07 20:11:02 -05:00
5f55bca378
Hadrons: Quark module renamed MFermion::GaugeProp
2017-06-07 20:10:48 -05:00
James Harrison
e38612e6fa
QedFVol: Update ScalarVP module for compatibility with new scalar action
2017-06-07 17:42:00 +01:00
James Harrison
c2b2b71c5d
Merge branch 'feature/qed-fvol' of https://github.com/paboyle/Grid into feature/qed-fvol
...
# Conflicts:
# extras/Hadrons/Modules.hpp
# extras/Hadrons/modules.inc
2017-06-07 16:59:47 +01:00
James Harrison
009f48a904
QedFVol: Add missing factor of 2 in free vacuum polarisation
2017-06-07 16:34:09 +01:00
Lanny91
b8e45ae490
Fixed remaining fermion type aliases after merge.
2017-06-07 16:26:22 +01:00
Lanny91
b35fc4e7f9
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
...
# Conflicts:
# extras/Hadrons/Global.hpp
# tests/hadrons/Test_hadrons_rarekaon.cc
2017-06-07 14:38:51 +01:00
Lanny91
60f11bfd72
Removed redundant test module
2017-06-07 12:34:47 +01:00
f6aa82b7f2
Merge branch 'develop' into feature/hadrons
2017-06-06 11:46:33 -05:00
22749699a3
Fixes after merge and point sink module
2017-06-06 11:45:30 -05:00
Lanny91
8d442b502d
Sequential current fix for spacial indices.
2017-06-06 17:06:40 +01:00
Lanny91
e5c8b7369e
Boundary condition option in quark actions for hadrons tests.
2017-06-06 14:19:10 +01:00
0503c028be
Merge branch 'feature/qed-fvol' into feature/hadrons (non-trivial conflicts on scalar Impl)
...
# Conflicts:
# configure.ac
# lib/qcd/action/scalar/Scalar.h
2017-06-05 16:37:47 -05:00
Lanny91
c504b4dbad
Code cleaning
2017-06-05 15:56:43 +01:00
Lanny91
622a21bec6
Improvements to sequential conserved current test and small bugfix.
2017-06-05 15:55:32 +01:00
Lanny91
eec79e0a1e
Ward Identity test improvements and conserved current bug fixes
2017-06-05 11:55:41 +01:00
paboyle
092dcd4e04
MPI I/O only if MPI compiled
2017-06-02 22:50:25 +01:00
Guido Cossu
4a8c4ccfba
Test wilson flow, added maxTau for adaptive flow
2017-06-02 17:02:29 +01:00
Guido Cossu
9b44189d5a
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-06-02 16:56:00 +01:00
Guido Cossu
7da4856e8e
Wilson flow with adaptive steps
2017-06-02 16:55:53 +01:00
Guido Cossu
aaf1e33a77
Adding adaptive integration in the WilsonFlow
2017-06-02 16:32:35 +01:00
paboyle
094c3d091a
Improved and RNG's now survive checkpoint
2017-06-02 00:38:58 +01:00
Peter Boyle
4b98e524a0
Roll over to MPI version of I/O
2017-06-01 17:38:18 -04:00
Peter Boyle
1a1f6d55f9
Roll over to MPI IO for parallel IO
2017-06-01 17:37:26 -04:00
Peter Boyle
21421656ab
Big changes improving the code to use MPI IO
2017-06-01 17:36:53 -04:00
Peter Boyle
6f687a67cd
As local vols increase, use 64 bits for safety
2017-06-01 17:36:18 -04:00
paboyle
b30754e762
Merge branch 'feature/parallelio' of https://github.com/paboyle/Grid into feature/parallelio
2017-05-30 23:41:28 +01:00
paboyle
1e429a0d57
Added MPI version
2017-05-30 23:41:07 +01:00
paboyle
d38a4de36c
Beginning move to MPI IO
2017-05-30 23:40:39 +01:00
paboyle
ef1b7db374
Diff comparison check
2017-05-30 23:40:11 +01:00
paboyle
53a9aeb965
Cosmetic only
2017-05-30 23:39:53 +01:00
paboyle
e30fa9f4b8
RankCount; need to clean up ambigious ProcessCount
2017-05-30 23:39:16 +01:00
paboyle
58e8d0a10d
reverse direction lexico mapping
2017-05-30 23:38:30 +01:00
paboyle
62cf9cf638
Cleaner code
2017-05-30 23:38:02 +01:00
paboyle
0fb458879d
Precision safe compile
2017-05-30 23:37:02 +01:00
Peter Boyle
725c513d94
Better MPI3 benchmarking
2017-05-29 16:47:32 -04:00
d8648307ff
Merge branch 'develop' into feature/hadrons
2017-05-29 12:58:08 +01:00
064315c00b
Hadrons: mesons gamma list fix
2017-05-29 12:57:33 +01:00
Guido Cossu
7c6cc85df6
Updating WilsonFlow test
2017-05-27 18:03:49 +01:00
Guido Cossu
a6691ef87c
Merge pull request #110 from Lanny91/feature/hadrons
...
Hadrons: Fermion boundary conditions can now be set in measurement code.
2017-05-26 16:43:22 +01:00
Lanny91
23135aa58a
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
2017-05-26 16:00:50 +01:00
Lanny91
8e0ced627a
Hadrons: Fermion boundary conditions can now be set in measurement code.
2017-05-26 15:59:15 +01:00
Guido Cossu
0de314870d
Faster derivative for WilsonGauge
2017-05-26 14:31:49 +01:00
Guido Cossu
ffb91e53d2
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-26 12:46:02 +01:00
Guido Cossu
f4e8bf2858
Fixing the topological charge. Wilson Flow tested, ok
2017-05-26 12:45:59 +01:00
a74c34315c
Bootstrap script fix
2017-05-25 14:27:49 +01:00
paboyle
69470ccc10
Update to do list
2017-05-25 13:41:26 +01:00
paboyle
b8b5934193
Attempts to speed up the parallel IO
2017-05-25 13:32:24 +01:00
Guido Cossu
75856f2945
Compilation fix in the Tensor_exp
2017-05-25 12:44:56 +01:00
Guido Cossu
3c112a7a25
Small correction to the general exp definition
2017-05-25 12:09:00 +01:00
Guido Cossu
ab3596d4d3
Using Cayley-Hamilton form for the exponential of SU(3) matrices
2017-05-25 12:07:47 +01:00
paboyle
a8c10b1933
Use a global-X x Local-Y chunksize for parallel binary I/O.
...
Gives O(32 x 8 x 18*8*8) chunk size on configuration I/O.
At 150KB should be getting close to packet sizes and 4MB filesystem
block sizes that are reasonably (!?) performant. We shall see once I move
this off my laptop and over to BNL and time it.
2017-05-25 11:43:33 +01:00
Guido Cossu
15e801af3f
Fixing a compilation error for generic SIMD
2017-05-19 16:39:36 +01:00
Guido Cossu
0ffc235741
Adding more statistics to the Benchmark_comms. Min and max
2017-05-19 10:55:04 +01:00
Guido Cossu
8e19c99c7d
Adding more statistical info in the Benchmark_comms
2017-05-18 19:07:35 +01:00
Guido Cossu
a0bc0ad06f
Reverting change in Bechmark_comms. Keeping 300 iterations
2017-05-18 17:48:11 +01:00
Guido Cossu
a8fb2835ca
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-18 14:45:00 +01:00
Guido Cossu
bc862ce3ab
Fixing an allocation issue in Benchmark_comms
2017-05-18 14:44:56 +01:00
Lanny91
08b314fd0f
Hadrons: conserved current test fixes. Axial current tests now also optional.
2017-05-18 13:16:14 +01:00
22f4feee7b
Merge branch 'develop' into feature/scalar_adjointFT
2017-05-17 13:27:13 +02:00
3f858d6755
Scalar: phi^2 observable
2017-05-17 13:25:14 +02:00
paboyle
3267683e22
Union workaround for g++
2017-05-17 11:26:18 +01:00
Azusa Yamaguchi
f46a67ffb3
No compile issue on clang on mac fixed.
...
Compiler version was clang++-3.9 under mpicxx
2017-05-17 10:51:01 +01:00
paboyle
f7b8383ef5
Half precisoin comms mixed prec test
2017-05-16 14:52:51 +01:00
Guido Cossu
10f2872aae
Faster exponentiation for lattice fields
2017-05-15 15:51:16 +01:00
Lanny91
34332fe393
Improvement to sequential conserved current insertion tests
2017-05-12 16:30:43 +01:00
Lanny91
c2010f21ab
Added sequential propagator test for gamma matrix insertion
2017-05-12 16:23:01 +01:00
Lanny91
98f610ce53
Reduced code duplication in hadron tests
2017-05-12 16:15:26 +01:00
Lanny91
d44cc204d1
Added test module for sequential gamma matrix insertion
2017-05-12 14:58:17 +01:00
35fa3d1dfd
Merge branch 'master' into feature/scalar_adjointFT
2017-05-12 10:41:39 +01:00
paboyle
cd73897b8d
Merge branch 'release/v0.7.0' into develop
2017-05-12 01:16:02 +01:00
paboyle
c4435e6beb
Merge branch 'release/v0.7.0'
2017-05-12 01:15:59 +01:00
paboyle
7a8f6af5f8
Drop verbose compiler predefine check
2017-05-11 12:48:40 +01:00
paboyle
49a5d9bac7
Clang major, minor trailing underscore
2017-05-11 12:25:02 +01:00
paboyle
2b3fdd4a58
Print CXX predefines
2017-05-11 12:05:50 +01:00
paboyle
34502ec471
4.8 dropped as buggy.
2017-05-11 11:43:39 +01:00
paboyle
8a43e88b4f
Compiler check early in build
2017-05-11 11:43:06 +01:00
d1ece74137
HMC scalar test: magnetisation measurement
2017-05-11 11:40:44 +01:00
paboyle
238df20370
Still working on the compiler compat checks
2017-05-11 11:30:14 +01:00
paboyle
97a32a6145
Add 4.8 test
2017-05-11 11:24:21 +01:00
paboyle
655492a443
Compiler detection
2017-05-11 11:21:11 +01:00
paboyle
1cab06f6bd
Compat checks for compilers
2017-05-11 10:20:24 +01:00
43c817cc67
Scalar action: const fix
2017-05-11 00:07:17 +01:00
paboyle
f8024c262b
Update Eigen
2017-05-10 13:30:09 +01:00
Guido Cossu
4cc5f01f4a
Small change in the readme about the intel compiler
2017-05-09 15:38:59 +01:00
James Harrison
5cfc0180aa
QedFVol: Output free VP along with charged VP.
2017-05-09 12:46:57 +01:00
James Harrison
914f180fa3
QedFVol: Implement exact O(alpha) vacuum polarisation.
2017-05-09 11:46:25 +01:00
Guido Cossu
9c12c37aaf
Confirming the fix on the complex boundary conditions
2017-05-09 08:41:29 +01:00
Guido Cossu
806eaa0530
Adding back the IO tests in the list
2017-05-08 22:26:44 +01:00
Guido Cossu
01d0e54594
Merge branch 'release/v0.7.0' into develop
2017-05-08 22:02:51 +01:00
Guido Cossu
5aafa335fe
Fixing JSON error for complex numbers
2017-05-08 21:56:44 +01:00
Guido Cossu
8ba0494485
Fixing JSON for complex numbers
2017-05-08 21:41:39 +01:00
Peter Boyle
d99d98d9fd
Merge branch 'release/v0.7.0' of https://github.com/paboyle/Grid into release/v0.7.0
2017-05-08 15:08:20 -04:00
Peter Boyle
95a017a4ae
Relax force constraints to pass in single precision.
2017-05-08 15:06:41 -04:00
paboyle
92f92379e6
Adding olivers test version
2017-05-08 18:42:19 +01:00
paboyle
529e78d43f
Restart the v0.7.0 release
2017-05-08 18:20:04 +01:00
paboyle
4ec746d262
Merge branch 'release/v0.7.0' into develop
2017-05-06 18:43:03 +01:00
paboyle
51bf1501fc
Merge branch 'release/v0.7.0'
2017-05-06 18:42:50 +01:00
paboyle
66d819c054
More info on gcc bug
2017-05-06 18:42:11 +01:00
paboyle
3f3686f869
formattign
2017-05-06 18:41:27 +01:00
paboyle
26bb829f8c
Formatting
2017-05-06 18:40:55 +01:00
paboyle
67cb04fc66
README update
2017-05-06 18:39:54 +01:00
paboyle
a40bd68aed
Version update
2017-05-06 17:00:14 +01:00
paboyle
36495e0fd2
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-06 16:39:27 +01:00
paboyle
93f6c15772
Warning squash
2017-05-06 16:38:58 +01:00
Peter Boyle
cb93eeff21
Update README
2017-05-06 16:28:12 +01:00
paboyle
c7cc7e6101
Fix
2017-05-06 16:10:09 +01:00
paboyle
c349aa6511
DEFINE warning elimination
2017-05-06 16:08:35 +01:00
paboyle
3bae0a2d5c
Drop a gcc warning
2017-05-06 15:51:42 +01:00
paboyle
c1c7566089
GCC bug work around in 5.0 through 6.2 inclusive.
2017-05-06 15:20:25 +01:00
paboyle
2439999ec8
Warning elimination; drop to -O2 on G++ bad versions
2017-05-06 14:44:49 +01:00
paboyle
1d96f662e3
Fixed 4d fermion gparity force. Put strong tests on make check force tests
2017-05-06 00:46:31 +01:00
paboyle
41d1889941
trusty ubuntu
2017-05-05 21:25:35 +01:00
paboyle
0c3981e0c3
Trying to force recent automake
2017-05-05 21:15:22 +01:00
paboyle
c727bd4609
Trying to work around automake version
2017-05-05 21:00:00 +01:00
paboyle
db23749b67
Adding travis to make check
2017-05-05 20:42:08 +01:00
paboyle
751f2b9703
Better check and benchmark driving
2017-05-05 19:54:38 +01:00
Guido Cossu
741bc836f6
Exposing support for Ncolours and Ndimensions and JSON input file for the ScalarAction
2017-05-05 17:36:43 +01:00
James Harrison
6cb563a40c
QedFVol: Access HVP tensor using a vector<vector<ScalarField>> instead of vector<vector<ScalarField*>>
2017-05-05 17:12:41 +01:00
paboyle
697c0603ce
SITMO I/O for NERSC working now bit repro
2017-05-05 16:54:44 +01:00
paboyle
14bedebb11
Source pointed to
2017-05-05 16:17:27 +01:00
Guido Cossu
8546d01a4c
Merge branch 'develop' into feature/scalar_adjointFT
2017-05-05 15:47:33 +01:00
paboyle
47b5c07ffb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-05 14:27:02 +01:00
Guido Cossu
da86a2bf54
Merge branch 'feature/hmc_generalise' into develop
2017-05-05 14:23:02 +01:00
paboyle
c1cb60a0b3
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-05 14:22:37 +01:00
Guido Cossu
5ed5b4bfbf
Merge branch 'develop' into feature/hmc_generalise
2017-05-05 14:22:33 +01:00
Guido Cossu
de84aacdfd
Fixing a configure error for the smearing tests
2017-05-05 13:59:10 +01:00
paboyle
2888003765
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-05 13:02:24 +01:00
paboyle
da06bf5b95
Zmobius force test added
2017-05-05 12:52:45 +01:00
Guido Cossu
20999c1370
Merge branch 'develop' into feature/hmc_generalise
2017-05-05 12:47:17 +01:00
Lanny91
77e0af9c2e
Compilation fix after merge - conserved current code not yet operational for vectorised 5D or Gparity Impl.
2017-05-05 12:27:50 +01:00
paboyle
33f0ed1a33
No compile fix
2017-05-05 11:04:30 +01:00
paboyle
50be56433b
Delete old and defunct tests
2017-05-04 23:41:16 +01:00
paboyle
43924007db
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-04 19:53:41 +01:00
paboyle
78ef10e60f
Mobius force improvement
2017-05-04 19:53:21 +01:00
Lanny91
ca1077c560
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rare_kaon
...
# Conflicts:
# lib/qcd/action/fermion/WilsonFermion5D.cc
# tests/hadrons/Test_hadrons_rarekaon.cc
2017-05-04 16:22:33 +01:00
679ae98b14
Merge branch 'feature/better-external-library' into develop
2017-05-04 15:42:12 +01:00
paboyle
90f6bc16bb
No compile clang fix
2017-05-04 12:15:06 +01:00
Peter Boyle
9b5b639546
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-05-03 20:51:40 -04:00
Peter Boyle
945767c6d8
More info
2017-05-03 20:26:35 -04:00
Peter Boyle
422cdf4979
Some checks
2017-05-03 18:37:38 -04:00
Peter Boyle
38db174f3b
Print statement
2017-05-03 18:25:26 -04:00
Peter Boyle
92e364a35f
Better reporting in benchmark for MPI3
2017-05-03 15:43:36 -04:00
James Harrison
db3837be22
QedFVol: Change “double” to “Real” in ScalarVP.cc
2017-05-03 13:26:49 +01:00
James Harrison
2f0dd83016
Calculate HVP using a single contraction of O(alpha) charged propagators.
2017-05-03 12:53:41 +01:00
58299b8ba2
Git info separated from version in git-config
2017-05-02 20:04:41 +01:00
124bf4d829
git ref in config summary
2017-05-02 19:41:01 +01:00
e8e56b3414
Config summary saved in git-config
2017-05-02 19:40:47 +01:00
89c430136d
grid-config program
2017-05-02 19:13:13 +01:00
ea9aef7baa
New header for standard headers (was an issue with Remez.h and external compilation)
2017-05-02 18:26:11 +01:00
c9e9e8061d
Merge branch 'feature/hadrons' into develop
2017-05-02 18:23:47 +01:00
Guido Cossu
453cf2a1c6
Moving the topological charge outside the HMC related routines
2017-05-02 14:40:12 +01:00
Guido Cossu
de7bbfa5f9
Adding ParameterFile option for the HMC
2017-05-02 12:16:16 +01:00
dda8d77c87
Merge branch 'feature/hadrons' into feature/rare_kaon
2017-05-01 17:50:57 +01:00
aa29f4346a
Hadrons: weird bus error with recent macOS clang
2017-05-01 17:49:08 +01:00
Guido Cossu
86116dbed6
Adding boundary condition switch (compile time) for the Mobius HMC example
2017-05-01 16:33:11 +01:00
Guido Cossu
7bd31e3f7c
Adding external file support in the Mobius example (JSON)
2017-05-01 16:30:24 +01:00
Guido Cossu
74f451715f
Fix for Mac compilation on the size_t uint64_t types
2017-05-01 15:12:07 +01:00
Guido Cossu
655be8ed76
Adding tests for the mobius operator
2017-05-01 14:42:16 +01:00
Guido Cossu
4063238943
Adding HMC test file example for Mobius + smearing
2017-05-01 13:44:00 +01:00
Guido Cossu
3344788fa1
Merge branch 'develop' into feature/hmc_generalise
2017-05-01 12:13:56 +01:00
Guido Cossu
62a64d9108
EO support, wip
2017-05-01 11:06:21 +01:00
Lanny91
49331a3e72
Minor improvements to Ward Identity checks
2017-04-28 16:50:17 +01:00
Lanny91
51d84ec057
Bugfixes in Wilson 5D sequential conserved current insertion
2017-04-28 16:49:14 +01:00
Lanny91
db14fb30df
Hadrons: overhaul of conserved current test
2017-04-28 16:48:00 +01:00
Lanny91
b9356d3866
Added more complete test of sequential insertion of conserved current.
2017-04-28 16:46:40 +01:00
Guido Cossu
99a73f4287
Correcting the M and Mdag in the clover term
2017-04-28 15:51:05 +01:00
Lanny91
f302eea91e
SitePropagator redefined to be a scalar object in TYPE_ALIASES.
2017-04-28 15:27:49 +01:00
Guido Cossu
5553b8d2b8
Clover term compiles, not tested
2017-04-28 15:23:34 +01:00
Lanny91
a6ccbbe108
Conserved current sequential source now registered properly and fixed module inputs.
2017-04-28 10:43:47 +01:00
James Harrison
3ac27e5596
QedFVol: remove unnecessary copies of free propagator from shifted sources in ScalarVP
2017-04-27 14:17:50 +01:00
Peter Boyle
99220f6531
Fixes and better timing
2017-04-26 17:24:11 -04:00
Lanny91
d2003f24f4
Corrected incorrect usage of ExtractSlice for conserved current code.
2017-04-26 17:25:28 +01:00
Lanny91
6299dd35f5
Hadrons: Added test of conserved current code. Tests Ward identities for conserved vector and partially conserved axial currents.
2017-04-26 12:41:39 +01:00
Lanny91
a39daecb62
Removed make_5D const declaration to avoid compilation error
2017-04-26 12:39:07 +01:00
Lanny91
159770e21b
Legal Banners added
2017-04-26 09:32:57 +01:00
paboyle
2a6d093749
move the sudo: required to match locatoin on Guido's branch
2017-04-26 09:15:34 +01:00
paboyle
c947947fad
sudo required suggested by guido
2017-04-26 08:45:36 +01:00
paboyle
f555b50547
Merge branch 'feature/half-prec-comms' into develop
2017-04-26 08:43:40 +01:00
paboyle
738c1a11c2
longer nloop
2017-04-26 08:43:20 +01:00
Peter Boyle
f8797e1e3e
bug fix. works now and great face performance
2017-04-26 03:14:02 -04:00
Peter Boyle
fd1eb7de13
Clean implementation of the exterior faces listing only those points on the boudary
2017-04-26 02:34:52 -04:00
Peter Boyle
2ce898efa3
Pretty code
2017-04-26 02:34:25 -04:00
Lanny91
dc5a6404ea
Hadrons: modules for testing conserved current contractions and sequential insertion.
2017-04-25 22:08:33 +01:00
Lanny91
44260643f6
First conserved current implementation for Wilson fermions only. Not implemented for Gparity or 5D-vectorised Wilson fermions.
2017-04-25 18:00:24 +01:00
Lanny91
1425afc72f
Rare Kaon test fix
2017-04-25 17:26:56 +01:00
James Harrison
bd466a55a8
QedFVol: remove charge dependence in chargedProp function of ScalarVP
2017-04-25 10:04:03 +01:00
paboyle
ab66bac4e6
Think I'm getting on top of the reduced cost exterior precomputed list of links
2017-04-25 08:50:26 +01:00
paboyle
56277a11c8
Build a list of whats on the surface
2017-04-24 17:06:15 +01:00
Guido Cossu
752048f410
Merge branch 'develop' into feature/clover
2017-04-24 14:41:20 +01:00
paboyle
916e9e1d3e
Merge branch 'feature/half-prec-comms' of https://github.com/paboyle/Grid into feature/half-prec-comms
2017-04-24 10:39:19 +01:00
Peter Boyle
5b55867a7a
Slightly cheaper Ext assembly
2017-04-24 05:36:11 -04:00
Peter Boyle
3accb1ef89
Debugged assemply split phase with interior suppression
2017-04-23 19:30:19 -04:00
Peter Boyle
e3d0e31525
Debugged assemply split phase with interior suppression
2017-04-23 19:29:27 -04:00
Peter Boyle
5812eb8a8c
Partially fixed. But the comms-overlap does not work yet.
2017-04-22 18:50:25 -04:00
paboyle
4dd3763294
Use OMP as much as possible
2017-04-22 20:35:20 +01:00
paboyle
c429ace748
Cleaner OpenMP use
2017-04-22 20:28:42 +01:00
paboyle
ac58565d0a
Dangerous rewrite of the assembly. If I make a mistake the debug will be painful.
2017-04-22 19:31:04 +01:00
paboyle
3703b718aa
Mark up a table if a given site only receives from itself; including MPI3 splitting info.
2017-04-22 19:28:37 +01:00
paboyle
b722889234
Try a better load balancing loop
2017-04-22 19:27:41 +01:00
paboyle
abba44a837
Hand unrolled for overlapped comms
2017-04-22 17:45:17 +01:00
paboyle
f301be94ce
Fixed
2017-04-22 17:42:31 +01:00
Peter Boyle
1d1b225497
Hand unrolled Nc=3 kernels support split phase compute (on-node, off-node).
2017-04-22 09:05:28 -04:00
Peter Boyle
53a785a3dd
Fixing the KNL compile
2017-04-22 08:11:51 -04:00
paboyle
736bf3c866
Major rework of stencil. Half precision and MPI3 now working.
2017-04-22 11:33:50 +01:00
paboyle
b9bbe5d188
L1p config bg/q
2017-04-22 11:33:09 +01:00
paboyle
3844bcf800
If no f16c instructions supported must use software half precision conversion.
...
This will also become useful on BG/Q, so will move out from SSE4 into a general area.
Lifted the Eigen half precision from web. Looks sensible, but not extensively regressed
against the intrinsics implementation yet.
2017-04-20 15:30:52 +01:00
paboyle
e1a2319d01
Simple compressor moved out of cshift into stencil
2017-04-20 13:18:15 +01:00
paboyle
180c732b4c
Move compressors out of Cshift.
...
Slice iterators would help
2017-04-20 13:17:55 +01:00
paboyle
957a706d0b
Useful script
2017-04-20 13:17:44 +01:00
paboyle
d2312e9874
Drop compressor entirely from Cshift to only Stencil.
2017-04-20 13:16:55 +01:00
paboyle
fc4ab9ccd5
Working half precision comms
2017-04-20 11:20:26 +01:00
paboyle
4a340aa5ca
Massive compressor rework to support reduced precision comms
2017-04-20 09:28:27 +01:00
paboyle
3b7de792d5
Type comparison in the traits work
2017-04-18 13:28:04 +01:00
paboyle
557c3fa109
Pretty change
2017-04-18 13:27:38 +01:00
paboyle
ec18e9f7f6
Merge branch 'develop' into feature/half-prec-comms
2017-04-18 11:39:39 +01:00
paboyle
a839d5bc55
Updated todo list
2017-04-18 11:22:17 +01:00
paboyle
de41b84c5c
Merge branch 'feature/normHP' into develop
2017-04-18 10:57:21 +01:00
paboyle
8e161152e4
MultiRHS solver improvements with slice operations moved into lattice and sped up.
...
Block solver requires a lot of performance work.
2017-04-18 10:51:55 +01:00
paboyle
3141ebac10
MultiRHS working, starting to optimise. Block doesn't and I thought it already was; puzzled.
2017-04-17 10:50:19 +01:00
paboyle
7ede696126
Non compile of tests fixed
2017-04-16 23:40:00 +01:00
paboyle
bf516c3b81
higher precision reduction variables in norm and inner product
2017-04-15 12:27:28 +01:00
paboyle
441a52ee5d
First cut at higher precision reduction
2017-04-15 10:57:21 +01:00
paboyle
a8db024c92
Cleaning up the dense matrix and lanczos sector
2017-04-15 08:54:11 +01:00
paboyle
a9c22d5f43
Verbose removal
2017-04-14 14:38:49 +01:00
paboyle
3ca41458a3
Fix to no USE_FP16 case
2017-04-14 14:20:54 +01:00
paboyle
9e2d29c644
USE_FP16 macro
2017-04-14 14:17:14 +01:00
Guido Cossu
b694996302
adding comments
2017-04-14 13:30:14 +01:00
Peter Boyle
951be75292
Half precision conversion working on AVX512 now too
2017-04-13 17:35:11 +01:00
James Harrison
c8e6f58e24
Fix typos in ScalarVP
2017-04-13 17:04:37 +01:00
Peter Boyle
b9113ed310
Patches for knl
2017-04-13 12:02:12 -04:00
James Harrison
888988ad37
Merge branch 'feature/qed-fvol' of https://github.com/paboyle/Grid into feature/qed-fvol
...
# Conflicts:
# lib/qcd/action/fermion/Fermion.h
2017-04-13 15:54:40 +01:00
1407418755
Old qed-fvol program build disabled
2017-04-13 15:32:30 +01:00
a6a0da873f
Merge branch 'feature/hadrons' into feature/qed-fvol
2017-04-13 15:31:06 +01:00
paboyle
42fb49d3fd
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-04-13 14:12:47 +01:00
paboyle
2a54c9aaab
Merge branch 'feature/block-cg' into develop
2017-04-13 14:12:24 +01:00
paboyle
0957378679
Fixing conditional ugly way
2017-04-13 13:47:56 +01:00
paboyle
2ed6c76fc5
Getting multiline if then fi working
2017-04-13 13:43:13 +01:00
paboyle
d3b9a7fa14
F16c apparently requires AVX, even if the 128 bit are used.
...
Seems odd.
2017-04-13 13:19:11 +01:00
paboyle
75ea306ce9
Another try at travis
2017-04-13 13:05:32 +01:00
paboyle
4226c633c4
Default to FP16 off again
2017-04-13 12:51:39 +01:00
paboyle
5a4eafbf7e
.travis
2017-04-13 12:50:43 +01:00
paboyle
eb8e26018b
Travis update for macos
2017-04-13 12:35:11 +01:00
paboyle
db5ea001a3
Update to use Xcode 8.3 since -mfp16 causes SIGILL
2017-04-13 12:22:40 +01:00
paboyle
2846f079e5
Predicate tests on fp16 being enabled
2017-04-13 12:08:05 +01:00
paboyle
1d502e4ed6
FP16 optional compile time
2017-04-13 11:55:24 +01:00
paboyle
73cdf0fffe
Drop f16c from SSE because of a macos compile error on travis
2017-04-13 11:23:41 +01:00
paboyle
1c25773319
Trap illegal instructions
2017-04-13 10:51:40 +01:00
paboyle
c38400b26f
Trap signals
2017-04-13 10:35:20 +01:00
paboyle
9c3065b860
Debug flags off again
2017-04-13 10:01:32 +01:00
paboyle
94eb829d08
Align cast fixed for __mm128i gcc complained
2017-04-13 08:40:44 +01:00
paboyle
68392ddb5b
Exchange in generic
...
Precision change in AVX, SSE, AVX512, Generic. QPX still to do.
2017-04-13 08:38:12 +01:00
paboyle
cb6b81ae82
Half precision conversion
2017-04-12 19:32:37 +01:00
Lanny91
c382c351a5
Quark test output correction.
2017-04-12 14:36:15 +01:00
Lanny91
af2d6ce2e0
Encapsulated 4D->5D and 5D->4D conversions in separate functions & added corresponding tests.
2017-04-12 14:36:02 +01:00
90ec6eda0c
Rare K test solver name fix
2017-04-10 17:48:58 +01:00
Lanny91
ac1253bb76
Corrected solver in rare kaon test
2017-04-10 17:42:55 +01:00
fe8d625694
Merge commit '5e477ec553aa48d7d19b5a7c45d41acbb3392bcb' into feature/rare_kaon
2017-04-10 17:23:37 +01:00
53e76b41d2
Merge branch 'develop' into feature/hadrons
2017-04-10 17:00:53 +01:00
8ef4300412
spurious .dirstamp files removed
2017-04-10 17:00:22 +01:00
98a24ebf31
The macro “magics” is very intensive for the preprocessor in the measurement code which has numerous serialisable classes. Reducing the number of serialisable fields to 64 (instead of 1024) helps a lot, this is enough for now and can be extended trivially if needed in the future.
2017-04-10 16:58:54 +01:00
James Harrison
e4a105a30b
Merge branch 'feature/qed-fvol' of https://github.com/paboyle/Grid into feature/qed-fvol
2017-04-10 16:35:01 +01:00
James Harrison
26ebe41fef
QedFVol: Implement charged propagator calculation within ScalarVP module
2017-04-10 16:33:54 +01:00
paboyle
b12dc89d26
Commenting and clean up
2017-04-10 20:38:20 +09:00
paboyle
d80d802f9d
MultiRHS solver test
2017-04-10 00:12:12 +09:00
paboyle
3d99b09dba
Start of blockCG
2017-04-09 23:42:10 +09:00
paboyle
db5f6d3ae3
Verbose fix
2017-04-09 23:41:30 +09:00
paboyle
683550f116
Const args improvement
2017-04-09 23:41:04 +09:00
Lanny91
5e477ec553
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
2017-04-07 11:51:09 +01:00
paboyle
55d0329624
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2017-04-07 11:08:14 +09:00
paboyle
86aaa35294
Christoph needs SchurDiagTwoKappa which is mobius specific.
2017-04-07 11:07:40 +09:00
Guido Cossu
363611ae21
Merge branch 'develop' into feature/clover
2017-04-05 16:26:04 +01:00
Guido Cossu
172d3dc93a
Correcting names in tests
2017-04-05 16:24:04 +01:00
Guido Cossu
3b8a791e28
Merge branch 'develop' into feature/clover
2017-04-05 16:20:28 +01:00
Guido Cossu
7b03d8d087
Fixing the remaining merge conflicts
2017-04-05 16:17:46 +01:00
Guido Cossu
4b759b8f2a
Merge branch 'feature/hmc_generalise' into feature/scalar_adjointFT
2017-04-05 14:50:28 +01:00
Guido Cossu
8c540333d5
Merge branch 'develop' into feature/hmc_generalise
2017-04-05 14:41:04 +01:00
Guido Cossu
6fd82228bf
Working on the derivative
2017-04-05 10:51:44 +01:00
paboyle
5592f7b8c1
Creation mode better implementation
2017-04-05 02:35:34 +09:00
paboyle
35da4ece0b
UID fix
2017-04-05 02:18:15 +09:00
paboyle
061b15b9e9
Merge branch 'feature/sitmo-skipahead' into develop
2017-04-05 01:24:49 +09:00
Guido Cossu
ca6efc685e
Merge branch 'develop' into feature/clover
2017-04-04 10:19:02 +01:00
1e496fee74
Merge branch 'develop' into feature/qed-fvol
...
# Conflicts:
# lib/qcd/action/fermion/Fermion.h
2017-04-03 19:02:57 +01:00
ff4e54ef80
Merge branch 'develop' into feature/hadrons
2017-04-03 18:56:21 +01:00
paboyle
561426f6eb
Clean up
2017-04-02 23:13:48 +09:00
paboyle
83f6fab8fa
Big/Small crush test, and fast SITMO rng init, faster but not ideal
...
MT and Ranlux init.
2017-04-02 12:10:51 +09:00
paboyle
0fade84ab2
No random device
2017-04-02 00:29:40 +09:00
paboyle
9dc7ca4c3b
Sitmo fast init
2017-04-02 00:28:22 +09:00
paboyle
935d82f5b1
sanity checks
2017-04-02 00:27:28 +09:00
paboyle
9cbcdd65d7
No random device seed
2017-04-02 00:26:57 +09:00
paboyle
f18f5ed926
Drop random device
2017-04-02 00:26:26 +09:00
paboyle
d1d63a4f2d
sitmo default
2017-04-02 00:26:05 +09:00
paboyle
7e5faa0f34
Multiple RNGs
2017-04-02 00:25:44 +09:00
paboyle
6af459cae4
Christoph's coefficients.
2017-03-31 17:07:43 +09:00
paboyle
1c4bc7ed38
Debugged staggered conventions
2017-03-31 14:41:48 +09:00
Lanny91
cd1bd921bd
Reduced code duplication for Weak Hamiltonian contraction modules
2017-03-30 18:02:14 +01:00
Guido Cossu
b8ae787b5e
Correcting a simple typo
2017-03-30 11:33:15 +01:00
Guido Cossu
fbe2c3b5f9
]Merge branch 'develop' into feature/clover
2017-03-30 11:18:31 +01:00
Guido Cossu
1ed69816b9
First steps for the force term
2017-03-30 11:14:27 +01:00
Lanny91
fff5751b1a
HADRONS: Updated rare kaon test program, including all contractions. Sink smearing still to be implemented.
2017-03-30 10:57:01 +01:00
Lanny91
2c81696fdd
HADRONS: 4pt Weak + current disconnected topology (e.g. for rare neutral kaon decays)
2017-03-30 10:37:17 +01:00
Lanny91
c9dc22efa1
HADRONS: Standalone disconnected loop contraction.
2017-03-30 10:33:18 +01:00
Lanny91
0ab04a000f
HADRONS: 3pt contraction with gamma insertion between two propagators.
2017-03-30 10:30:58 +01:00
paboyle
93ea5d9468
Pretty code
2017-03-30 15:00:03 +09:00
paboyle
1ec5d32369
Chulwoo's test to zmobius helped me shake out
2017-03-30 13:45:13 +09:00
paboyle
9fd23faadf
Pretty layout
2017-03-30 13:44:45 +09:00
paboyle
10e4fa0dc8
Template instantiation improvements
2017-03-30 13:44:25 +09:00
paboyle
c4aca1dde4
Conjugate coefficients on adjoint
2017-03-30 13:44:05 +09:00
paboyle
b9e8ea3aaa
conjugate coefficient on the dagger
2017-03-30 13:43:13 +09:00
paboyle
077aa728b9
Fix the ZMobius (I think)
2017-03-30 13:42:09 +09:00
paboyle
a8d83d886e
Macro controls
2017-03-30 13:31:34 +09:00
paboyle
7fd46eeec4
Trailing whitespace removal
2017-03-30 13:31:10 +09:00
paboyle
e0c4eeb3ec
Compiles again
2017-03-30 13:30:45 +09:00
paboyle
cb9a297a0a
Chulwoo's Zmobius test
2017-03-30 13:30:25 +09:00
paboyle
2b115929dc
Small AVX512 asm ifdef patch
2017-03-29 18:51:23 +09:00
paboyle
5c6571dab1
Merge branch 'feature/bgq-asm' into develop
2017-03-29 18:48:55 +09:00
paboyle
417ec56cca
Release candidate
2017-03-29 05:45:33 -04:00
paboyle
756bc25008
Verbose header print by default
2017-03-29 04:44:17 -04:00
paboyle
35695ba57a
Bug fix in MPI3
2017-03-29 04:43:55 -04:00
paboyle
81ead48850
Log any errors to a file
2017-03-29 04:39:52 -04:00
paboyle
d805867e02
Better init
2017-03-28 13:25:05 -04:00
paboyle
e55a751e23
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
2017-03-28 12:20:12 -04:00
paboyle
358eb75995
Shorten loop
2017-03-28 12:20:02 -04:00
paboyle
98f9318279
Build on AVX2 and MPI passing with clang++
2017-03-28 23:16:04 +09:00
paboyle
4b17e8eba8
Merge branch 'develop' into feature/bgq-asm
...
Conflicts:
lib/qcd/action/fermion/Fermion.h
lib/qcd/action/fermion/WilsonFermion.cc
lib/util/Init.cc
tests/Test_cayley_even_odd_vec.cc
2017-03-28 04:49:30 -04:00
paboyle
75112a632a
IO improvements to fail on IO error
2017-03-28 02:28:04 -04:00
paboyle
18bde08d1b
Merge branch 'feature/staggering' into develop
2017-03-28 15:25:55 +09:00
James Harrison
9f755e0379
Add functions momD1 and momD2 to ScalarVP
2017-03-27 16:49:18 +01:00
James Harrison
4512dbdf58
Rename module ScalarFV to ScalarVP
2017-03-27 15:02:16 +01:00
James Harrison
483fd3cfa1
Add propagator expansion terms as inputs to ScalarFV
2017-03-27 13:24:51 +01:00
Guido Cossu
3750b9ffee
Deleting MPI test for OSX in Travis
2017-03-27 16:53:32 +09:00
Guido Cossu
5e549ebd8b
Adding force terms
2017-03-27 16:43:15 +09:00
Guido Cossu
fff484eca5
Populating Clover fermions methods
2017-03-27 15:12:57 +09:00
Guido Cossu
5fdc05782b
More in the clover fermion class
2017-03-27 10:54:16 +09:00
paboyle
d45cd7e677
Adding a simple read of NERSC test
2017-03-26 09:24:26 -04:00
paboyle
4e96679797
Added a bnl log
2017-03-25 09:25:46 -04:00
James Harrison
85516e9c7c
Output all terms of scalar propagator separately
2017-03-24 17:13:55 +00:00
James Harrison
0c006fbfaa
Add ScalarFV inputs to ScalarFV.hpp
2017-03-24 11:59:09 +00:00
James Harrison
54c10a42cc
Add source and emField inputs to ScalarFV module
2017-03-24 11:42:32 +00:00
Guido Cossu
a04eb7df5d
Starting Clover term
2017-03-24 12:43:28 +09:00
Guido Cossu
4c1ea8677e
Small cosmetic changes and vscode gitignore
2017-03-23 14:09:35 +09:00
paboyle
fc93f0b2ec
Save some code for static huge tlb's. It is ifdef'ed out but an interesting root only experiment.
...
No gain from it.
2017-03-21 22:30:29 -04:00
paboyle
8c8473998d
Average over whole cluster the comm time.
2017-03-21 22:29:51 -04:00
James Harrison
ef0fe2bcc1
Added empty ScalarFV module
2017-03-21 11:39:46 +00:00
Guido Cossu
120fb59978
Adding tests for WilsonFlow classes
2017-03-21 16:11:35 +09:00
Guido Cossu
fd56b3ff38
Merge branch 'develop' into feature/hmc_generalise
2017-03-21 13:33:41 +09:00
Guido Cossu
0ec6829edc
Fixing compilation errors for the WilsonFlow
2017-03-21 13:06:32 +09:00
Guido Cossu
18b7845b7b
Adding WilsonFlow smearing
2017-03-21 11:52:05 +09:00
Guido Cossu
3d0fe15374
Added topological charge measurement
2017-03-17 16:14:57 +09:00
Guido Cossu
91886068fe
Fixed seg fault for observable modules
2017-03-17 13:59:31 +09:00
Guido Cossu
6d1e9e5f92
Small cleanup of the observables
2017-03-17 11:42:55 +09:00
Guido Cossu
b640230b1e
Moving hmc observables in a different directory
2017-03-17 11:40:17 +09:00
paboyle
e7c36771ed
ZMobius prep for asm
2017-03-15 14:23:33 -04:00
Guido Cossu
038b6ee9cd
Fixing JSON compilation error
2017-03-16 01:09:24 +09:00
Guido Cossu
38806343a8
Improving efficiency of the force term
2017-03-15 15:16:16 +09:00
Guido Cossu
831ca4e3bf
Added Scalar action for fields in the adjoint representation
2017-03-14 14:55:18 +09:00
paboyle
8dc57a1e25
Layout change
2017-03-13 11:11:46 +00:00
paboyle
f57bd770b0
Merge branch 'bugfix/dminus' into feature/bgq-asm
2017-03-13 11:11:03 +00:00
paboyle
4ed10a3d06
Merge branch 'develop' into feature/bgq-asm
2017-03-13 11:10:10 +00:00
Peter Boyle
dfefc70b57
Merge pull request #93 from Lanny91/hotfix/qpx
...
Some fixes for QPX and generic SIMD types.
2017-03-13 09:31:26 +00:00
Chulwoo Jung
0b61f75c9e
Adding ZMobius CG test
2017-03-13 00:12:43 -04:00
Chulwoo Jung
33edde245d
Changing Dminus(Dag) to use full vectors to work correctly
2017-03-12 23:02:42 -04:00
paboyle
b64e004555
MPI run fail on macos
2017-03-13 01:59:01 +00:00
paboyle
447c5e6cd7
Z mobius hermiticity correction
2017-03-13 01:30:43 +00:00
paboyle
8b99d80d8c
Merge branch 'bgq-asm-shmemfixes' into feature/bgq-asm
2017-03-12 23:30:09 +00:00
Guido Cossu
b3dede4dd3
Merge branch 'develop' into feature/hmc_generalise
2017-03-10 23:57:37 +09:00
Guido Cossu
4e34132f4d
Correcting modules use in test files
2017-03-10 23:54:53 +09:00
Guido Cossu
c07cb10247
Merge branch 'feature/hmc_generalise' of https://github.com/paboyle/Grid into feature/hmc_generalise
2017-03-10 22:37:25 +09:00
Guido Cossu
d7767a2a62
Few more tests
2017-03-10 22:33:48 +09:00
Guido Cossu
ec035983fd
Fixing the implicit integration
2017-03-01 11:56:35 +00:00
paboyle
3901b17ade
timeings from BNL
2017-02-28 17:06:45 -05:00
paboyle
af230a1fb8
Average the time across the whole machine for outliers
2017-02-28 17:05:22 -05:00
Christopher Kelly
06a132e3f9
Fixes to SHMEM comms
2017-02-28 13:31:54 -08:00
Guido Cossu
596dcd85b2
Auxiliary fields
2017-02-27 13:16:38 +00:00
paboyle
96d44d5c55
Header fix
2017-02-24 19:12:11 -05:00
Guido Cossu
7270c6a150
Integrator works now
2017-02-24 17:03:42 +00:00
Lanny91
7fe797daf8
SIMD vector length sanity checks
2017-02-23 16:49:44 +00:00
Lanny91
486a01294a
Corrected QPX SIMD width
2017-02-23 16:47:56 +00:00
paboyle
586a7c90b7
Merge branch 'develop' into feature/bgq-asm
2017-02-23 00:26:59 +00:00
paboyle
e099dcdae7
Merge branch 'develop' into feature/bgq-asm
2017-02-23 00:25:29 +00:00
paboyle
4e7ab3166f
Refactoring header layout
2017-02-22 18:09:33 +00:00
paboyle
aac80cbb44
Bug fix from Chris K
2017-02-22 12:19:09 -05:00
Lanny91
c80948411b
Added tRotate function and MaddRealPart struct for generic SIMD, bugfix in MultRealPart and minor cosmetic changes.
2017-02-22 14:57:10 +00:00
Lanny91
95625a7bd1
Use Grid Integer type
2017-02-22 13:09:32 +00:00
Lanny91
0796696733
Emulated integer vector type for QPX and generic SIMD instruction sets.
2017-02-22 12:01:36 +00:00
Peter Boyle
f8b9ad7d50
Merge pull request #91 from sunpho84/public_modules_memebers
...
making public same serializable parameters in HMC Module
2017-02-22 00:53:20 +00:00
Peter Boyle
04a1959895
Merge pull request #90 from sunpho84/liming
...
adding --with switch to pass lime path
2017-02-22 00:52:53 +00:00
Peter Boyle
cc773ae70c
Merge pull request #89 from sunpho84/prepend_package_with_grid
...
Prepending PACKAGE_ with GRID_ in Config.h
2017-02-22 00:52:10 +00:00
Peter Boyle
d21c51b9be
Merge pull request #88 from sunpho84/pickpoketting
...
now it is possible to pass {coords list} to a peek or poke
2017-02-22 00:51:33 +00:00
Peter Boyle
597a7b4b3a
Merge pull request #81 from edbennett/develop
...
Fix misleading message: "doxygen-pdf requires doxygen-pdf"
2017-02-22 00:50:59 +00:00
azusayamaguchi
1c30e9a961
Verified
2017-02-21 23:01:25 +00:00
Francesco Sanfilippo
93cc270016
making public same serializable parameters in HMC Module
...
RNGModuleParameters
GridModuleParameters
2017-02-21 23:11:56 +01:00
Francesco Sanfilippo
29b60f7e1a
adding --with switch to pass lime path
2017-02-21 23:09:39 +01:00
Francesco Sanfilippo
041884acf0
Prepending PACKAGE_ with GRID_ in Config.h
...
Avoid polluting linking progr
2017-02-21 22:51:36 +01:00
Francesco Sanfilippo
15e668eef1
now it is possible to pass {coords list} to a peek or poke
2017-02-21 22:48:38 +01:00
azusayamaguchi
bf7e3f20d4
Staggaered fermion optimised version
2017-02-21 14:35:42 +00:00
Guido Cossu
902afcfbaf
Adding metric and the implicit steps
2017-02-21 11:30:57 +00:00
paboyle
3ae92fa2e6
Global changes to parallel_for structure.
...
Move the comms flags to more sensible names
2017-02-21 05:24:27 -05:00
paboyle
3906cd2149
Stencil fix on BNL KNL system
2017-02-20 17:51:31 -05:00
paboyle
5a1fb29db7
Useful debug code info to preserve
2017-02-20 17:49:23 -05:00
paboyle
661fc4d3d1
Debug AVX512 exchange code paths
2017-02-20 17:48:36 -05:00
paboyle
41009cc142
Move excange into the stencil only; keep Cshift fully general
2017-02-20 17:48:04 -05:00
paboyle
37720c4db7
Count bytes off node only
2017-02-20 17:47:40 -05:00
paboyle
1a30455a10
1000 iters on bmark for more accurate timing
2017-02-20 17:47:01 -05:00
Guido Cossu
97a6b61551
Covariant laplacian and implicit integration
2017-02-20 11:17:27 +00:00
paboyle
cd0da81196
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
2017-02-16 18:52:30 -05:00
paboyle
f246fe3304
Improvements to avx for invertible to avoid latent bug
2017-02-16 23:52:44 +00:00
paboyle
8a29c16bde
Faster gather exchange
2017-02-16 23:52:22 +00:00
paboyle
d68907fc3e
Debug temp
2017-02-16 18:51:35 -05:00
paboyle
5c0adf7bf2
Make clang happy with parenthesis
2017-02-16 23:51:33 +00:00
paboyle
be3a8249c6
Faster gather
2017-02-16 23:51:15 +00:00
paboyle
bd600702cf
Vectorise the XYZT face gathering better.
...
Hard coded for simd_layout <= 2 in any given spread out direction; full generality is inconsistent
with efficiency.
2017-02-15 11:11:04 +00:00
Lanny91
f011bdb869
Fixed overwrite of pminus projection in construction of 4d propagator from 5d.
2017-02-14 14:07:17 +00:00
Guido Cossu
bafb101e4f
Testing different versions of the Laplacian
2017-02-13 15:38:11 +00:00
Guido Cossu
08fdf05528
Added and tested the covariant laplacian + CG solver
2017-02-13 15:05:01 +00:00
paboyle
aca7a3ef0a
Optimisation control improvements
2017-02-10 18:22:31 -05:00
Guido Cossu
9e72a6b22e
Reverting to Xcode 7.3
2017-02-10 12:57:03 +00:00
Guido Cossu
1c12c5612c
Xcode 8.2 for travis
2017-02-10 12:12:01 +00:00
Guido Cossu
a8193c4bcb
Correcting travis compilation on gcc
2017-02-10 10:59:30 +00:00
Guido Cossu
c3d7ec65fa
All tests compile.
2017-02-10 10:27:51 +00:00
Guido Cossu
8b6a6c8236
Resolving small merge conflict
2017-02-09 16:20:24 +00:00
Guido Cossu
e0571c872b
Merge branch 'develop' into feature/hmc_generalise
2017-02-09 16:12:00 +00:00
Guido Cossu
c67f41887b
Reverting parameters to original
2017-02-09 15:59:56 +00:00
Guido Cossu
84687ccf1f
Handling an Intel compiler warning for Json class
2017-02-09 15:33:33 +00:00
Guido Cossu
3274561cf8
Cleanup
2017-02-09 15:18:38 +00:00
e08fbb3771
Merge pull request #84 from Lanny91/feature/rare_kaon
...
Rare Kaon decay contraction code
2017-02-08 08:23:34 -08:00
Lanny91
d7464aa0fe
Switched from XmlWriter to CorrWriter in contraction code
2017-02-08 16:13:44 +00:00
Lanny91
00d29153f0
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/rare_kaon
2017-02-08 16:11:15 +00:00
2ce989f220
Hadrons: default I/O to HDF5
2017-02-08 07:50:05 -08:00
Lanny91
d7a1dc85be
Revert "Hadrons: test for rare kaon contraction code."
...
This reverts commit 1e257a1251 .
2017-02-08 13:23:05 +00:00
Lanny91
fc19503673
Removed MSink namespace.
2017-02-08 13:17:39 +00:00
Lanny91
beba824136
Make use of GammaL class in Weak Hamiltonian contractions
2017-02-08 12:45:39 +00:00
Lanny91
6ebf8b12b6
Removed unnecessary repeat of write in Weak Hamiltonian contractions
2017-02-08 12:43:13 +00:00
Lanny91
e5a7ed4362
Moved write outside of loop, some physics corrections
2017-02-08 12:29:33 +00:00
Lanny91
b9f7ea47c3
Access hasModule function directly from Environment instance.
2017-02-08 10:10:06 +00:00
Lanny91
06f7ee202e
Revert "Add function to say whether or not a module exists in application class"
...
This reverts commit 522f6bf91a .
2017-02-08 10:08:18 +00:00
Lanny91
2b2fc6453f
Fixed single precision compatibility issues
2017-02-07 13:59:29 +00:00
Lanny91
bdd2765461
Added missing allocation of Weak Hamiltonian result vector
2017-02-07 13:06:42 +00:00
paboyle
2c246551d0
Overlap comms and compute options in wilson kernels
2017-02-07 01:37:10 -05:00
paboyle
71ac2e7940
Faster RNG init
2017-02-07 01:33:23 -05:00
paboyle
2bf4688e83
Running on BNL KNL
2017-02-07 01:32:10 -05:00
paboyle
a48ee6f0f2
Don't use MPI3_leader any more. No real gain and complex
2017-02-07 01:31:24 -05:00
paboyle
73547cca66
MPI3 working i think
2017-02-07 01:30:02 -05:00
paboyle
123c673db7
Policy to control async or sync SendRecv
2017-02-07 01:24:54 -05:00
paboyle
61f82216e2
Communicator Policy, NodeCount distinct from Rank count
2017-02-07 01:22:53 -05:00
paboyle
8e7ca92278
Debugged cshift case
2017-02-07 01:21:32 -05:00
paboyle
485ad6fde0
Stencil working in SHM MPI3
2017-02-07 01:20:39 -05:00
paboyle
6ea2184e18
OMP define change
2017-02-07 01:17:16 -05:00
paboyle
fdc170b8a3
Parallel fors in lattice transfer
2017-02-07 01:16:39 -05:00
paboyle
060da786e9
Comms benchmark improvements
2017-02-07 01:07:39 -05:00
paboyle
85c7bc4321
Bug fixes for cases that physics code couldn't hit but latent
...
and discovered on KNL (long vector, y SIMD dir) and checker dir set to y.
Remove the assertions on these code paths now they are tested.
2017-02-07 01:01:15 -05:00
paboyle
0883d6a7ce
Overlap comms compute support; make reg naming consistent with bgq aasm
2017-02-07 00:59:32 -05:00
paboyle
9ff97b4711
Improved stencil tests passing all on KNL multinode
2017-02-07 00:58:34 -05:00
paboyle
b5e9c900a4
Better printing and signal handling options
2017-02-07 00:57:55 -05:00
paboyle
4bbdfb434c
Overlap comms compute modifications
2017-02-07 00:57:01 -05:00
Lanny91
4a45c06dd7
Code cleaning and addition of Weak Hamiltonian contraction log message
2017-02-06 20:12:30 +00:00
Lanny91
d6a7d7d1e0
Hadrons: added missing momentum parameter in rare kaon contraction test
2017-02-06 18:15:49 +00:00
Lanny91
1a122a0dd8
Hadrons: corrected gamma matrix inputs in rare kaon test
2017-02-06 17:35:41 +00:00
Lanny91
20e20733e8
Merge branch 'feature/hadrons' into feature/rare_kaon
2017-02-06 14:12:21 +00:00
Lanny91
b7cd1a19e3
Utilities for reading and writing "pair" objects.
2017-02-06 14:08:59 +00:00
Lanny91
f510002a62
Merge remote-tracking branch 'paboyle/feature/hadrons' into feature/hadrons
2017-02-03 14:37:34 +00:00
Christopher Kelly
c94133af49
Added iteration reporting to CG and mixed CG
...
Added ability to manually change the initial CG inner tolerance in mixed CG
Added .hpp files to filelist script
2017-02-02 17:04:42 -05:00
eedcaf6470
Merge branch 'feature/hadrons' into feature/qed-fvol
2017-02-01 15:53:10 -08:00
e7d8030a64
operator>> for serialisable enums
2017-02-01 15:51:08 -08:00
d775fbb2f9
Gammas: code cleaning and gamma_L implementation & test
2017-02-01 15:45:05 -08:00
863855f46f
header fix
2017-02-01 11:59:44 -08:00
419af7610d
New gamma matrices tidying: generated code is confined to Gamma.* for readability
2017-02-01 11:23:12 -08:00
Lanny91
1e257a1251
Hadrons: test for rare kaon contraction code.
2017-02-01 16:36:40 +00:00
Lanny91
522f6bf91a
Add function to say whether or not a module exists in application class
2017-02-01 16:36:08 +00:00
Lanny91
d35d87d2c2
Weak Hamiltonian Eye-type contraction execution
2017-02-01 16:33:24 +00:00
Lanny91
74a5cda84b
Removed unnecessary "3pt" labels
2017-02-01 15:03:49 +00:00
Lanny91
5be05d85b8
Fixed collision of Wall source and sink header ifdefs
2017-02-01 13:56:22 +00:00
Lanny91
35ac85aea8
Updated Weak Hamiltonian contractions to use zero-flop gamma matrices
2017-02-01 12:57:34 +00:00
Lanny91
fa237401ff
Consistent variable name in macro
2017-02-01 12:56:55 +00:00
Lanny91
97053adcb5
Merge branch 'feature/hadrons' into feature/rare_kaon
2017-02-01 10:13:29 +00:00
Lanny91
f8fbe4d7a3
Merge remote-tracking branch 'paboyle/feature/hadrons' into feature/hadrons
...
# Conflicts:
# extras/Hadrons/Modules/MContraction/Meson.hpp
# tests/hadrons/Test_hadrons_meson_3pt.cc
Updated Meson.hpp to utilise zero-flop gamma matrices.
2017-02-01 09:27:00 +00:00
Lanny91
ef31c012bf
Merge remote-tracking branch 'paboyle/develop' into feature/hadrons
2017-01-31 17:36:10 +00:00
7da7d263c4
typo
2017-01-30 10:53:13 -08:00
1140573027
Gamma adj fix: now in Grid namespace to avoid collisions
2017-01-30 10:53:04 -08:00
Lanny91
9e9f621d5d
Hadrons: added Weak Hamiltonian module dependencies, some reformatting.
2017-01-30 17:54:21 +00:00
Lanny91
651e1a7cbc
Hadrons: Momentum inserted as multiples of 2*pi/L
2017-01-30 17:14:33 +00:00
a0cfbb6e88
Merge branch 'feature/gammas' into feature/hadrons
...
# Conflicts:
# .gitignore
# lib/qcd/spin/Dirac.cc
# scripts/filelist
2017-01-30 09:10:49 -08:00
Lanny91
c4d3672720
Hadrons: Momentum projection in meson module.
2017-01-30 17:09:04 +00:00
515a26b3c6
gammas: copyright update
2017-01-30 09:07:09 -08:00
Guido Cossu
16be6d378c
Now action factory support different Fields (templated)
2017-01-30 14:22:41 +00:00
Guido Cossu
f05d0565aa
Adding ScalarField theory
2017-01-30 10:59:28 +00:00
b39f0d1fb6
Hadrons: default I/O to HDF5 if possible, XML otherwise
2017-01-27 18:12:35 -08:00
9f1267dfe6
Merge branch 'feature/qed-fvol' of github.com:paboyle/Grid into feature/qed-fvol
2017-01-27 17:06:34 -08:00
2e90285232
Merge pull request #80 from jch1g10/feature/qed-fvol
...
ChargedProp: remove ScalarField fs
2017-01-27 17:06:13 -08:00
e254de982e
Merge branch 'feature/qed-fvol' of github.com:paboyle/Grid into feature/qed-fvol
2017-01-27 17:02:35 -08:00
28d99b5297
Merge branch 'develop' into feature/qed-fvol
2017-01-27 16:59:53 -08:00
c946d3bf3f
Merge branch 'develop' of github.com:edbennett/Grid into develop
2017-01-27 22:12:11 +00:00
1c68098780
fix misleading message: "doxygen-pdf requires doxygen-pdf"
2017-01-27 22:04:26 +00:00
Lanny91
9bf4108d1f
Weak Hamiltonian contraction modules, for Eye and NonEye contraction topologies. Execution for NonEye type diagrams has been implemented, but not yet for Eye type.
2017-01-27 16:58:11 +00:00
Guido Cossu
899e685627
Merge branch 'feature/sitmo_rng' into develop
2017-01-27 14:15:56 +00:00
James Harrison
ee93f0218b
ChargedProp: remove ScalarField fs
2017-01-27 12:22:48 +00:00
Guido Cossu
6929a84c70
Reformatting files
2017-01-27 11:54:44 +00:00
Guido Cossu
5c779a789b
Moving registrations in an independent file
2017-01-27 11:23:51 +00:00
161ed102a5
Merge pull request #79 from jch1g10/feature/qed-fvol
...
Fixed bug in ChargedProp
2017-01-26 19:49:14 -08:00
3bf993d81a
gitignore update
2017-01-26 17:00:59 -08:00
fad743fbb1
Build system sanity check: corrected several headers not in the <Grid/*> format
2017-01-26 17:00:41 -08:00
Guido Cossu
e863a948e3
Cleaning up files and directories
2017-01-26 15:24:49 +00:00
James Harrison
f65a585236
ChargedProp: Switch to HDF5 output
2017-01-26 15:02:30 +00:00
Lanny91
977f34dca6
Added missing typename
2017-01-26 13:18:33 +00:00
Lanny91
90ad956340
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/rare_kaon
2017-01-26 12:08:41 +00:00
Guido Cossu
7996f06335
Commented out registrations.
...
Move to an independent file that is linked only for the factory managed HMC
2017-01-25 18:27:45 +00:00
Guido Cossu
ef8d3831eb
Temporary patch the threading error in InsertSlice and ExtractSlice
...
Find source and fix the error
2017-01-25 18:12:04 +00:00
Guido Cossu
70ed9fc40c
Updating the engine to the last version
2017-01-25 18:10:41 +00:00
Guido Cossu
7b40a3e3e5
Reorganizing files
2017-01-25 18:09:46 +00:00
4d3787db65
Hadrons fixed for new gammas, Meson only does one contraction but this’ll change in the future
2017-01-25 09:59:00 -08:00
Guido Cossu
677757cfeb
Added and tested SITMO PRNG
2017-01-25 12:47:22 +00:00
Guido Cossu
f7fbbaaca3
Compiles after merging
2017-01-25 12:11:58 +00:00
Guido Cossu
17629b8d9e
Merge branch 'develop' into feature/hmc_generalise
2017-01-25 11:33:53 +00:00
Guido Cossu
0baa20d292
Againg fixing compilation on Travis, no LIME lib present
2017-01-25 11:18:44 +00:00
Guido Cossu
4571c918a4
Fixing compilation error when compiling without LIME
2017-01-25 11:14:43 +00:00
Guido Cossu
5251ea4d30
Adding more fermion action modules, generalised DWF
2017-01-25 11:10:44 +00:00
05cb6d318a
gammas: adjoint implemented as a symbolic operation
2017-01-24 18:07:43 -08:00
0432e30256
Gamma right multiply code fix (now passes consistency check)
2017-01-24 17:36:23 -08:00
2c3ebc1e07
.gitignore update
2017-01-24 17:35:42 -08:00
068b28af2d
Extensive gamma test program
2017-01-24 17:35:29 -08:00
f7db342f49
Serialisable enums can be converted to int
2017-01-24 17:33:26 -08:00
d65e81518f
Merge branch 'feature/hadrons' into develop
2017-01-24 09:21:44 -08:00
Guido Cossu
7f456b4173
👷 Added all pseudofermion actions to the serialiser
2017-01-24 13:57:32 +00:00
a37e71f362
New automatic implementation of gamma matrices, Meson and SeqGamma are broken
2017-01-23 19:13:43 -08:00
James Harrison
ae99e99da2
Fixed bug in ChargedProp
2017-01-23 17:27:50 +00:00
Lanny91
c291ef77b5
Merge branch 'feature/hadrons' of https://github.com/paboyle/Grid into feature/hadrons
2017-01-23 15:24:47 +00:00
Lanny91
7dd2764bb2
Wall sink smearing
2017-01-23 15:17:54 +00:00
Guido Cossu
244f8fb6dc
Added JSON parser (without NextElement)
2017-01-23 14:57:38 +00:00
azusayamaguchi
05c1924819
Timing loop change
2017-01-23 10:43:45 +00:00
f3ca29af6c
Merge branch 'feature/hadrons' into feature/qed-fvol
2017-01-21 13:41:05 -08:00
b7da264b0a
Hadrons: Application is not storing the environment ref but calling getInstance() each time, solving a very nasty set fault on Linux/KNL
2017-01-21 13:40:23 -08:00
37988221a8
Merge branch 'feature/serialisation-hdf5' into feature/qed-fvol
2017-01-20 14:04:20 -08:00
74ac2aa676
Merge branch 'feature/serialisation-hdf5' into feature/hadrons
2017-01-20 14:03:51 -08:00
4c75095c61
HDF5: header fix
2017-01-20 12:14:01 -08:00
afa095d33d
HDF5: better complex number support
2017-01-20 12:10:41 -08:00
6b5259cc10
HDF5 detects if a name is a dataset or not without using exception catching
2017-01-20 11:03:19 -08:00
Guido Cossu
27dfe816fa
Added TwoFlavorsEO
...
Had to remove a conformability check in the Derivative of SchurDiff,
see the comments in the file
2017-01-20 16:59:31 +00:00
Lanny91
af29be2c90
Simplified operation of meson module. Result has been modified to output one contraction at a time for each pair of gamma insertions at source and sink.
2017-01-20 16:38:50 +00:00
Guido Cossu
f96fac0aee
All functionalities ready.
...
Todo: add all the fermion action modules
2017-01-20 12:56:20 +00:00
7423a352c5
HDF5: typos
2017-01-19 18:33:04 -08:00
81e66d6631
HDF5: revert back to native types
2017-01-19 18:24:53 -08:00
ade1058e5f
Hdf5Type does not need to be a pointer anymore
2017-01-19 18:23:55 -08:00
6eea9e4da7
HDF5 types static initialisation is mysteriously buggy on BG/Q, changing strategy
2017-01-19 18:02:53 -08:00
2c673666da
Standardisation of HDF5 types
2017-01-19 17:19:12 -08:00
7a327a3f28
Merge branch 'develop' into feature/qed-fvol
2017-01-19 14:22:36 -08:00
Lanny91
07f2ebea1b
Meson module now takes list of gamma matrices to insert at source and sink.
2017-01-19 22:18:42 +00:00
d6401e6d2c
Merge branch 'feature/hadrons' into develop
2017-01-19 14:10:01 -08:00
24d3d31b01
Genetic scheduler: uses insert instead of emplace for better compiler compatibility
2017-01-19 14:08:22 -08:00
Guido Cossu
851f2ad8ef
Adding fermions actions support in the factories
2017-01-19 10:00:02 +00:00
5405526424
Code typo
2017-01-18 22:42:19 -08:00
f3f0b6fef9
serious rewriting of Test_serialisation, now crashes if IO inconsistent
2017-01-18 17:41:05 -08:00
654e0b0fd0
Serialisable object are now comparable with ==
2017-01-18 17:40:32 -08:00
4be08ebccc
debug code cleaning
2017-01-18 17:39:59 -08:00
f599cb5b17
HDF5 serial IO implemented and tested
2017-01-18 16:50:21 -08:00
Guido Cossu
23e0561dd6
Added all required functionalities, time for cleaning
...
All actions to be added
2017-01-18 16:31:51 +00:00
a4a509497a
Merge branch 'develop' of github.com:paboyle/Grid into develop
2017-01-17 16:22:22 -08:00
5803933aea
First implementation of HDF5 serial IO writer, reader is still empty
2017-01-17 16:21:18 -08:00
Lanny91
8ae1a95ec6
Legal banners and module descriptions
2017-01-17 18:14:20 +00:00
Lanny91
82b7d4eaf0
Added noise loop dependencies
2017-01-17 15:58:32 +00:00
Lanny91
78774fbdc0
Construct loop propagator
2017-01-17 15:29:45 +00:00
Guido Cossu
924130833e
Moved more parameters to serialization
2017-01-17 13:22:18 +00:00
Guido Cossu
7cf833dfe9
Fixed compilation error in tests hadrons (capital letter in dir name)
2017-01-17 11:00:54 +00:00
Guido Cossu
0157274762
HMC factories
2017-01-17 10:46:49 +00:00
Guido Cossu
87e8aad5a0
Added support for input file HMC modules (missing the actions yet)
2017-01-16 16:07:12 +00:00
Guido Cossu
c6f59c2933
Adding factories
2017-01-16 10:18:09 +00:00
91a3534054
Lattice slice utilities now thread safe
2017-01-16 06:32:25 +00:00
16a8e3d0d4
gitignore update for ST3
2017-01-16 06:32:05 +00:00
Lanny91
b7f90aa011
Added momentum choice for wall source
2017-01-13 15:54:19 +00:00
92f8950a56
Charged scalar prop: cleaning and output
2017-01-13 13:30:56 +00:00
65987a8a58
First implementation of the scalar QED propagator, runs but absolutely not checked
2017-01-12 20:44:23 +00:00
889d828bc2
Code cleaning
2017-01-12 18:17:44 +00:00
Lanny91
f22b79da8f
Added missing type aliases
2017-01-12 12:52:12 +00:00
Lanny91
3855673ebf
Added header for wall source
2017-01-12 11:42:37 +00:00
Lanny91
4db82da0db
Wall sources
2017-01-12 11:41:10 +00:00
Lanny91
0cdc3d2fa5
Merge remote-tracking branch 'refs/remotes/paboyle/feature/hadrons' into feature/hadrons
2017-01-12 11:26:55 +00:00
ad98b6193d
creating the necessary caches for the FFT EM scalar propagator
2017-01-11 18:40:43 +00:00
fc760016b3
More uniform cache name for scalar momentum propagators
2017-01-11 18:39:58 +00:00
2da86f7dae
Merge branch 'feature/hadrons' into feature/qed-fvol
2017-01-11 18:38:05 +00:00
41df1db811
Hadrons: number of dimensions entirely determined by the initial grid
2017-01-11 18:37:49 +00:00
Guido Cossu
0dfda4bb90
Working on the RNGModule
2017-01-09 11:06:18 +00:00
Guido Cossu
1189ebc8b5
Cleaning up the checkpointers interface
2017-01-05 15:52:52 +00:00
97843e2b58
Hadrons: free scalar buffer fix and output
2017-01-05 14:58:55 +00:00
82b3f54697
scalar free propagator fix
2017-01-05 14:58:07 +00:00
Guido Cossu
1bb8578173
Added module for checkpointers
2017-01-05 13:09:32 +00:00
Peter Boyle
c3b6d573b9
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
2016-12-30 22:42:17 +00:00
673994b281
Hadrons: modules for scalar propagators
2016-12-29 22:44:58 +01:00
bbc0eff078
Hadrons: scalar sources
2016-12-29 22:44:22 +01:00
4c60e31070
Hadrons: code cleaning
2016-12-29 22:44:08 +01:00
afbf7d4c37
QED Gimpl moved in Photon.h
2016-12-29 22:43:38 +01:00
8c3cc32364
Scalar action
2016-12-29 22:42:58 +01:00
Peter Boyle
1e179c903d
Worried about integer; suspect where statements are broken
2016-12-27 17:46:38 +00:00
Peter Boyle
669cfca9b7
No inline
2016-12-27 17:45:40 +00:00
Peter Boyle
ff2f559a57
Remove inline on gather optimised path
2016-12-27 17:45:19 +00:00
Peter Boyle
03c81bd902
Merge branch 'feature/bgq-asm' of https://github.com/paboyle/Grid into feature/bgq-asm
2016-12-27 11:25:35 +00:00
Peter Boyle
a869addef1
Stats switch off
2016-12-27 11:25:22 +00:00
Peter Boyle
1caa3fbc2d
LOCK UNLOCK only
2016-12-27 11:24:45 +00:00
Peter Boyle
3d21297bbb
Call the fast path compressor for wilson kernels to avoid if else on projector
2016-12-27 11:23:13 +00:00
Peter Boyle
25efefc5b4
Back to original thread policy post test
2016-12-23 09:49:04 +00:00
Peter Boyle
eabf316ed9
BGQ performance ASM
2016-12-22 21:56:08 +00:00
Peter Boyle
04ae7929a3
BGQ or KNL assembler now
2016-12-22 17:53:22 +00:00
Peter Boyle
caba0d42a5
L1p controls
2016-12-22 17:52:55 +00:00
Peter Boyle
9ae81c06d2
L1p controls for BG/Q
2016-12-22 17:52:21 +00:00
Peter Boyle
0903c48caa
Hot start SU3
2016-12-22 17:51:45 +00:00
Peter Boyle
7dc36628a1
QPX finishing
2016-12-22 17:50:48 +00:00
Peter Boyle
b8cdb3e90a
Debug hack; raises from 62GF/s to 72 GF/s per node on BG/Q
2016-12-22 17:50:14 +00:00
Peter Boyle
5241245534
Default to static scheduling
2016-12-22 17:49:21 +00:00
Dr Peter Boyle
960316e207
type conversion in printf
2016-12-22 17:27:01 +00:00
Guido Cossu
5214846341
Adding a resource manager
2016-12-22 12:41:56 +00:00
4c3fd9fa3f
stochastic QED field module in Hadrons
2016-12-22 00:29:41 +01:00
17b3a10d46
stochastic QED: function to cache 1/sqrt(khat^2)
2016-12-22 00:29:19 +01:00
149a46b92c
Merge branch 'feature/hadrons' into feature/qed-fvol
2016-12-22 00:26:43 +01:00
3215ae6b7e
Hadrons: genetic scheduler crashes in multi-thread with 1 module, multi-threading deactivated for now
2016-12-22 00:26:30 +01:00
7a85fddc7e
Hadrons: modification of registration mechanism to allow for persistent caches
2016-12-22 00:25:36 +01:00
Guido Cossu
ce1a115e0b
Removing redundant arguments for integrator functions, step 1
2016-12-20 17:51:30 +00:00
db9c28a773
qed-fvol: Photon parameter name fix
2016-12-20 12:41:39 +01:00
9ac3ac41df
serialisable Photon parameters
2016-12-20 12:41:01 +01:00
2af9ab9034
old Makefile cleaning
2016-12-20 12:40:26 +01:00
6f1ea96293
Merge branch 'develop' into feature/qed-fvol
2016-12-20 12:33:02 +01:00
f8d11ff673
better serialisable enums (can be encapsulated into classes)
2016-12-20 12:31:49 +01:00
paboyle
3f2d53a994
BGQ assembler beginning
2016-12-20 10:21:26 +00:00
paboyle
8a337f3070
Move cayley into mainstream tests
2016-12-18 02:35:31 +00:00
paboyle
a59f5374d7
Evade warning
2016-12-18 02:23:55 +00:00
paboyle
4b220972ac
Warning fix
2016-12-18 02:14:17 +00:00
paboyle
629f43e36c
Return statement needed
2016-12-18 02:09:37 +00:00
paboyle
a3172b3455
Precision error
2016-12-18 02:07:45 +00:00
paboyle
3e6945cd65
Fixing AVX Z-mobius
2016-12-18 02:05:11 +00:00
paboyle
87be03006a
AVX 512 code broke other compiles; fixing
2016-12-18 01:45:09 +00:00
paboyle
f17436fec2
Bad commit fixed
2016-12-18 01:27:34 +00:00
Peter Boyle
4d8b01b7ed
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-12-18 00:56:57 +00:00
Peter Boyle
fa6acccf55
Zmobius asm
2016-12-18 00:56:19 +00:00
Peter Boyle
55cb22ad67
Z mobius bmark
2016-12-18 00:55:37 +00:00
azusayamaguchi
df9108154d
Debugged 2 versions of assembler; ls vectorised, xyzt vectorised
2016-12-17 23:47:51 +00:00
azusayamaguchi
b3e7f600da
Partial implementation of 4d vectorisation assembler
2016-12-16 23:50:30 +00:00
azusayamaguchi
d4071daf2a
Template specialise
2016-12-16 22:28:29 +00:00
azusayamaguchi
a2a6329094
AVX512 only for ASM compilation
2016-12-16 22:03:29 +00:00
azusayamaguchi
eabc577940
Assembler possibly working
2016-12-16 16:55:36 +00:00
2e3c5890b6
qed-fvol: build fix
2016-12-15 20:06:46 +00:00
bc6678732f
Merge branch 'feature/hadrons' into feature/qed-fvol
...
# Conflicts:
# Makefile.am
# configure.ac
# lib/qcd/action/gauge/Photon.h
2016-12-15 19:53:00 +00:00
b10ae00c8a
Merge commit '6ad73145bc9754a5f26093eee5a34473ba0cff82' into feature/qed-fvol
2016-12-15 19:48:58 +00:00
67d72000e7
Hadrons: more legal banner fixes
2016-12-15 18:26:39 +00:00
80cef1c78f
Hadrons: legal banner fix
2016-12-15 18:21:52 +00:00
91e98b1dd5
Merge branch 'feature/hadrons' into develop
2016-12-15 18:15:56 +00:00
b791c274b0
Revert "AVX: uninitialised variable fix"
...
This reverts commit c22c3db9ad .
2016-12-15 18:15:35 +00:00
596dd570c7
Linux linking fix
2016-12-15 12:26:53 +00:00
cad158e42f
Hadrons: tests improvement
2016-12-14 19:41:51 +00:00
f63fac0c69
Hadrons: the XML runner can use a precomputed schedule
2016-12-14 19:41:30 +00:00
ab92de89ab
Hadrons: utility to schedule a run
2016-12-14 19:41:04 +00:00
846272b037
Hadrons: option to save and load a schedule
2016-12-14 19:40:36 +00:00
f3e49e4b73
Hadrons: module templates update
2016-12-14 18:19:46 +00:00
decbb61ec1
Hadrons: XML driven program is again a binary installed with Grid
2016-12-14 18:19:24 +00:00
7e2482aad1
Hadrons: cpde cleaning
2016-12-14 18:04:21 +00:00
e1653a9f94
Hadrons: size fix in DWF module
2016-12-14 18:02:36 +00:00
ea40854e0b
Hadrons: type names are demangled
2016-12-14 18:02:18 +00:00
34df71e755
Hadrons: function to save an application as an XML file
2016-12-14 18:01:56 +00:00
3af663e17b
Hadrons: modules remember their factory registration name
2016-12-14 17:59:45 +00:00
Azusa Yamaguchi
0cd6b1858c
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-12-14 09:23:22 +00:00
Guido Cossu
0bd296dda4
Adding check of the Dag part in the benchmark
2016-12-14 03:15:09 +00:00
Guido Cossu
af0ccdd8e9
Moving output order
2016-12-14 02:02:42 +00:00
c22c3db9ad
AVX: uninitialised variable fix
2016-12-13 19:05:58 +00:00
013e710c7d
Hadrons: 3pt function test improvement
2016-12-13 19:04:43 +00:00
16693bd69d
Hadrons: scheduler heuristic benchmark
2016-12-13 19:02:32 +00:00
de8f80cf94
Hadrons: genetic operators improvement
2016-12-13 19:02:05 +00:00
Guido Cossu
2fb92dbc6e
Cleaning up previous debug lines
2016-12-13 07:53:43 +00:00
Guido Cossu
5c74b6028b
Commit for debugging, lot of IO
2016-12-13 06:35:30 +00:00
Guido Cossu
e0be2b6e6c
Adding a new tests for the Ls vec CG
2016-12-13 04:59:18 +00:00
Guido Cossu
ef72f322d2
consistency of tests
2016-12-13 02:24:20 +00:00
Azusa Yamaguchi
426197e446
Nc=3
2016-12-12 09:10:54 +00:00
Azusa Yamaguchi
99e2c1e666
Kernels options
2016-12-12 09:08:53 +00:00
Azusa Yamaguchi
1440565a10
Decrease verbosity
2016-12-12 09:08:04 +00:00
Azusa Yamaguchi
e9f0c0ea39
Staggered kernels options
2016-12-12 09:07:38 +00:00
Guido Cossu
7bc2065113
Adding report at the end of the DWF HMC tests
2016-12-12 04:21:34 +00:00
4a87486365
Hadrons: a bit of cleaning in the scheduler
2016-12-10 21:14:13 +01:00
Peter Boyle
fe187e9ed3
Compiles and passes under ZMobius with assembler
2016-12-10 00:47:48 +00:00
Peter Boyle
0091b50f49
Zmobius working -- not asm yet
2016-12-09 22:51:32 +00:00
Peter Boyle
fb8d4b2357
Lots of debug on performance Mobius
2016-12-08 17:28:28 +00:00
Peter Boyle
ff71a8e847
Ready for sim
2016-12-08 17:00:32 +00:00
Peter Boyle
83fa038bdf
Streaming stores
2016-12-08 16:58:42 +00:00
Peter Boyle
7a61feb6d3
Allocator added with caching for Linux VM subsystem optimisation
2016-12-08 16:58:01 +00:00
Peter Boyle
69ae817d1c
Updates for supporting Mobius better
2016-12-08 16:43:28 +00:00
Guido Cossu
2bd4233919
Completed testing of the HMC for Ls vectorised version (on AVX2)
2016-12-07 04:56:37 +00:00
Guido Cossu
143c70e29f
Debugged the threaded version. Cleaning up
2016-12-07 04:40:25 +00:00
51322da6f8
Hadrons: genetic scheduler improvement
2016-12-07 09:00:45 +09:00
49c3eeb378
Hadrons: more verbose genetic parameters
2016-12-07 08:59:58 +09:00
c56707e003
useless debug message removed
2016-12-07 08:59:20 +09:00
Guido Cossu
b812d5e39c
Added single threaded version of the derivative for the Ls vectorised DWF
2016-12-06 16:31:13 +00:00
5b3edf08a4
Hadrons: sequential gamma source
2016-12-06 12:13:19 +09:00
bd1d1cca34
Hadrons: code cleaning
2016-12-06 12:12:59 +09:00
646b11f5c2
Hadrons: exposing scheduler settings
2016-12-06 12:12:05 +09:00
a683a0f55a
Hadrons: meson tests renamed spectrum
2016-12-06 12:11:44 +09:00
e6effcfd95
Hadrons: more contractions in the spectrum test
2016-12-05 17:41:58 +09:00
aa016f61b9
Hadrons: empty baryon contractions
2016-12-05 17:26:57 +09:00
d42a1b73c4
Hadrons: code cleaning
2016-12-05 17:26:36 +09:00
d292657ef7
Hadrons: more module templates
2016-12-05 17:26:17 +09:00
d1f7c6b94e
Hadrons: templatisation of the fermion implementation
2016-12-05 16:47:29 +09:00
7ae734103e
Hadrons: namespace macro to tackle GCC 5 bug
2016-12-05 14:29:32 +09:00
Guido Cossu
01480da0a8
Merge branch 'develop' into feature/hmc_generalise
2016-12-05 05:10:27 +00:00
7a1ac45679
Hadrons: configure.ac Linux typo
2016-12-05 14:00:10 +09:00
320268fe93
Hadrons: code cleaning
2016-12-05 13:57:34 +09:00
dd6fb140c5
Hadrons: big module reorganisation
2016-12-05 13:53:31 +09:00
0b4f680d28
Hadrons: meson run test
2016-12-05 11:44:58 +09:00
a69086ba1f
Hadrons: application run minor fixes
2016-12-05 11:44:36 +09:00
7433eed274
Hadrons: module creation fix
2016-12-05 11:44:16 +09:00
ee5b1fe043
Hadrons: freeing object message fix
2016-12-05 09:08:45 +09:00
1540616b22
Hadrons: integer types cleanup
2016-12-05 08:53:48 +09:00
8190523e4c
Hadrons: type fix in module creation
2016-12-02 11:04:34 +09:00
b5555d85a7
Hadrons: generelalised FImpl for actions
2016-12-02 11:04:15 +09:00
Peter Boyle
e27c6b217c
Updating
2016-12-01 12:42:53 +00:00
9ad3d3453e
Hadrons is now a library, the previous XML driven program is now a test
2016-12-01 21:36:29 +09:00
Peter Boyle
f7a6b8e5ed
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-12-01 11:39:52 +00:00
paboyle
6adf35da54
Faster Mobius
2016-12-01 11:39:04 +00:00
d8b716d2cd
Hadrons: static initialisation fixed
2016-12-01 15:43:16 +09:00
Peter Boyle
cd01c1dbe9
Ls 16 more relevant
2016-11-30 22:11:10 +00:00
James Harrison
6ad73145bc
Calculate Wilson loop average over multiple configurations.
2016-11-30 15:17:22 +00:00
paboyle
bd0430b34f
Serialisation in malloc fixed
2016-11-29 22:27:55 +00:00
Azusa Yamaguchi
c097fd041a
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-29 13:44:17 +00:00
Azusa Yamaguchi
77fb25fb29
Push 5d tests
2016-11-29 13:43:56 +00:00
Azusa Yamaguchi
389e0a77bd
Staggerd Fermion 5D
2016-11-29 13:13:56 +00:00
paboyle
2f92b4860b
Test the full Mooee sector
2016-11-29 00:15:08 +00:00
paboyle
4704f2d009
Actions updated
2016-11-29 00:14:36 +00:00
Guido Cossu
ae9688e343
Reporting also the total mflops
2016-11-28 11:37:02 +00:00
43928846f2
first steps to make Hadrons a library
2016-11-28 16:02:15 +09:00
fabcd4179d
Hadrons: propagator type coming from the fermion implementation
2016-11-28 14:02:10 +09:00
a8843c9af6
Code cleaning, the fermion implementation can be sepcified using the macro FIMPL
2016-11-27 16:47:22 +09:00
7a1a7a685e
Merge branch 'feature/fft-opt' into feature/hadrons
2016-11-27 15:32:03 +09:00
Guido Cossu
1e44fd3094
Added some details on the mpi flags for Cray machines
2016-11-26 18:30:53 +00:00
Guido Cossu
d8258f0758
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-11-26 18:25:32 +00:00
Guido Cossu
6c0cc5676b
Adding Eigen.inc to the gitignore
2016-11-26 18:25:12 +00:00
f7293f2ddb
Merge pull request #69 from jch1g10/feature/qed-fvol
2016-11-26 07:04:04 +09:00
11dc0b398b
Merge pull request #74 from Lanny91/develop
2016-11-26 07:01:51 +09:00
Lanny91
b18950f776
Added simd real divide test with QPX divide fixes
2016-11-25 13:21:33 +00:00
Lanny91
0acbf77bc6
Add QPX Div structure
2016-11-24 13:24:12 +00:00
3cdf945d84
Test_fftf fix
2016-11-24 09:10:03 +09:00
5833f247fa
more FFt optimisations
2016-11-24 09:09:48 +09:00
Azusa Yamaguchi
95f43d27ae
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-22 13:49:22 +00:00
Azusa Yamaguchi
668ca57702
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/staggering
2016-11-22 13:49:11 +00:00
a2cffb0304
AVXFMA target fixed
2016-11-21 17:47:18 +01:00
bafbac6ac4
Merge branch 'feature/gen-simd' into develop
2016-11-19 13:45:30 +01:00
595f1ce371
GEN SIMD build fix
2016-11-19 13:45:12 +01:00
6d7cde4eb4
README update
2016-11-19 13:17:35 +01:00
97cddda49e
Merge branch 'feature/gen-simd' into feature/doxygen
...
# Conflicts:
# Makefile.am
# configure.ac
2016-11-19 13:11:13 +01:00
433afd36f5
Makefile rule for simple_* objects
2016-11-19 01:33:13 +01:00
b873504b90
fully generic SIMD
2016-11-19 01:32:39 +01:00
Guido Cossu
62749d05a6
Naming the scalar action
2016-11-17 12:26:20 +00:00
Guido Cossu
3834feb4b7
Adding action names
2016-11-16 16:46:49 +00:00
James Harrison
6b8ee7bae0
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-11-15 13:08:08 +00:00
James Harrison
739c2308b5
Set imaginary part of stochastic QED field to zero using real() instead of conjugate().
2016-11-15 13:07:52 +00:00
Guido Cossu
454302414d
Small modif at the test hmc
2016-11-15 12:31:13 +00:00
042ae5b87c
generic 256bits SIMD
2016-11-15 12:16:15 +00:00
James Harrison
a71b69389b
QedFVol: calculate square Wilson loops up to 10x10
2016-11-14 18:23:04 +00:00
James Harrison
d49e502f53
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-11-14 18:00:33 +00:00
James Harrison
92ec3404f8
Set imaginary part of stochastic QED field to zero after FFT into position space
2016-11-14 17:59:02 +00:00
James Harrison
f4ebea3381
QedFVol: add functions for computing spatial and timelike Wilson loops
2016-11-14 17:51:53 +00:00
James Harrison
cf167d0cd1
QedFVol: implement exponentiation of photon field
2016-11-14 17:02:29 +00:00
Guido Cossu
6f8b771a37
Adding date of the last commit
2016-11-10 18:52:00 +00:00
Guido Cossu
4e1ffdd17c
Adding git info to the configure output
2016-11-10 18:44:36 +00:00
1aa695cd78
Hadrons: merge typo
2016-11-10 18:38:30 +00:00
Guido Cossu
a783282b8b
Merge branch 'develop' into feature/hmc_generalise
2016-11-10 18:13:07 +00:00
Guido Cossu
19b85d8486
Some comments in the hmc files
2016-11-10 17:55:58 +00:00
paboyle
58f4950652
Merge branch 'release/v0.6.0' into develop
2016-11-09 12:44:00 +00:00
paboyle
c363bdd784
Merge branch 'release/v0.6.0'
2016-11-09 12:43:14 +00:00
paboyle
604f0ea2f6
Merge branch 'develop' into release/v0.6.0
2016-11-09 04:13:01 -08:00
paboyle
42c912f608
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-11-09 04:12:15 -08:00
paboyle
33dc1f51b5
Final sign off commits from Cori-1
2016-11-09 04:11:03 -08:00
James Harrison
c30d96ea50
QedFVol: x86intrin.h namespace fix
2016-11-09 11:06:20 +00:00
13a8997789
Merge branch 'release/v0.6.0' into feature/hadrons
...
# Conflicts:
# Makefile.am
2016-11-08 20:43:39 +00:00
9576f0903d
namespace fix
2016-11-08 19:07:47 +00:00
34cf702b24
README is now a symlink to README.md
2016-11-08 17:00:38 +00:00
8a5e3a917c
Merge branch 'develop' into release/v0.6.0
...
# Conflicts:
# tests/core/Test_fft_gfix.cc
2016-11-08 16:53:42 +00:00
65bcf281d0
Merge branch 'develop' of github.com:paboyle/Grid into develop
2016-11-08 16:51:19 +00:00
cd0be8cb24
Test_fft_gfix.c precision fix
2016-11-08 15:32:05 +00:00
3d2a22a14d
include fix for MKL
2016-11-08 15:31:47 +00:00
James Harrison
7ffe17ada1
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-11-08 14:52:43 +00:00
a26adfb090
README: only markdown
2016-11-08 14:11:18 +00:00
f6e1a5b348
building tests depends on building the library at the top level
2016-11-08 14:08:33 +00:00
c5a025d421
README typo
2016-11-08 14:07:59 +00:00
azusayamaguchi
50d277d8d9
Merge branch 'develop' into release/v0.6.0
2016-11-08 13:44:14 +00:00
azusayamaguchi
343f3e829f
Fixes prerelease to make all tests
2016-11-08 13:42:12 +00:00
azusayamaguchi
f85b35314d
Fix a routine for single node processor coor from rank
2016-11-08 11:49:13 +00:00
azusayamaguchi
3dc2e05d6e
Time as well since MKL returns zero for Mflops
2016-11-08 11:36:18 +00:00
azusayamaguchi
0cff8754d1
Usecs
2016-11-08 11:35:41 +00:00
Guido Cossu
afc8d3e524
Adding support for parallel recursive compilation for the tests
2016-11-07 11:13:43 +00:00
azusayamaguchi
692b44dac1
Merge branch 'develop' into release/v0.6.0
2016-11-04 22:48:11 +00:00
azusayamaguchi
96ba42a297
omm buf
2016-11-04 22:47:25 +00:00
7df940dc3e
homemade test recusrive target for old autotools versions
2016-11-04 22:32:25 +00:00
azusayamaguchi
f7b60004f3
Merge branch 'develop' into release/v0.6.0
2016-11-04 16:08:07 +00:00
8af8b047fd
tests is now a recusrsive target
2016-11-04 13:44:21 +00:00
6592078fef
Make.inc removed, once again don't commit it!
2016-11-04 13:43:40 +00:00
ad971ca07b
fftw3.h is now expected to be an external header
2016-11-04 13:12:35 +00:00
f2f16eb972
fftw3.h removed, please don't commit this file back
2016-11-04 13:11:05 +00:00
azusayamaguchi
b7d55f7dfb
Fix a typo in reorg of the --dslash-asm
2016-11-04 11:35:08 +00:00
azusayamaguchi
6e548a8ad5
Linux compile needed
2016-11-04 11:34:16 +00:00
Azusa Yamaguchi
ee686a7d85
Compiles now
2016-11-03 16:58:23 +00:00
Azusa Yamaguchi
1c5b7a6be5
Staggered phases first cut, c1, c2, u0
2016-11-03 16:26:56 +00:00
a5dd4a9bab
Merge branch 'feature/fft-opt' into develop
2016-11-03 14:34:46 +00:00
ec232af851
Photon.h references removed
2016-11-03 14:34:16 +00:00
17e30281e9
Merge branch 'develop' into feature/fft-opt
...
# Conflicts:
# lib/FFT.h
2016-11-03 14:14:03 +00:00
2854e601e6
FFT test typo
2016-11-03 14:09:47 +00:00
aee44dc694
Photon.h removed from develop branch
2016-11-03 13:54:15 +00:00
75bbf6a0af
Merge branch 'develop' into feature/feynman-rules
2016-11-03 13:52:11 +00:00
c65d23935a
README update
2016-11-03 13:48:20 +00:00
92cd797636
MPI auto configure fix
2016-11-03 13:48:07 +00:00
paboyle
111bfbc6bc
notimestamp by default
2016-11-03 11:40:26 +00:00
paboyle
f41a230b32
Decrease mpi3l verbose
2016-11-02 19:54:03 +00:00
paboyle
c067051d5f
Merge branch 'develop' into release/v0.6.0
2016-11-02 13:59:18 +00:00
paboyle
afdeb2b13c
Merge branch 'feature/mpi3-master-slave' into develop
2016-11-02 13:43:20 +00:00
paboyle
9e2ec2719b
Merge branch 'develop' into feature/mpi3-master-slave
2016-11-02 13:02:56 +00:00
paboyle
757a928f9a
Improvement to use own SHM_OPEN call to avoid openmpi bug.
2016-11-02 12:37:46 +00:00
Guido Cossu
bc248b6948
Merge branch 'release/v0.6.0' into feature/KNL_double_prec
...
Conflicts:
lib/simd/Grid_avx512.h
2016-11-02 10:40:49 +00:00
Guido Cossu
ae8561892e
Eliminating useless defines
2016-11-02 10:21:06 +00:00
paboyle
32375aca65
Semaphore sleep/wake up on remote processes.
2016-11-02 09:27:20 +00:00
paboyle
bb94ddd0eb
Tidy up of mpi3; also some cleaning of the dslash controls.
2016-11-02 08:07:09 +00:00
330a9b3f4c
Merge pull request #65 from jch1g10/feature/qed-fvol
2016-11-01 19:53:25 +00:00
c2d78493c8
Merge pull request #64 from jch1g10/feature/feynman-rules
2016-11-01 19:53:08 +00:00
James Harrison
28ff66a381
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-11-01 16:07:46 +00:00
James Harrison
78c7bcee36
QedFVol: Change variables of type "double" to type "Real".
2016-11-01 16:06:05 +00:00
James Harrison
7f0fc0eff5
Remove explicit use of double-precision types in photon.h
2016-11-01 16:02:35 +00:00
Azusa Yamaguchi
164d3691db
Staggered
2016-11-01 14:24:22 +00:00
paboyle
791cb050c8
Comms improvements
2016-11-01 11:35:43 +00:00
00a7b95631
Merge remote-tracking branch 'gh-james/feature/qed-fvol' into feature/qed-fvol
2016-10-31 18:46:23 +00:00
94d8321d01
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-10-31 18:41:30 +00:00
d5e95bc350
Merge branch 'release/v0.6.0' into feature/feynman-rules
2016-10-31 18:36:21 +00:00
6efac3a252
Merge pull request #61 from jch1g10/feature/feynman-rules
...
Add missing volume factor in stochastic QED field
2016-10-31 18:35:22 +00:00
7a84906b5f
Merge branch 'release/v0.6.0' into feature/fft-opt
2016-10-31 18:31:49 +00:00
07416e4567
README update
2016-10-31 18:21:52 +00:00
66d832c733
FFTW header fix
2016-10-31 16:39:29 +00:00
e74417ca12
big build system polish
2016-10-31 16:31:27 +00:00
7bd0084b5d
Merge branch 'develop' into release/v0.6.0
2016-10-31 16:30:22 +00:00
Guido Cossu
e8c3174ae2
Small change in the defines
2016-10-30 12:23:11 +00:00
Guido Cossu
9b066e94d0
Compilation with both single and double precision
2016-10-30 12:04:06 +00:00
James Harrison
ac24cc9f99
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-10-29 11:05:26 +01:00
James Harrison
618abdf302
Add missing volume factor in stochastic QED field
2016-10-29 11:04:02 +01:00
Guido Cossu
e1042aef77
First version of the doube prec for testing purposes
...
It does not compile single and double version at the same time
2016-10-28 17:20:04 +01:00
paboyle
aa6a839c60
avx512 build fix; detect clang/gcc intrinsics vs. ICPC
2016-10-28 09:13:09 +01:00
ac99a56237
Merge branch 'develop' into release/v0.6.0
2016-10-27 11:53:24 +01:00
b4d2af8c89
threaded FFT
2016-10-26 19:46:36 +01:00
434af6aeaa
Merge branch 'develop' into feature/fft-opt
2016-10-26 18:50:38 +01:00
e90f8ac841
Merge branch 'develop' into feature/feynman-rules
2016-10-26 18:50:21 +01:00
a1705a8d53
debug message removed
2016-10-26 18:50:07 +01:00
ca21003f01
Merge branch 'feature/fft-opt' into feature/feynman-rules
...
# Conflicts:
# lib/FFT.h
# lib/qcd/action/fermion/WilsonFermion5D.h
# tests/core/Test_fft.cc
2016-10-26 18:44:47 +01:00
14ddf2c234
more FFT optimisations
2016-10-26 17:36:26 +01:00
Guido Cossu
1d666771f9
Debugging the RNG, eliminate the barrier after broadcast
2016-10-26 16:08:23 +01:00
Guido Cossu
d50055cd96
Making the ILDG support optional
2016-10-26 09:48:01 +01:00
Azusa Yamaguchi
bca861e112
Note:FFT shoud be GridFFT (Not change yet).
...
Gauge fix with FFt is added (tests/core)
2016-10-25 14:21:48 +01:00
James Harrison
3ab4c8c0bb
QedFVol: calculate plaquette and 2x2 Wilson loop of stochastic QED field
2016-10-25 13:32:02 +01:00
33d199a0ad
temporary thread safety in FFT
2016-10-25 12:56:40 +01:00
paboyle
93896ce59e
Roll version number
2016-10-25 06:12:49 +01:00
paboyle
b1508e4124
Merge branch 'feature/mpi3' into develop
2016-10-25 06:06:36 +01:00
paboyle
b820076b91
Merge branch 'develop' into feature/mpi3
2016-10-25 06:02:33 +01:00
paboyle
09f66100d3
MPI 3 compile on non-linux
2016-10-25 06:01:12 +01:00
azusayamaguchi
d7d92af09d
Travis fail fix attempt
2016-10-25 01:45:53 +01:00
azusayamaguchi
460d0753a1
Merge branch 'develop' into feature/mpi3
...
Conflicts:
lib/simd/Grid_avx512.h
2016-10-25 01:08:51 +01:00
azusayamaguchi
8f8058f8a5
More random bits on parallel seeding
2016-10-25 01:05:52 +01:00
azusayamaguchi
d97a27f483
Verbose
2016-10-25 01:05:31 +01:00
azusayamaguchi
7c3363b91e
Compiles all comms targets
2016-10-25 00:04:17 +01:00
azusayamaguchi
b94478fa51
mpi, mpi3, shmem all compile.
...
mpi, mpi3 pass single node multi-rank
2016-10-24 23:45:31 +01:00
Guido Cossu
47c7159177
ILDG reader/writer works
...
Fill the xml header with the required information, todo.
2016-10-24 21:57:54 +01:00
13bf0482e3
FFT optimisation
2016-10-24 19:25:40 +01:00
a795b5705e
memory optimisation
2016-10-24 19:25:15 +01:00
392e064513
fast local peek-poke
2016-10-24 19:24:21 +01:00
azusayamaguchi
b6a65059a2
Update to use shared memory to contain the stencil comms buffers
...
Tested on 2.1.1.1 1.2.1.1 4.1.1.1 1.4.1.1 2.2.1.1 subnode decompositions
2016-10-24 17:30:43 +01:00
Guido Cossu
f415db583a
Adding ILDG format
2016-10-24 15:48:22 +01:00
Guido Cossu
f55c16f984
Adding a barrier in the RNG save
2016-10-24 11:02:14 +01:00
azusayamaguchi
ea25a4d9ac
Works
2016-10-23 06:10:05 +01:00
azusayamaguchi
c190221fd3
Internal SHM comms in non-simd directions working
...
Need to fix simd directions
2016-10-22 18:14:27 +01:00
Guido Cossu
df67e013ca
More debug output for the RNG
2016-10-22 13:34:17 +01:00
Guido Cossu
3e990c9d0a
Reverting the broadcast change
2016-10-22 13:26:43 +01:00
Guido Cossu
4b740fc8fd
Debugging the RNG state save
2016-10-22 13:06:00 +01:00
azusayamaguchi
0fcd2e7188
Simplify the comms structure prior to implementing Shared memory direct bouncs
2016-10-21 22:44:10 +01:00
azusayamaguchi
910b8dd6a1
use simd type
2016-10-21 22:35:29 +01:00
azusayamaguchi
75ebd3a0d1
Typo fixes and rotate for CLANG
2016-10-21 22:34:29 +01:00
Guido Cossu
cccd14b09e
Small cleanup
2016-10-21 17:20:54 +01:00
Guido Cossu
e6acffdfc2
Fixing the plaquette computation
2016-10-21 16:06:34 +01:00
26d124283e
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-10-21 15:23:31 +01:00
0d889b7041
QedFVol: first attempt at generating a QED field
2016-10-21 15:21:32 +01:00
7c8f79b147
more stochastic QED fixes
2016-10-21 15:20:12 +01:00
azusayamaguchi
09fd5c43a7
Reasonably fast version
2016-10-21 15:17:39 +01:00
ab31ad006a
Merge branch 'feature/feynman-rules' into feature/qed-fvol
2016-10-21 14:42:18 +01:00
462921e549
QED: fix stochastic field
2016-10-21 14:41:08 +01:00
Guido Cossu
392130a537
Working on the 5d
2016-10-21 14:22:25 +01:00
azusayamaguchi
f22317748f
Merge branch 'feature/mpi3' of https://github.com/paboyle/Grid into feature/mpi3
2016-10-21 13:36:35 +01:00
azusayamaguchi
6a9eae6b6b
Reporting improvements
2016-10-21 13:36:18 +01:00
azusayamaguchi
fad96cf250
StencilBufs
2016-10-21 13:36:00 +01:00
azusayamaguchi
f331809c27
Use variable type for loop
2016-10-21 13:35:37 +01:00
bd6a228af6
Merge commit '20a091c3eddfdb67a82ece6413740a93650a2f98' into feature/feynman-rules
2016-10-21 13:10:30 +01:00
63d219498b
first (dirty) implementation of Feynman stoctachtic EM field
2016-10-21 13:10:13 +01:00
paboyle
2c54a53d0a
Compile verbose reduce
2016-10-21 12:12:14 +01:00
paboyle
306160ad9a
bcopy threaded
2016-10-21 12:07:28 +01:00
azusayamaguchi
20a091c3ed
Intel vs. Clang intrinsics differences absorbed
2016-10-21 09:08:36 +01:00
azusayamaguchi
202078eb1b
Cray / OpenSHMEM ordering differs
2016-10-21 09:07:20 +01:00
paboyle
a762b1fb71
MPI3 working with a bounce through shared memory on my laptop.
...
Longer term plan: make the "u_comm_buf" in Stencil point to the shared region and avoid the
send between ranks on same node.
2016-10-21 09:03:26 +01:00
Guido Cossu
deef2673b2
Separating the Lattice theories stub from the QCD.h file
2016-10-20 17:24:08 +01:00
paboyle
5b5925b8e5
Forgot to add
2016-10-20 17:09:40 +01:00
Guido Cossu
977b0a6dd9
Merge branch 'develop' into feature/hmc_generalise
2016-10-20 17:04:41 +01:00
Guido Cossu
977d844394
Few modifications on stdout messages
2016-10-20 17:01:59 +01:00
paboyle
b58adc6a4b
commVector
2016-10-20 17:00:15 +01:00
paboyle
f9d5e95d72
allocator template typedefs moved to AlignedAllocator
2016-10-20 16:59:39 +01:00
paboyle
4f8e636a43
commVector
2016-10-20 16:59:16 +01:00
paboyle
9b39f35ae6
commVector different for SHMEM compat
2016-10-20 16:58:53 +01:00
paboyle
5fe2b85cbd
MPI3 and shared memory support
2016-10-20 16:58:01 +01:00
paboyle
c7cccaaa69
Comm vector for shmem
2016-10-20 16:57:31 +01:00
paboyle
cbcfea466f
MPI3
2016-10-20 16:57:14 +01:00
paboyle
4955672fc3
MPI3
2016-10-20 16:57:00 +01:00
paboyle
39f1c880b8
mpi3
2016-10-20 16:56:40 +01:00
paboyle
8c043da5b7
SHMEM and comms allocator made different
2016-10-20 16:56:05 +01:00
paboyle
3cbe974eb4
Layout
2016-10-20 16:55:21 +01:00
6e4a06e180
qed-fvol: initial commit
2016-10-20 15:04:00 +01:00
997fd882ff
Merge branch 'develop' into feature/feynman-rules
...
# Conflicts:
# lib/Threads.h
# lib/qcd/action/fermion/WilsonFermion.cc
# lib/qcd/action/fermion/WilsonFermion.h
# lib/qcd/utils/SUn.h
# lib/simd/Grid_avx.h
# lib/simd/Intel512common.h
2016-10-19 18:35:18 +01:00
Guido Cossu
590675e2ca
Csum in hex format
2016-10-19 17:26:25 +01:00
Guido Cossu
8c65bdf6d3
Printing checksum for the RNG file
2016-10-19 16:56:11 +01:00
Guido Cossu
74f1ed3bc5
Adding some documentation for HMC
2016-10-19 10:51:13 +01:00
paboyle
7af9b87318
Cache face tables to improve performance.
...
Extract merge now looking poor.
2016-10-18 09:51:37 +01:00
paboyle
811ca45473
GNU clang hack for AVX512 since there are missing reduce intrinsics in Clang 3.9 and GCC-6 AVX512 support
2016-10-17 16:23:21 +01:00
paboyle
bc1a4d40ba
Faster integer handling avoid push_back
2016-10-17 16:16:44 +01:00
Guido Cossu
79270ef510
Added a test for EODWF Scaled Shamir with general HMC
2016-10-14 17:34:26 +01:00
Guido Cossu
e250e6b7bb
Moving parameters outside of the HMCrunner
2016-10-14 17:22:32 +01:00
paboyle
c8079e6621
Time the face gateher in x-dir more carefully
2016-10-13 22:28:50 +01:00
Guido Cossu
261342c15f
Adding gh-pages
2016-10-13 11:51:25 +01:00
azusayamaguchi
8b0d171c9a
32bit issue on the KNL code variant where byte offsets were stored
2016-10-12 17:49:32 +01:00
azusayamaguchi
1f293b76b4
Merge branch 'feature/knl-stats' into develop
2016-10-12 13:47:58 +01:00
azusayamaguchi
8bbd9ebc27
Reversing changes to Stencil class
2016-10-12 13:47:20 +01:00
azusayamaguchi
6472b431f0
__rdpmc needed for gcc, clang++
2016-10-12 12:29:08 +01:00
azusayamaguchi
bd205a3293
Fixing for non x86 and non KNL
2016-10-12 12:09:15 +01:00
azusayamaguchi
496beffa88
Fix non-KNL build
2016-10-12 12:06:08 +01:00
azusayamaguchi
9b63e97108
align not absolutely required and confuses clang++
2016-10-12 11:51:21 +01:00
azusayamaguchi
81f2aeaece
KNL streaming stores, and KNL performance coutners
2016-10-12 11:45:22 +01:00
paboyle
2d4a45c758
Typecast pointer
2016-10-12 09:14:15 +01:00
paboyle
a123dcd7e9
Static required for shmem. Reading same object twice requires csum reset
2016-10-12 00:29:57 +01:00
paboyle
6b27c42dfe
Cosmetic
2016-10-12 00:29:39 +01:00
paboyle
f7c2aa3ba5
runtime by default
2016-10-12 00:29:13 +01:00
paboyle
0f182f033b
Drop macos with gcc
2016-10-11 22:29:06 +01:00
paboyle
7240d73184
Parallelise the x faces; fix the segv on KNL with comms
2016-10-11 22:21:07 +01:00
paboyle
42cd148f5e
Base pointer for comms buffer under AVX512 assembly
2016-10-11 16:06:06 +01:00
Guido Cossu
eda4dd622e
Some more edit
2016-10-11 15:45:20 +01:00
paboyle
6e01264bb7
don't use static by default
2016-10-11 10:03:39 +01:00
paboyle
6f408256bc
FMA4 option moved on the align
2016-10-11 10:03:01 +01:00
paboyle
8d11681aac
verbose remove
2016-10-10 23:50:42 +01:00
paboyle
3d5c9a1ee9
No compile fix on clang++ 3.9
2016-10-10 23:50:13 +01:00
paboyle
db749f103f
Add Wilson, DWF, Overlap feynman rule tests
2016-10-10 23:48:35 +01:00
paboyle
dc389e467c
axpy_ssp for any coeff type via template
2016-10-10 23:48:05 +01:00
paboyle
3619167d62
Mass parameter
2016-10-10 23:47:33 +01:00
paboyle
96f1d1b828
Debugged Domain wall and Overlap feynman rules (infinite Ls, finite mass).
2016-10-10 23:46:45 +01:00
paboyle
657e0a8f4d
Mass parameter
2016-10-10 23:46:10 +01:00
paboyle
616e7cd83e
Mass parameter
2016-10-10 23:45:48 +01:00
paboyle
6f26d2e8d4
Overlap tree level feynman rule
2016-10-10 23:45:18 +01:00
paboyle
c014574504
A "please implement me" feynman rule. If this were abstract virtual it would
...
require/force implementation
2016-10-10 23:44:00 +01:00
paboyle
d7ce164e6e
Feynman rule for DWF
2016-10-10 23:43:36 +01:00
paboyle
c0d5b99016
Dminus
2016-10-10 23:43:19 +01:00
paboyle
09ca32d678
Dminus added for Cayley
2016-10-10 23:42:55 +01:00
paboyle
082ae350c6
static schedule by default
2016-10-10 23:42:30 +01:00
Guido Cossu
611b5d74ba
Fix for AVX+FMA3 compilation
2016-10-10 15:26:17 +01:00
Guido Cossu
b56c9ffa52
Fix for AVXFMA
2016-10-10 14:43:37 +01:00
Guido Cossu
c68a2b9637
Minor fix
2016-10-10 11:54:58 +01:00
Guido Cossu
293df6cd20
Generalising the HMCRunner and moving parameters to the user level
2016-10-10 11:49:55 +01:00
Guido Cossu
65f61bb3bf
Reset QCD colours to 3
2016-10-10 09:46:17 +01:00
Guido Cossu
26b9740d53
Some fix for the GenericHMCrunner
2016-10-10 09:43:05 +01:00
cb02b7088f
Merge branch 'develop' into feature/doxygen
...
# Conflicts:
# configure.ac
2016-10-09 13:35:44 +01:00
70c32fa49b
Merge branch 'develop' of github.com:paboyle/Grid into develop
2016-10-09 12:55:46 +01:00
77c8a94dae
AVXFMA4 flag fix for Intel Compiler
2016-10-09 12:55:12 +01:00
Guido Cossu
6eb873dd96
Added scalar action phi^4
...
Check Norm2 output (Complex type assumption)
2016-10-07 17:28:46 +01:00
Guido Cossu
11b4c80b27
Added support for hmc and binary IO for a general field
2016-10-07 13:37:29 +01:00
Guido Cossu
2e453dfbf5
Added some instrumentation to benchmark the force computation
2016-10-06 17:52:45 +01:00
Guido Cossu
c065e454c3
Adding Binrary IO, untested
2016-10-06 10:12:11 +01:00
paboyle
4089984431
Timing hooks
2016-10-06 09:25:12 +01:00
98439847cf
configure portability fix
2016-10-05 14:57:20 +01:00
Guido Cossu
c78bbd0f8c
Fix ASM compilation
2016-10-04 15:37:32 +01:00
Guido Cossu
d9b5fbd374
In the middle of adding a general binary writer
2016-10-04 11:24:08 +01:00
Guido Cossu
cfbc1a26b8
Now the gauge implementation has to take care of the Nexp
2016-10-03 16:20:06 +01:00
Guido Cossu
257f69f931
One more function to generalise the HMC integrator
2016-10-03 15:50:04 +01:00
Guido Cossu
e415260961
First cut on generalised HMC
...
Backward compatibility OK
2016-10-03 15:28:00 +01:00
7ea4b959a4
hopefully more portable configure output
2016-09-27 11:54:37 +01:00
536e2ff073
*.inc removed: please don't commit these files either!
2016-09-27 11:54:03 +01:00
798ff34d7e
configure removed: please don't commit configure!
2016-09-27 11:29:31 +01:00
paboyle
87acd06990
Use streaming stores
2016-09-26 10:11:34 +01:00
paboyle
9353b6edfe
Fenv out of grid namespace
2016-09-26 10:09:13 +01:00
paboyle
167cc2650e
GNU SOURCE problem on travis
2016-09-26 09:58:09 +01:00
paboyle
34f887ca1c
Test_fft not complete; preparing for tests of momentum space DWF and Overlap feynman rules but not there yet.
2016-09-26 09:44:36 +01:00
paboyle
7089b6d5a5
Setting up but not implemented some QED rules
2016-09-26 09:43:40 +01:00
paboyle
2ba7d43ddd
Divide handling
2016-09-26 09:43:14 +01:00
paboyle
836e929565
Divide handling improved
2016-09-26 09:42:22 +01:00
paboyle
b6713ecb60
Momentum space rules for Overlap, DWF untested to date
2016-09-26 09:39:09 +01:00
paboyle
52a39f0fcd
Divide in ET
2016-09-26 09:38:38 +01:00
paboyle
81a7a03076
Integer <<
2016-09-26 09:38:17 +01:00
paboyle
16b37b956c
divide goes to ET
2016-09-26 09:37:59 +01:00
paboyle
567b6cf23f
demangle moves to logging
2016-09-26 09:36:51 +01:00
paboyle
296396646d
FPE's on macos set up
2016-09-26 09:36:14 +01:00
Guido Cossu
04a437c92c
Minor modification to the filelist script
2016-09-23 11:12:45 +01:00
Guido Cossu
5c190a1b8c
Merge branch 'develop' into feature/hirep
2016-09-23 11:06:06 +01:00
Guido Cossu
15d8f5c88c
Small change to the configure.ac to include the canonical names
2016-09-23 11:05:36 +01:00
Guido Cossu
c4ac6e7e8f
Consolidating HMC interface
...
Uniformed interface for standard action in fundamental rep and Hirep
2016-09-23 10:47:42 +01:00
Guido Cossu
510e340e16
Debugged last commit for the Two index representation
2016-09-22 22:16:21 +01:00
Guido Cossu
6ffadca153
Restored number of colours to 3
2016-09-22 14:22:54 +01:00
Guido Cossu
b6597b74e7
Added support for the Two index Symmetric and Antisymmetric representations
...
Tested for HMC convergence: OK
Added also a test file showing an example for mixed representations
2016-09-22 14:17:37 +01:00
a034e9901b
Merge branch 'develop' into feature/hadrons
2016-09-20 13:49:33 +01:00
d2573189d8
build system: FFTW fix
2016-09-20 12:30:24 +01:00
65ca174dbb
gitignore update
2016-09-20 11:25:06 +01:00
Antonin Portelli
0724f7af75
QPX single precision implementation
2016-09-19 18:09:12 +01:00
2e74520821
removed libtool use (BG/Q compatibility)
2016-09-16 15:25:49 +01:00
Antonin Portelli
6dd75ad9e5
Merge branch 'develop' of github.com:paboyle/Grid into feature/bgq
2016-09-16 15:07:54 +01:00
Guido Cossu
fda408ee6f
Added first lines for supporting Two Index representations
2016-09-13 10:43:30 +01:00
Guido Cossu
b9c80318a2
Merge branch 'develop' into feature/hirep
2016-09-13 10:01:51 +01:00
Guido Cossu
5df5d52d41
Fix for the Intel compiler
2016-09-12 17:17:20 +01:00
Guido Cossu
f76f281e58
Cleaning files after fix
2016-09-09 11:34:25 +01:00
Guido Cossu
aa20cc8b52
Fixing compilation error with AVX512 flag
2016-09-09 02:58:52 -07:00
Guido Cossu
0fd179fb33
Merge branch 'develop' into feature/hirep
2016-09-01 12:59:53 +01:00
Guido Cossu
f45ef8d114
Minor modification in ActionBase.h
2016-09-01 11:46:46 +01:00
paboyle
7422953e36
Poisson solver example
2016-08-31 00:42:47 +01:00
paboyle
8535d433a7
Cold or hot must support any precisoin
2016-08-31 00:27:53 +01:00
paboyle
b573d1f35a
Wilson tree level added
2016-08-31 00:27:04 +01:00
paboyle
0c1d7e4daf
Mom space prop for Wilson action
2016-08-31 00:26:36 +01:00
paboyle
02e983a0cd
Momentum space prop and free prop convolution
2016-08-31 00:26:02 +01:00
paboyle
d15ab66aae
FFT moves higher in include order
2016-08-31 00:25:22 +01:00
paboyle
9005b82c6d
Multi dim FFT, and normalisation fix
2016-08-31 00:24:52 +01:00
paboyle
3475f45ce7
Demangle support for typeid stuff
2016-08-31 00:23:48 +01:00
paboyle
0744f38866
Demangle support is useful
2016-08-31 00:23:28 +01:00
paboyle
62febd2823
Wilson prop test
2016-08-31 00:23:09 +01:00
Guido Cossu
fd5614738d
Merge branch 'develop' into feature/hirep
2016-08-30 18:21:36 +01:00
Guido Cossu
005dcc51aa
Reset travis
2016-08-30 14:44:10 +01:00
Guido Cossu
655c893f86
Another test on travis
2016-08-30 14:38:42 +01:00
Guido Cossu
843f5783b4
Again travis test separating single and double
2016-08-30 14:29:09 +01:00
Guido Cossu
8986c9fedd
Single and double precision travis matrix
2016-08-30 14:25:24 +01:00
Guido Cossu
c80a1d427c
Retest original version of travis yaml
2016-08-30 14:05:05 +01:00
Guido Cossu
ae57032500
Separate single and double builds in travis
2016-08-30 14:00:34 +01:00
Guido Cossu
f75468728f
Another error on travis
2016-08-30 13:56:23 +01:00
Guido Cossu
5acd856663
Correction of error in travis
2016-08-30 13:49:49 +01:00
Guido Cossu
b0d3e4bb2c
Separating travis builds
2016-08-30 13:44:07 +01:00
Guido Cossu
b512ccbee6
HMC for Adjoint fermions works
...
Accepts and reproduces known results
Check initial instability of inverters
when starting from hot configurations
2016-08-30 11:31:25 +01:00
paboyle
8c89391c02
FFTW unresolved fixed when no fftw3.h
2016-08-24 16:41:47 +01:00
paboyle
bfac5195b8
tidy up
2016-08-24 16:38:36 +01:00
paboyle
a782ca3238
Merge branch 'feature/fft-flop-count' into develop
2016-08-24 15:06:17 +01:00
paboyle
744691097f
Printing
2016-08-24 15:05:56 +01:00
paboyle
ff6da364e8
FFT double and single precision gives good performance now in multithreaded code.
2016-08-24 15:05:00 +01:00
4d11a6f5f2
first commit for QPX intrinsics
2016-08-23 14:41:44 +01:00
paboyle
88be3b39bb
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-08-22 18:29:36 +01:00
paboyle
8a02824e08
Merge branch 'feature/FFT' into develop
2016-08-22 16:25:04 +01:00
paboyle
356e7940fd
fftw can be switched off
2016-08-22 16:24:49 +01:00
paboyle
73ce476890
Include fftw headers
2016-08-22 16:24:21 +01:00
paboyle
29c4ef41de
Adding a test for libfftw3
2016-08-22 16:21:01 +01:00
paboyle
e423a09974
FFT improved and test_FFT passing under MPI 8 processes, 8^4 for LatticeComplexD and LatticeSpinMatrixD
2016-08-18 02:23:21 +01:00
paboyle
17097a93ec
FFTW test ran over 4 mpi processes.
2016-08-17 01:33:55 +01:00
paboyle
94a6373a7f
Merge branch 'feature/eigen-cleanup' into develop
2016-08-15 23:58:34 +01:00
paboyle
4ab7dbfd57
Instantiate
2016-08-15 23:00:40 +01:00
paboyle
90e70790f3
Feature for z-Mobius prep
2016-08-15 22:31:29 +01:00
Guido Cossu
9c2e8d5e28
Nc=3 just to let all the test pass in Travis
2016-08-09 15:46:57 +01:00
Guido Cossu
147e2025b9
Added unit tests on the representation transformations
...
Status: Passing all tests
2016-08-08 16:54:22 +01:00
573b8c6020
build system: -O3 is not overriden by env CXXFLAGS
2016-08-06 01:26:24 +01:00
15218ec57f
more Travis MPI fix
2016-08-06 00:49:14 +01:00
ec68e08dd2
Travis MPI fix
2016-08-06 00:36:05 +01:00
paboyle
fc25d2295c
fftw download
2016-08-06 00:28:52 +01:00
paboyle
8dc2cfcedb
Adding fftw header pulling
2016-08-06 00:28:28 +01:00
17c843700e
missing doxygen.inc added
2016-08-05 15:38:21 +01:00
7b56f63a5c
configure Doxygen output fix
2016-08-05 15:35:29 +01:00
b1cfb4d661
first try at a nicer Doxygen implementation
2016-08-05 15:29:18 +01:00
836f93780c
first try at including MPI tests in Travis
2016-08-05 13:41:52 +01:00
paboyle
5a68715be3
Richards sweep test
2016-08-05 10:51:57 +01:00
paboyle
32bc7a6ab8
MPI back out of change that hangs
...
AVX2 for clang, gcc needs the -mfma flag.
2016-08-05 10:36:00 +01:00
b65e72e521
Merge pull request #43 from rprollins/bench/output-format
...
Benchmark_dwf_sweep and Benchmark_zmm output formats
2016-08-04 16:47:01 +01:00
d1aaff65e8
README update
2016-08-04 16:27:02 +01:00
7ff7c7d90d
Merge branch 'develop' into feature/hadrons
2016-08-04 16:22:10 +01:00
93d29bb699
build system improvements after discussion with Peter
2016-08-04 16:19:59 +01:00
a2e9430abe
Hadrons: fix after build system update
2016-08-03 17:14:32 +01:00
2485ef9c9c
Merge branch 'feature/new-build' into feature/hadrons
...
# Conflicts:
# Makefile.am
# scripts/copyright
2016-08-03 16:49:16 +01:00
3b376ed54e
build system: error if MPI not found
2016-08-03 15:23:38 +01:00
d5c1f614ba
gitignore update
2016-08-03 15:14:33 +01:00
2edc24225d
untracking ltmain.sh
2016-08-03 15:12:44 +01:00
629283726b
build system: local Grid link flag moved to configure.ac
2016-08-03 15:07:42 +01:00
6adb66dd08
build system: finer management of GMP/MPFR dependence
2016-08-03 15:06:45 +01:00
5be92bb708
link fix in README
2016-08-03 12:40:56 +01:00
f4c049ea6d
README update
2016-08-03 12:38:54 +01:00
bc092ad30f
build system fix
2016-08-03 11:47:38 +01:00
dad642ed1b
various build system fixes and improvements
2016-08-03 11:39:20 +01:00
63ae39abc7
proper propagation of OpenMP flags
2016-08-02 17:41:32 +01:00
9e5b934d21
improved LAPACK configuration
2016-08-02 17:26:54 +01:00
a7b483d67a
Tests in subdirectories are not built by default
2016-08-02 12:14:28 +01:00
bb99ce0680
bootstrap script fix
2016-08-01 09:51:06 +01:00
83307df1af
travis update for new build system
2016-08-01 09:38:40 +01:00
Guido Cossu
49b5c49851
Checked the hermiticity of the op in derivative, ok
...
Still CG fails to converge
2016-07-31 12:37:33 +01:00
e9f30cab2c
first working version for the new build system
2016-07-30 17:53:18 +01:00
Guido Cossu
089f0ab582
Debugged HMC for Creutz relation
2016-07-28 16:44:41 +01:00
Richard Rollins
df6c9f55d1
Use common benchmark output format for dwf_sweep and zmm
2016-07-20 17:38:56 +01:00
Guido Cossu
b93e18ed50
Modified the Dirac Kernel class to compile with different number of colours
...
Added the general push_back functionality to accomodate for all defined representations
Compiles, not tested
2016-07-18 16:36:28 +01:00
Guido Cossu
9c77bb69a5
Added all elements for Hirep HMC
...
TODO: Test and debug
2016-07-18 12:05:23 +01:00
paboyle
27f3ecc833
Merge branch 'feature/bugfix-ck-cj' into develop
2016-07-16 01:59:52 +01:00
paboyle
f9e90eeb1f
Sign error on the force for 4d fields fixed
2016-07-16 01:52:44 +01:00
paboyle
fad5c675eb
sign error on the 4d gparity force
2016-07-16 01:51:56 +01:00
paboyle
4908b77d46
Fixed conflicts. PLEASE avoid making wholesale cosmetic only changes, this created
...
a HUGE amount of difficult to resolve and understand conflicts .
Wholesale formatting, reordering functions etc... in a central file like Tensor_class
or Grid_vector_types while others are also editing without making substantial functionality
changes creates pain.
2016-07-15 20:59:07 +01:00
paboyle
f4dd5062d7
Merge branch 'develop' of https://github.com/paboyle/Grid into develop
2016-07-15 19:26:06 +01:00
paboyle
da34d75841
Merge branch 'feature/Ls-vectorised-actions' into develop
2016-07-15 19:09:47 +01:00
paboyle
980ff18956
Solving the instantiation no compile issue
2016-07-15 17:19:44 +01:00
Guido Cossu
7edf4c6c04
Added HMC utitities for the higher representations
...
TODO: Inherit types for the pseudofermions, Debugging, testing
2016-07-15 13:39:47 +01:00
paboyle
1a6c7204ac
Disable instantiation; Use cache version instead
2016-07-15 00:34:39 +01:00
paboyle
49310fbab3
Done with red black change over
2016-07-15 00:08:43 +01:00
paboyle
6049d5ac47
Update
2016-07-15 00:08:32 +01:00
paboyle
35d0d35238
Updated file list
2016-07-15 00:02:53 +01:00
paboyle
c0e878705e
Updated file list
2016-07-15 00:02:39 +01:00
paboyle
5c0c8efb9e
Updated file list
2016-07-15 00:02:11 +01:00
paboyle
dfd714e1ef
Multiple implementations for the 5d hopping terms, depending on cache friendly
...
ops and/or the 5th direction being vectorised
All use 4d redblack.
2016-07-15 00:00:09 +01:00
paboyle
79a8ca1a62
Rewrite for performance. Impl dependent instantiations give
...
4d linalg impls of the 5d hopping terms (and inverse)
Cache friendly loop orderings of the above
Dense matrix stored and apply to the above
-- Switch to Ls vectorised, and use dense matrix approach for the MooeeInv
and rotate/shift of the Mooee M5D routines.
2016-07-14 23:58:15 +01:00
paboyle
fb45eb2eb2
5d ls vec rename of impl class
2016-07-14 23:57:26 +01:00
paboyle
a307274c96
Fermion impl rename for ls vectorised 5d approaches
2016-07-14 23:56:13 +01:00
paboyle
3f2c44a5fe
Updating the class to 5d selection based on impl type
2016-07-14 23:55:26 +01:00
paboyle
48fb1cdc11
Update domain 5d vectorised impl type, move the type over to 4d redblack with
...
the dense OO inverse
2016-07-14 23:54:35 +01:00
paboyle
8a79e93cc2
Rename the 5d domain wall fermion vectorised Ls impl class
2016-07-14 23:53:00 +01:00
paboyle
3493b51879
Modest updates
2016-07-14 23:52:13 +01:00
paboyle
de3e79d300
red black for Ls vectorised is 4d red black. Update accordingly now I've made this choice
2016-07-14 23:49:42 +01:00
paboyle
dd62a61c5c
Added broadcast and rotation of simd vectors
2016-07-14 23:49:00 +01:00
paboyle
8f47d0b5ab
Rotation needed for hopping term in fifth dim with Ls vectorised fields
2016-07-14 23:45:36 +01:00
paboyle
42af132dab
Fix for chris kellys request to peek poke on checkerboarded fields
2016-07-14 23:44:48 +01:00
paboyle
9db2c6525d
updating benchmarks for red black 4d for Ls vectorised code
2016-07-14 23:44:02 +01:00
paboyle
adbc7c1188
Adding files for multiple implementations (cache opt) and Ls vectorisation
...
of the 5D cayley form chiral fermions for the 5d matrix. With Ls entirely
in the vector direction, s-hopping terms involve rotations.
The serial dependence of the LDU inversion for Mobius and 4d even odd
checkerboarding is removed by simply applying Ls^2 operations (vectorised
many ways) as a dense matrix operation.
This should give similar throughput but high flops (non-compulsory flops)
but enable use of the KNL cache friendly kernels throughout the code.
Ls is still constrained to be a multiple of Nsimd, which is as much as 8 for AVX512
with single precision.
2016-07-14 22:59:21 +01:00
Guido Cossu
9dc345e8e8
Debugged smearing and adding HMC functions for hirep
2016-07-13 17:51:18 +01:00
Christopher Kelly
8b9301a74c
Merge branch 'feature/bugfixes' into develop
2016-07-13 12:31:34 -04:00
Christopher Kelly
6f47fbb1e2
Disabled parallel for loops in ExtractSlice and InsertSlice due to race conditions. Likely will need to do so for localConvert too.
2016-07-13 10:49:18 -04:00
Guido Cossu
a9ae30f868
Added representations definitions for the HMC
2016-07-12 13:36:10 +01:00
Christopher Kelly
a3c0fb79b6
Fix to iVector and iMatrix pokeIndex and checkerboard local site indexing.
2016-07-11 17:15:22 -04:00
paboyle
62601bb649
Bug fix
2016-07-08 20:46:29 +01:00
paboyle
ef97e32152
Adding persistent communicators
2016-07-08 17:16:08 +01:00
Guido Cossu
daea5297ee
Wrote the projector in the adjoint representation algebra
2016-07-08 16:14:16 +01:00
Guido Cossu
5028969d4b
Added generators for the adjoint representation
2016-07-08 15:40:11 +01:00
paboyle
c667d9fdcc
Trying to make compile clean on travis; seem to have a make -j 4 problem with fftw
2016-07-07 23:26:39 +01:00
paboyle
7dbb94bab2
Update
2016-07-07 22:51:37 +01:00
paboyle
236dcc820b
typo fix
2016-07-07 22:46:11 +01:00
paboyle
a42a441a6a
Rename the reconfigure script to ./autogen.sh
2016-07-07 22:35:45 +01:00
paboyle
a0676beeb1
Open up dependency on Eigen and FFTW
2016-07-07 22:31:07 +01:00
Christopher Kelly
c5106d0c03
Bugfix
2016-07-07 16:06:30 -04:00
Guido Cossu
fbf96b1bbb
]Merge branch 'develop' into feature/hirep
2016-07-07 14:20:10 +01:00
Guido Cossu
3c49ddfaa4
Merge branch 'temporary-smearing' into develop
2016-07-07 14:04:59 +01:00
Guido Cossu
ffb8b3116c
Tested smeared RHMC Wilson1p1, accepting
2016-07-07 11:49:36 +01:00
Christopher Kelly
290493e162
Merge branch 'feature/multi_prec' into develop
2016-07-06 19:29:57 -04:00
Christopher Kelly
dd8cfff111
Another fix for pedantic compilers
2016-07-06 18:22:15 -04:00
Christopher Kelly
184642adb0
Fix for pedantic compilers
2016-07-06 18:15:15 -04:00
Christopher Kelly
4774a3bcd2
Generalized HotConfiguration and functions it calls to accept gauge fields with precision other than the default.
2016-07-06 18:01:08 -04:00
Christopher Kelly
25fafa9a89
Comment
2016-07-06 16:19:41 -04:00
Christopher Kelly
713520d3d2
Added tester for mixed CG
2016-07-06 16:18:19 -04:00
Christopher Kelly
85ed8175cb
Implemented mixed precision CG. Fixed filelist to exclude lib/Old directory and include Config.h.
2016-07-06 15:57:04 -04:00
Christopher Kelly
df5c788ef2
Merge branch 'develop' into feature/multi_prec
2016-07-06 14:52:28 -04:00
Christopher Kelly
15f22425c8
Added option to prevent CG from exiting when it fails to converge
2016-07-06 14:50:01 -04:00
Guido Cossu
e87182cf98
Debugged the copy constructor of the Lattice class
2016-07-06 15:31:00 +01:00
Guido Cossu
e3d5319470
Debugged the real() and imag() functions and added tests to Test_Simd
2016-07-06 14:16:03 +01:00
Guido Cossu
ffedeb1c58
Minor modifications
2016-07-06 11:41:27 +01:00
Guido Cossu
3e3b367aa9
Small changes in the Log files
2016-07-05 15:05:28 +01:00
Guido Cossu
3e80947c2b
Cleaned up HMC output. Tested smeared HMCs for single precision (OK)
2016-07-05 12:03:54 +01:00
Guido Cossu
fdfbf11c6d
Merge branch 'develop' into temporary-smearing
2016-07-04 18:45:10 +01:00
Guido Cossu
9cb90f714e
Merge remote-tracking branch 'origin/develop' into temporary-smearing
2016-07-04 17:28:40 +01:00
Guido Cossu
6ce174cd60
Testing smearing for RHMC routines
2016-07-04 16:36:49 +01:00
Guido Cossu
17ca5240f7
Testet smeared EOWilsonRatio, accepts
2016-07-04 16:25:15 +01:00
Guido Cossu
2daffdf95d
Tested smeared WilsonRatio action, accepts
2016-07-04 16:17:28 +01:00
Guido Cossu
149f826601
Tested smearing for Nf2 WilsonFermionAction, non EO: accepts
2016-07-04 16:09:19 +01:00
Guido Cossu
cd8ee27080
Simple change in iGamma for smearing
2016-07-04 16:02:57 +01:00
Guido Cossu
0fa66e8f3c
Debugged smearing for EOWilson, accepts
2016-07-04 15:35:37 +01:00
Guido Cossu
8dd099267d
Corrected a bug in the Expression Templates (acso and asin were wrong)
2016-07-03 12:28:25 +01:00
Guido Cossu
1a6d65c6a4
Converted set_uw and set_fj to all complex functions
2016-07-03 10:27:43 +01:00
paboyle
fc4a043663
Colors and banner clean up
2016-07-02 16:15:38 +01:00
paboyle
61ba50665e
Merge branch 'hotfix/v0.5.1' into develop
2016-07-01 16:34:30 +01:00
paboyle
446c768cd3
Merge branch 'hotfix/v0.5.1'
...
Double precision compile fix
2016-07-01 16:33:59 +01:00
paboyle
bfe14000a9
Double compile fix
2016-07-01 16:33:51 +01:00
Guido Cossu
092fa0d8da
Debugged set_fj,
...
to be fixed: BUG in imag()
2016-07-01 16:06:20 +01:00
e0b7004f96
Merge branch 'master' into feature/hadrons
2016-07-01 15:54:34 +01:00
paboyle
1ceff48133
Merge branch 'release/v0.5.0' into develop
2016-06-30 15:15:59 -07:00
paboyle
680645f849
Merge branch 'release/v0.5.0'
2016-06-30 15:15:03 -07:00
paboyle
3fc6e03ad1
Version file
2016-06-30 14:44:09 -07:00
paboyle
2d6614f3a1
Merge branch 'feature/knl-cache-opt' into develop
2016-06-30 14:36:20 -07:00
paboyle
4e041b5103
Merge branch 'feature/knl-cache-opt' of https://github.com/paboyle/Grid into feature/knl-cache-opt
2016-06-30 14:36:08 -07:00
paboyle
712b9a3489
Asm only for avx512
2016-06-30 14:35:02 -07:00
paboyle
bdaa5b1767
Updated to have perfect prefetching for the s-vectorised kernel with any cache blocking.
2016-06-30 14:35:02 -07:00
paboyle
8fcefc021a
Improved the prefetching when using cache blocking codes
2016-06-30 14:35:02 -07:00
paboyle
1445189361
COntrol the prefetch strategy
2016-06-30 14:35:02 -07:00
paboyle
05c884a62a
Prefetch change
2016-06-30 14:35:01 -07:00
paboyle
a25bec87d9
Prefetch during save
2016-06-30 14:35:01 -07:00
paboyle
2d8bb4c594
Tweaks
2016-06-30 14:35:01 -07:00
paboyle
51cb2d4328
update file lists
2016-06-30 14:35:01 -07:00
paboyle
6d58cb2a68
Enable reordering of the loops in the assembler for cache friendly.
...
This gets in the way of L2 prefetching however. Do next next link in stencil
prefetching.
2016-06-30 14:35:01 -07:00
paboyle
c8b35d960c
Merge branch 'develop' of https://github.com/paboyle/Grid into feature/knl-cache-opt
2016-06-30 14:30:49 -07:00
paboyle
532f41dd61
Asm only for avx512
2016-06-30 14:00:34 -07:00
paboyle
661b0ab45d
Updated to have perfect prefetching for the s-vectorised kernel with any cache blocking.
2016-06-30 13:07:42 -07:00
Guido Cossu
565e9329ba
Changed the colouring classes
2016-06-30 16:51:03 +01:00
paboyle
4bc08ed995
Improved the prefetching when using cache blocking codes
2016-06-26 12:54:14 -07:00
paboyle
b2933a0557
COntrol the prefetch strategy
2016-06-25 12:55:25 -07:00
paboyle
db057cc276
Prefetch change
2016-06-25 12:54:50 -07:00
paboyle
22e88eaf54
Prefetch during save
2016-06-25 12:54:14 -07:00
paboyle
09fe3caebd
Tweaks
2016-06-25 11:08:05 -07:00
Guido Cossu
5e02392f9c
Fixed compilation error for benchmark_dwf
...
Some parts were assuming floating point precision
2016-06-20 12:30:51 +01:00
paboyle
17a8f51a9b
update file lists
2016-06-19 11:59:10 -07:00
paboyle
1b7f88dd00
Enable reordering of the loops in the assembler for cache friendly.
...
This gets in the way of L2 prefetching however. Do next next link in stencil
prefetching.
2016-06-19 11:45:58 -07:00
d6737e4bd8
Travis fix for Linux clang builds
2016-06-14 19:15:08 +01:00
75fc295f6e
Merge branch 'hadrons' into feature/hadrons
2016-06-14 17:51:15 +01:00
d539888e57
Merge pull request #37 from rprollins/fix/mpi_communicator
...
Removed write to stdout in constructor for MPI CartesianCommunicator
2016-06-14 17:25:40 +01:00
Richard Rollins
86187d7cca
Removed write to stdout in constructor for MPI CartesianCommunicator
2016-06-14 15:34:20 +01:00
paboyle
87418e7df1
Slightly faster prefetching perf.
2016-06-13 02:32:52 -07:00
paboyle
55f65b81b5
Improvements to the assembler interface that let us move chunks of the
...
site and s loop into the kernels. This will save on function call overhead and
guarantee L2 prefetching strategy is right since OMP can't distribute the
sub-chunks of work.
2016-06-09 01:12:36 -07:00
Azusa Yamaguchi
d9408893b3
Prefetching in the normal kernel implementation.
2016-06-08 05:43:48 -07:00
paboyle
05acc22920
placeholder for non temporal loads optimisation
2016-06-07 13:18:21 -07:00
paboyle
8ac021de73
Added a test an fixed it for red black precon Ls innermost vectorised DWF
2016-06-07 13:16:56 -07:00
paboyle
e503ef5590
Cleaned up
2016-06-07 00:11:36 +01:00
paboyle
a7682b0060
Only instantiate the one routine to avoid duplicate symbol under g++5/MacOS
2016-06-06 23:48:21 +01:00
0b731b5d80
Hadrons: genetic scheduler parameter fix
2016-06-06 17:46:53 +01:00
8e2078be71
Hadrons: environment with fully generic object store
2016-06-06 17:45:37 +01:00
paboyle
d4c9d71fc8
Merge branch 'master' of https://github.com/paboyle/Grid
2016-06-06 07:06:54 -07:00
paboyle
786ca52c43
Problems remain in the red black preconditioning of the Ls vectorisation
2016-06-06 07:05:51 -07:00
Peter Boyle
048ac04abc
Update Benchmark_dwf.cc
2016-06-03 13:44:41 +01:00
Peter Boyle
f78d89bcbe
Update Lebesgue.cc
...
kill verbose
2016-06-03 13:33:42 +01:00
paboyle
53d06046b0
Compiling updates for KNL
2016-06-03 03:47:54 -07:00
paboyle
5d3a1a025d
timers flag
2016-06-03 03:25:38 -07:00
paboyle
139cc5f1ae
Large change with KNL preparation
2016-06-03 03:24:26 -07:00
1826ed06a3
Merge branch 'master' into hadrons
2016-05-27 16:50:31 +01:00
1c0e922585
Merge pull request #35 from aportelli/master
...
empty SIMD fix
2016-05-27 16:49:13 +01:00
9d5f693cbe
empty SIMD fix
2016-05-24 10:56:27 +01:00
3ff96c502b
Merge branch 'master' into hadrons
2016-05-12 19:24:18 +01:00
15a0908bfc
Merge branch 'master' into hadrons
2016-05-12 18:35:46 +01:00
bb2125962b
Hadrons: finished implementation of 5D quarks
2016-05-12 18:34:42 +01:00
232fda5fe1
Hadrons: DWF action
2016-05-12 18:34:10 +01:00
2b31bf61ff
Hadrons: message fix
2016-05-12 18:33:49 +01:00
afe5a94745
Hadrons: getModule with upcast
2016-05-12 18:33:36 +01:00
7ae667c767
Hadrons: module template update
2016-05-12 18:33:08 +01:00
07f0b69784
Merge branch 'master' into hadrons
2016-05-12 13:02:18 +01:00
5c06e89d69
Hadrons: code cleaning
2016-05-12 12:49:49 +01:00
3d75e0f0d1
Hadrons: MQuark fix
2016-05-12 12:02:15 +01:00
362f255100
Hadrons: module parameters can now be accessed from outside
2016-05-12 11:59:28 +01:00
3d78ed03ef
Merge branch 'master' into hadrons
2016-05-11 15:21:46 +01:00
835003b3c5
Hadrons: removed useless gauge global parameters
2016-05-11 15:01:52 +01:00
328d213c9e
Hadrons: FS case sensitivity fix
2016-05-11 14:44:14 +01:00
56a8d7a5bc
Hadrons: build system fix
2016-05-11 10:27:14 +01:00
78198d1b04
Hadrons: size fix for module graph with one vertex
2016-05-10 20:13:28 +01:00
84fa2bdce6
Hadrons: modules moved in their own directory & utility script to add new modules
2016-05-10 20:12:48 +01:00
29dfe99e7c
Hadrons: more scheduler optimizations
2016-05-10 19:19:38 +01:00
d604580e5a
Hadrons: all objects/modules mapped to an integer address system to remove string operations from scheduling
2016-05-10 19:07:41 +01:00
7dfdc9baa0
Hadrons: lattice dynamic cast fix
2016-05-10 10:41:20 +01:00
9e986654e6
Hadrons: first version of the genetic scheduler
2016-05-09 14:49:06 +01:00
df3fbc477e
Hadrons: code cleaning
2016-05-07 13:26:56 -07:00
bb580ae077
Hadrons: significant overhaul of the object registration system, previous version didn't allow dry runs
2016-05-07 13:19:38 -07:00
2c226753ab
Hadrons: comments on graph theory algorithm complexity
2016-05-06 06:35:11 -07:00
ea0cea668e
Hadrons: minor code cleaning
2016-05-05 16:13:14 -07:00
75cd72a421
Hadrons: memory management for fermion matrices, dynamic ownership in garbage collector
2016-05-04 19:11:03 -07:00
cbe52b0659
Hadrons: debug message removed
2016-05-04 12:20:33 -07:00
3aa6463ede
Hadrons: general lattice store & a lot of code cleaning
2016-05-04 12:17:27 -07:00
312637e5fb
Merge branch 'master' into hadrons
...
# Conflicts:
# lib/Log.h
2016-05-04 12:16:18 -07:00
798d8f7340
Hadrons: Modules: better log messages
2016-05-03 18:17:58 -07:00
ba878724ce
Hadrons: sources are now independent modules
2016-05-03 18:17:28 -07:00
b865dd9da8
Hadrons: solver renaming
2016-05-03 18:16:57 -07:00
8b313a35ac
Hadrons: random and NERSC gauge configurations
2016-05-03 17:08:42 -07:00
02ec23cdad
Hadrons: Fermion actions and gauge fields are modules now
2016-05-03 17:08:42 -07:00
6e83b6a203
Hadrons: namespace reorganisation, now everything is in Grid::Hadrons, the 'using Grid::operator<<' statement is used to prevent a very nasty compilation error with GCC.
2016-05-02 19:31:21 -07:00
48fcc34d72
CMeson: first implementation, still need proper output
2016-05-01 18:31:40 -07:00
d08d93c44c
Merge branch 'master' into hadrons
2016-05-01 18:30:44 -07:00
0ab10cdedb
Merge branch 'master' into hadrons
2016-05-01 16:08:05 -07:00
22653edf12
Merge branch 'master' into hadrons
2016-05-01 15:55:58 -07:00
12d2a95846
Merge branch 'master' into hadrons
2016-05-01 15:05:02 -07:00
978cf52f6b
Merge branch 'master' into hadrons
2016-05-01 14:53:38 -07:00
63b730de80
Hadrons: for the moment, test with unit gauge
2016-05-01 14:50:57 -07:00
7905c5b8e5
Hadrons: Z2 source code fix
2016-05-01 14:49:45 -07:00
5e4b58ac40
Hadrons: Z2 source expression fix
2016-05-01 12:49:26 -07:00
468d8dc682
Merge branch 'master' into hadrons
2016-05-01 12:03:24 -07:00
beb11fd4ef
Merge branch 'master' into hadrons
2016-05-01 10:32:24 -07:00
d7662b5175
Merge branch 'master' into hadrons
2016-04-30 00:24:59 -07:00
dc5f32e5f0
Merge branch 'master' into hadrons
2016-04-30 00:18:31 -07:00
1869d28429
Hadrons: first prototype with working inversions
2016-04-30 00:17:04 -07:00
405b175665
Merge branch 'master' into hadrons
2016-04-30 00:16:06 -07:00
e33b0f6ff7
cleaner output
2016-04-16 08:41:53 +01:00
9ee54e0db7
debug output removed
2016-04-16 08:41:28 +01:00
feae35d92c
Hadrons: pass strings by value
2016-04-16 08:41:12 +01:00
3834d81181
Merge branch 'master' into hadrons
2016-04-14 15:15:45 +01:00
neo
339be37dba
Debugging smeared HMC
2016-04-13 17:00:14 +09:00
neo
a87b744621
HMC runs but does not accept with smearing on
2016-04-07 16:45:11 +09:00
97d0d56bcb
Debugging Smearing routines (set_fj)
2016-04-06 17:58:43 +09:00
7c7ea35ffb
Putting the Traceless Antihermitian part outside the deriv in pseudofermion actions
2016-04-05 16:28:09 +09:00
4b1cf580e0
Debugging the Smearing routines
2016-04-05 16:19:30 +09:00
179e82b5ca
Merge branch 'master' into hadrons
2016-03-08 12:55:33 +00:00
f2c59c8730
Merge branch 'master' into hadrons
2016-03-02 17:15:05 +00:00
fdd0848593
Hadrons: license text update
2016-02-25 12:07:21 +00:00
92f666905f
copyright script update to 80 column text
2016-02-25 12:06:24 +00:00
5980fa8640
test implementation of DWF inverter
2016-02-25 11:56:16 +00:00
2d8bb356e3
Smearing routines compile (still untested)
2016-02-25 02:43:59 +09:00
a7251f28c7
Stout smearing compiles (untested)
2016-02-24 03:16:50 +09:00
a0d8eb2c24
minor code cleaning
2016-02-23 16:33:00 +00:00
1e10b4571d
fix after Grid update
2016-02-23 16:21:45 +00:00
02f8b84ac9
Merge branch 'master' into hadrons
2016-02-23 16:13:39 +00:00
cfd368596d
Merge branch 'master' into hadrons
2016-02-22 15:25:02 +00:00
neo
c1b1b89d17
More on smearing routines, writing APEsmear (dev)
2016-02-19 17:15:27 +09:00
neo
771235017d
Adding smearing routines (development)
2016-02-19 15:30:41 +09:00
ae682674e0
Hadrons: first full implementation of the scheduler
2016-01-13 20:23:51 -08:00
17c43f49ac
Hadrons: application class now take parameter file name as argument
2016-01-13 20:22:37 -08:00
30146e977c
gitignore update
2016-01-13 20:20:43 -08:00
54eacec261
Hadrons: namespace std not used anymore in compiled sources
2015-12-23 14:30:33 +00:00
76c78f04e2
Hadrons: first complete prototype for run loop
2015-12-23 14:21:35 +00:00
379580cd89
Merge branch 'master' into hadrons
2015-12-23 14:20:22 +00:00
14a80733f9
Merge branch 'master' into hadrons
2015-12-08 13:57:53 +00:00
d4db009a58
Hadrons: starting scheduler implementation
2015-12-07 18:26:38 +00:00
20ce7e0270
Hadrons: algorithm to determine all possible topological ordering
2015-12-07 15:46:36 +00:00
bb195607ab
Hadrons: fix in topological sort algorithm name
2015-12-02 19:40:11 +00:00
6f090e22c0
Hadrons: graph topological sort
2015-12-02 19:33:34 +00:00
339e983172
Merge branch 'master' into hadrons
2015-12-02 14:38:04 +00:00
4a7f3d1b7b
Merge branch 'master' into hadrons
...
# Conflicts:
# configure
2015-12-02 10:57:51 +00:00
c4e2202550
First graph class implementation and test
2015-11-05 14:28:14 +00:00
538b16610b
First commit for measurement software 'Hadrons'
2015-10-27 17:33:18 +00:00