644fd6d32e
Build avx512 clean
2016-03-25 09:35:33 -07:00
f54e0ec9bd
Try lanczos to set up hdcr subspace
2016-03-17 10:36:16 +00:00
090e7aa930
Merge remote-tracking branch 'origin/chulwoo-dec12-2015'
...
Merge Chulwoo's Lanczos related improvements.
Merge Nd!=4 fixes for pure gauge HMC from Evan.
2016-03-08 09:55:14 +00:00
dc72293398
More timing info
2016-03-06 10:46:55 -08:00
497e7e4c53
BG/Q compatibility fix
2016-02-23 15:57:38 +00:00
02a57ffa6f
machine generated. Should remove from git .. but annoys downloaders
2016-02-21 07:57:02 -06:00
9f0d9ade68
Added configure flag for LAPACK. Tested ImplicitlyRestartedLanczos::calc()
...
Checking in before cleaning up
2016-02-20 02:50:32 -05:00
db5e8050a8
Attempts at some optimisation
2016-02-18 22:33:58 +00:00
41c2b09184
Shmem comms [NO MPI] target added. The dwf test runs and passes.
...
Not really shaken out to my satisfaction though as I want more tests done, so don't declare as working.
But committing my current while I try a few experimentals.
2016-02-14 14:24:38 -06:00
7f927a541c
Shmem related fixes for shmem compile
2016-02-11 07:37:39 -06:00
6371676a75
Correcting some compilation errors for clang-sse
2016-02-10 11:37:03 +09:00
6012b0ec23
Checking in changes before changing to chulwoo-dec12-2015
2016-01-25 09:40:58 -05:00
411ac49dd7
GparityWilsonTM typedef added. Not yet tested
...
Conflicts:
configure
lib/qcd/action/fermion/WilsonKernels.h
2016-01-25 01:36:28 -05:00
5c57d4f403
Merge branch 'master' of https://github.com/paboyle/Grid into scidac1_2
...
Conflicts:
lib/qcd/action/fermion/WilsonKernels.h
2016-01-11 11:36:45 -05:00
dafc74020c
Overlap comms compute improvements in hand op kernels, and better timing from Edison and Cori
2016-01-10 16:54:27 -08:00
5924e5a562
Merge branch 'master' of https://github.com/paboyle/Grid into scidac1_2
...
Conflicts:
configure
lib/qcd/action/Actions.h
lib/qcd/action/fermion/WilsonKernels.h
2016-01-06 03:44:57 -05:00
02452afd36
Optional overlap of comms with compute
2016-01-04 14:18:40 +00:00
5710966324
Options to use mersenne twister OR ranlux48 via --enable-rng flag at configure time.
...
Can save and restore RNG state via new (serial) I/O routines in a NERSC header style file.
Store a Parallel (one per site) and a single serial RNG file.
2015-12-19 18:32:25 +00:00
eb1759d7ea
Added Gparity instantiation to no HANDOPT case
...
deleted configure (as intended?)
2015-12-16 00:04:09 -05:00
34a0fde2ad
Fixes to fermion force terms after sign of gamma_mu (0...3) change.
...
Thought I had already committed these.
Believe I have got the Gparity fermion force working.
* tests/Test_gpdwf_force.cc -- correctly predicts dS for two flavour pseudofermion
based on a small dt update of U field.
* tests/Test_hmc_EODWFRatio_Gparity.cc -- ran 1 trajectory on 8^4 with dH=0.21.
Need to accumulate a full plaquette log to believe fully which will take some hours of run time.
2015-12-15 23:14:12 +00:00
967be91692
update merge
2015-11-26 09:51:41 +00:00
98d8ba6d14
Remove autogen files from CVS
2015-11-06 05:29:07 -06:00
d4289a33b8
AMD FMA4 addition
2015-10-09 00:44:20 +02:00
5ef42add2d
Changes to remove warnings under icc; disambiguate AVX512 from IMCI correctly
...
and drop swizzles in AVX512. Don't know why these compiled.
2015-09-23 05:23:45 -07:00
2f38ebc446
Reintroducing the hand unrolled loops
2015-09-08 17:45:30 +01:00
76d752585b
Started a tidy up in the HMC sector. Now comfortable with the two level integrators;
...
to a little figure out what Guido had done & why -- but there is a neat saving of force
evaluations across the nesting time boundary making use of linearity of the leapP in dt.
I cleaned up the printing, reduced the volume of code, in the process sharing printing
between all integrators. Placed an assert that the total integration time for all integrators
must match at end of trajectory.
Have now verified e-dH = 1 for nested integrators in Wilson/Wilson runs with both
Omelyan and with Leapfrog so substantial confidence gained.
2015-08-29 17:18:43 +01:00
772cd8199d
Merge problem fixed
2015-08-01 22:30:00 +09:00
490009745c
Small change in the HMC interface.
...
Example of multiple levels in the WilsonFermion hmc test.
Merge remote-tracking branch 'upstream/master'
Conflicts:
lib/qcd/hmc/HMC.h
lib/qcd/hmc/integrators/Integrator.h
lib/qcd/hmc/integrators/Integrator_algorithm.h
tests/Test_simd.cc
2015-07-30 17:16:57 +09:00
a9c15626e1
Merge remote-tracking branch 'upstream/master'
...
Conflicts:
configure
2015-07-21 11:57:34 +09:00
9adaeb061a
More NEON functionalities
2015-07-21 11:52:15 +09:00
6b8190a032
gitignore update
2015-07-17 11:15:17 +09:00
ab3ad78ece
Skeleton of HMC/Integrators
2015-07-03 16:51:41 +09:00
638d2cda11
Change the SIMD command correctly with precision = double vs. single and
...
connect the "Real" default precisoin to a configure flag.
Have RealF, RealD and Real types, where Real is compile target dependent single/double,
RealF is single and RealD is double etc..
2015-07-01 22:45:15 +01:00
5cce44edb4
Updates to ldop tests
2015-06-10 12:26:25 +01:00
48bf4878c1
Experimental support for ARM
2015-06-09 15:46:21 +09:00
63a61fcc2a
PartialFraction Hw with Zolo and Tanh approx converged under CG and passed EO breakdown
...
and hermiticity tests.
2015-06-04 13:28:37 +01:00
3055d2cf2c
Addedd Ta functionality to the tensor types
...
Merge remote-tracking branch 'upstream/master'
Conflicts:
configure
2015-06-04 18:11:32 +09:00
5a5ee83d28
Small modification to the configure files
2015-06-04 14:17:58 +09:00
1d0df449e8
Reorganise of file naming
2015-06-03 12:47:05 +01:00
727bc32150
Merge remote-tracking branch 'upstream/master'
2015-05-29 11:41:39 +09:00
f41e4e8b1b
Some modifications to the configure to check SIMD support
2015-05-29 11:41:02 +09:00
b72ca15bd2
Improving the reduction to go through our on permute.
...
Must also do this for avx512
2015-05-27 16:07:17 +01:00
19bd6f103a
Check at configure time if CPU supports the requested SIMD optimization
2015-05-27 18:30:11 +09:00
3cb34af82c
Added a .gitignore file to eliminate autoconf files from commits
2015-05-27 11:10:51 +09:00
8abf6403d5
Corrected AVX regression error. Tested.
2015-05-27 10:49:33 +09:00
da46b56e85
Adding support for doxygen generation
2015-05-27 10:34:56 +09:00
1a24801246
checked performance of new vector libaries.
...
Added check for c++11 support on the configure.ac
2015-05-26 12:02:54 +09:00
baa382f055
Added check of mpfr and gmp at configure time
...
It generates automatically the linker flags or complains if not found.
2015-05-19 13:54:55 +09:00
99aecf1f2e
Minor modification to the configure.ac
...
Enables silent rules (use make V=1 to override)
Prints a summary after configure is completed
2015-05-18 17:15:14 +09:00
b4cd37276b
Corrected some compilation errors (zolotarev.h) and SSE4 vsplat and conj to make cshift test pass.
2015-05-18 16:48:14 +09:00