1
0
mirror of https://github.com/paboyle/Grid.git synced 2024-09-19 16:55:37 +01:00
Grid/HMC
2024-07-11 15:34:48 +00:00
..
ComputeWilsonFlow.cc Shuhei's ComputeWilsonFlow main programme 2024-06-05 15:51:11 -04:00
FTHMC2p1f_3GeV.cc 3GeV run setup 2023-10-16 20:49:52 +03:00
FTHMC2p1f.cc FTHMC Status at lattice conference commit 2023-10-13 18:21:56 +03:00
HMC2p1f_3GeV.cc Frontier compiel 2023-11-09 13:58:44 -05:00
Makefile.am Changes locally 2019-04-17 12:03:20 +01:00
Mobius2p1f_DD_EOFA_96I_3level.cc Multilevel integrator test 2022-11-15 16:39:39 -05:00
Mobius2p1f_DD_EOFA_96I_double.cc FTHMC commit 2023-10-13 18:21:57 +03:00
Mobius2p1f_DD_EOFA_96I_mixed.cc FTHMC Status at lattice conference commit 2023-10-13 18:21:56 +03:00
Mobius2p1f_DD_EOFA_96I_mshift.cc Trajectory runs to completion on Crusher within wall clock time 2023-04-07 15:42:45 -04:00
Mobius2p1f_DD_RHMC_96I_mixed.cc Current tests compiling 2022-09-27 10:56:55 -04:00
Mobius2p1f_DD_RHMC_96I.cc mores unused statements removed 2023-04-07 14:27:18 +01:00
Mobius2p1f_DD_RHMC.cc Current tests compiling 2022-09-27 10:56:55 -04:00
Mobius2p1f_EOFA_96I_hmc_double.cc FTHMC commit 2023-10-13 18:21:57 +03:00
Mobius2p1f_EOFA_96I_hmc.cc FTHMC Status at lattice conference commit 2023-10-13 18:21:56 +03:00
Mobius2p1f.cc Respect command line 2024-07-11 15:34:48 +00:00
Mobius2p1fEOFA_F1.cc Current tests compiling 2022-09-27 10:56:55 -04:00
Mobius2p1fEOFA.cc Current tests compiling 2022-09-27 10:56:55 -04:00
Mobius2p1fIDSDRGparityEOFA_40ID.cc Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet 2022-10-04 17:41:48 -04:00
Mobius2p1fIDSDRGparityEOFA_48ID.cc Merge branch 'feature/dirichlet' of https://github.com/paboyle/Grid into feature/dirichlet 2022-10-04 17:41:48 -04:00
Mobius2p1fRHMC.cc Current tests compiling 2022-09-27 10:56:55 -04:00
README F1 ensemble running with 96%~ acceptance etc.. 2019-05-22 09:56:26 +01:00
RNGstate.cc Create a new RNG file 2022-06-30 13:49:50 -04:00
site_autocorrelation.cc Protect vs. missing LIME libarary 2024-06-11 11:08:00 -04:00
site_plaquette.cc Protect vs. missing LIME libarary 2024-06-11 11:08:00 -04:00

********************************************************************
TODO: 
********************************************************************

i) Got mixed precision in 2f and EOFA force and action solves.
   But need mixed precision in the heatbath solve. Best for Fermop to have a "clone" method, to
   reduce the number of solver and action objects. Needed ideally for the EOFA heatbath.
   15% perhaps
   Combine with 2x trajectory length?

ii) Rational on EOFA HB  -- relax order
                         -- Test the approx as per David email

Resume / roll.sh 

----------------------------------------------------------------

- 16^3 Currently 10 traj per hour

- EOFA use a different derivative solver from action solver
- EOFA fix Davids hack to the SchurRedBlack guessing

*** Reduce precision/tolerance  in EOFA with second CG param.                          (10% speed up)
*** Force gradient - reduced precision solve for the gradient                          (4/3x speedup)


*** Need a plan for gauge field update for mixed precision in HMC                      (2x speed up)
    -- Store the single prec action operator.
    -- Clone the gauge field from the operator function argument.
    -- Build the mixed precision operator dynamically from the passed operator and single prec clone.

*** Mixed precision CG into EOFA portion         
*** Further reduce precision in forces to 10^-6 ?

*** Overall: a 3x or so is still possible => 500s -> 160s and 20 traj per hour on 16^3.

- Use mixed precision CG in HMC                           
- SchurRedBlack.h: stop use of operator function; use LinearOperator or similar instead.
- Or make an OperatorFunction for mixed precision as a wrapper

********************************************************************
* Signed off 2+1f HMC with Hasenbush and strange RHMC 16^3 x 32 DWF Ls=16 Plaquette 0.5883 ish
* Signed off 2+1f HMC with Hasenbush and strange EOFA 16^3 x 32 DWF Ls=16 Plaquette 0.5883 ish
* Wilson plaquette cross checked against CPS and literature GwilsonFnone
********************************************************************

********************************************************************
* RHMC: Timesteps & eigenranges matched from previous CPS 16^3 x 32 runs:
********************************************************************

****
Strange (m=0.04)  has eigenspan 
**** 
16^3 done as 1+1+1 with separate PV's. 
/dirac1/archive/QCDOC/host/QCDDWF/DWF/2+1f/16nt32/IWASAKI/b2.13/ls16/M1_8/ms0.04/mu0.01/rhmc_multitimescale/evol5/work
****
2+1f 16^3  - [ 4e^-4, 2.42 ]    for strange

****
24^3 done as 1+1+1 at strange, and single quotient https://arxiv.org/pdf/0804.0473.pdf Eq 83,
****
double lambda_low =   4.0000000000000002e-04 <- strange
double lambda_low =   1.0000000000000000e-02 <- pauli villars
And high = 2.5

Array bsn_mass[3] = { 
double bsn_mass[0] =   1.0000000000000000e+00
double bsn_mass[1] =   1.0000000000000000e+00
double bsn_mass[2] =   1.0000000000000000e+00
}
Array frm_mass[3] = { 
double frm_mass[0] =   4.0000000000000001e-02
double frm_mass[1] =   4.0000000000000001e-02
double frm_mass[2] =   4.0000000000000001e-02
}

***
32^3 
/dirac1/archive/QCDOC/host/QCDDWF/DWF/2+1f/32nt64/IWASAKI/b2.25/ls16/M1_8/ms0.03/mu0.004/evol6/work
***
Similar det scheme
double lambda_low =   4.0000000000000002e-04
double lambda_low =   1.0000000000000000e-02

Array bsn_mass[3] = { 
double bsn_mass[0] =   1.0000000000000000e+00
double bsn_mass[1] =   1.0000000000000000e+00
double bsn_mass[2] =   1.0000000000000000e+00
}
Array frm_mass[3] = { 
double frm_mass[0] =   3.0000000000000002e-02
double frm_mass[1] =   3.0000000000000002e-02
double frm_mass[2] =   3.0000000000000002e-02
}

********************************************************************
* Grid: Power method bounds check
********************************************************************
- Finding largest eigenvalue approx 25 not 2.5
- Conventions:

Grid MpcDagMpc based on:

   (Moo-Moe Mee^-1 Meo)^dag(Moo-Moe Mee^-1 Meo)

- with  Moo = 5-M5 = 3.2
- CPS use(d) Moo = 1
- Eigenrange in Grid is 3.2^2 rescaled so factor of 10 accounted for