A theoretical interpretation of variance-based convergence criteria in perturbation-based theories

Wang, Xiaohui; Sun, Zhaoxi

Physics > Chemical Physics

arXiv:1803.03123 (physics)

[Submitted on 22 Feb 2018 (v1), last revised 4 Oct 2018 (this version, v5)]

Title:A theoretical interpretation of variance-based convergence criteria in perturbation-based theories

Authors:Xiaohui Wang, Zhaoxi Sun

View PDF

Abstract:In QM/MM indirect free energy simulation, QM/MM corrections can be obtained from integration of partial derivatives of alchemical Hamiltonians or from perturbation-based estimators including free energy perturbation (FEP) and acceptance ratio methods. With FEP or exponential averaging, researchers tend to only sample MM states and calculate single point energy to get the free energy estimates. In this case the sample size hysteresis arises and the convergence is determined by bias elimination rather than variance minimization. Various criteria are proposed to evaluate the convergence issue and numerical studies are reported. It has been found that criteria including variance of distribution, effective sample size, information entropies and so on can be used and they are variance-of-distribution-dependent. However, no theoretical interpretation is presented. In this paper we present theoretical interpretations to dig the underlying statistical nature behind the problem. The convergence criteria are proven to be related with variance of distribution in Gaussian approximated Exponential averaging. Further, we prove that these estimators are nonlinearly dependent on the variance of the free energy estimate. As these estimators are often orders of magnitude overestimated, the variance of the FEP estimate is orders of magnitude underestimated. Hence, computing this statistical uncertainty is meaningless. In numerical calculation from timeseries data the effective sample size is bounded by 1 and N and thus the variance of the free energy estimate is proven to be bounded by 0 and 1 (kBT)2 for EXP and 0 and 2 (kBT)2 for BAR, which indicates an inevitable underestimation. Specifically, the upper bounds for these estimators are sample-size dependent. The effective sample size is proven to be a function of the overlap scalar, from which the range of the overlap scalar can also be derived.

Comments:	34 pages, 7 figs. Modifications include equation addition and figure 7 addition. I hope this version is more reader friendly
Subjects:	Chemical Physics (physics.chem-ph); Statistical Mechanics (cond-mat.stat-mech)
Cite as:	arXiv:1803.03123 [physics.chem-ph]
	(or arXiv:1803.03123v5 [physics.chem-ph] for this version)
	https://doi.org/10.48550/arXiv.1803.03123

Submission history

From: Zhaoxi Sun [view email]
[v1] Thu, 22 Feb 2018 11:56:43 UTC (1,477 KB)
[v2] Sat, 9 Jun 2018 11:59:38 UTC (1,597 KB)
[v3] Fri, 10 Aug 2018 06:46:17 UTC (1,639 KB)
[v4] Sun, 9 Sep 2018 23:21:14 UTC (1,656 KB)
[v5] Thu, 4 Oct 2018 14:19:08 UTC (1,718 KB)

Physics > Chemical Physics

Title:A theoretical interpretation of variance-based convergence criteria in perturbation-based theories

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Physics > Chemical Physics

Title:A theoretical interpretation of variance-based convergence criteria in perturbation-based theories

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators