General Bayesian Loss Function Selection and the use of Improper Models

Jewson, Jack; Rossell, David

Statistics > Methodology

arXiv:2106.01214 (stat)

[Submitted on 2 Jun 2021 (v1), last revised 28 Mar 2022 (this version, v2)]

Title:General Bayesian Loss Function Selection and the use of Improper Models

Authors:Jack Jewson, David Rossell

View PDF

Abstract:Statisticians often face the choice between using probability models or a paradigm defined by minimising a loss function. Both approaches are useful and, if the loss can be re-cast into a proper probability model, there are many tools to decide which model or loss is more appropriate for the observed data, in the sense of explaining the data's nature. However, when the loss leads to an improper model, there are no principled ways to guide this choice. We address this task by combining the Hyvärinen score, which naturally targets infinitesimal relative probabilities, and general Bayesian updating, which provides a unifying framework for inference on losses and models. Specifically we propose the H-score, a general Bayesian selection criterion and prove that it consistently selects the (possibly improper) model closest to the data-generating truth in Fisher's divergence. We also prove that an associated H-posterior consistently learns optimal hyper-parameters featuring in loss functions, including a challenging tempering parameter in generalised Bayesian inference. As salient examples, we consider robust regression and non-parametric density estimation where popular loss functions define improper models for the data and hence cannot be dealt with using standard model selection tools. These examples illustrate advantages in robustness-efficiency trade-offs and provide a Bayesian implementation for kernel density estimation, opening a new avenue for Bayesian non-parametrics.

Comments:	Keywords: Loss functions; Improper models; General Bayes; Hyvärinen score; Robust regression; Kernel density estimation
Subjects:	Methodology (stat.ME)
Cite as:	arXiv:2106.01214 [stat.ME]
	(or arXiv:2106.01214v2 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2106.01214

Submission history

From: Jack Jewson [view email]
[v1] Wed, 2 Jun 2021 15:05:42 UTC (11,971 KB)
[v2] Mon, 28 Mar 2022 09:05:31 UTC (13,538 KB)

Statistics > Methodology

Title:General Bayesian Loss Function Selection and the use of Improper Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:General Bayesian Loss Function Selection and the use of Improper Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators