On the accuracy of self-normalized log-linear models

Andreas, Jacob; Rabinovich, Maxim; Klein, Dan; Jordan, Michael I.

Statistics > Machine Learning

arXiv:1506.04147 (stat)

[Submitted on 12 Jun 2015 (v1), last revised 18 Jun 2015 (this version, v2)]

Title:On the accuracy of self-normalized log-linear models

Authors:Jacob Andreas, Maxim Rabinovich, Dan Klein, Michael I. Jordan

View PDF

Abstract:Calculation of the log-normalizer is a major computational obstacle in applications of log-linear models with large output spaces. The problem of fast normalizer computation has therefore attracted significant attention in the theoretical and applied machine learning literature. In this paper, we analyze a recently proposed technique known as "self-normalization", which introduces a regularization term in training to penalize log normalizers for deviating from zero. This makes it possible to use unnormalized model scores as approximate probabilities. Empirical evidence suggests that self-normalization is extremely effective, but a theoretical understanding of why it should work, and how generally it can be applied, is largely lacking. We prove generalization bounds on the estimated variance of normalizers and upper bounds on the loss in accuracy due to self-normalization, describe classes of input distributions that self-normalize easily, and construct explicit examples of high-variance input distributions. Our theoretical results make predictions about the difficulty of fitting self-normalized models to several classes of distributions, and we conclude with empirical validation of these predictions.

Subjects:	Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG); Methodology (stat.ME)
Cite as:	arXiv:1506.04147 [stat.ML]
	(or arXiv:1506.04147v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1506.04147

Submission history

From: Jacob Andreas [view email]
[v1] Fri, 12 Jun 2015 20:00:29 UTC (1,017 KB)
[v2] Thu, 18 Jun 2015 15:22:50 UTC (1,017 KB)

Statistics > Machine Learning

Title:On the accuracy of self-normalized log-linear models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:On the accuracy of self-normalized log-linear models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators