Posterior Calibrated Training on Sentence Classification Tasks

Jung, Taehee; Kang, Dongyeop; Cheng, Hua; Mentch, Lucas; Schaaf, Thomas

arXiv:2004.14500 (cs)

[Submitted on 29 Apr 2020 (v1), last revised 1 May 2020 (this version, v2)]

Abstract: Most classification models work by first predicting a posterior probability distribution over all classes and then selecting the class with the largest estimated probability. In many settings, however, the quality of the posterior probability itself (e.g., a 65% chance of having diabetes) gives more reliable information than the final predicted class alone. When these methods are shown to be poorly calibrated, most fixes to date have relied on posterior calibration, which rescales the predicted probabilities but often has little impact on final classifications. Here we propose an end-to-end training procedure called posterior calibrated (PosCal) training that directly optimizes the objective while minimizing the difference between the predicted and empirical posterior probabilities. We show that PosCal not only helps reduce the calibration error but also improves task performance by penalizing drops in performance of both objectives. PosCal achieves about a 2.5% task performance gain and a 16.1% calibration error reduction on GLUE (Wang et al., 2018) compared to the baseline. On xSLUE (Kang and Hovy, 2019), it achieves comparable task performance with a 13.2% calibration error reduction, but does not outperform the two-stage calibration baseline. PosCal training can be easily extended to any type of classification task as a form of regularization term. PosCal also has the advantage that it incrementally tracks the statistics needed for the calibration objective during the training process, making efficient use of large training sets.
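The abstract describes the objective only at a high level: a task loss combined with a penalty on the gap between predicted and empirical posterior probabilities. The following is a minimal sketch of what such a combined loss could look like, not the authors' implementation. The binning scheme, the squared-difference penalty, the weight `lam`, and the names `empirical_posterior` and `combined_loss` are all assumptions made for illustration.

```python
import numpy as np

def empirical_posterior(probs, labels, num_bins=10):
    """For each class, bin examples by predicted probability and return,
    per example, the observed frequency of that class within its bin.
    (Assumed estimator; the abstract does not specify one.)"""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)
    n, k = probs.shape
    onehot = np.eye(k)[labels]
    # Bin index of every predicted probability; p == 1.0 goes to the last bin.
    bins = np.minimum((probs * num_bins).astype(int), num_bins - 1)
    q = np.zeros((n, k))
    for c in range(k):
        for b in range(num_bins):
            mask = bins[:, c] == b
            if mask.any():
                q[mask, c] = onehot[mask, c].mean()
    return q

def combined_loss(probs, labels, lam=1.0, num_bins=10):
    """Cross-entropy task loss plus lam times a squared-difference
    calibration penalty (hypothetical form of the regularization term)."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=int)
    n = probs.shape[0]
    task = -np.mean(np.log(probs[np.arange(n), labels] + 1e-12))
    q = empirical_posterior(probs, labels, num_bins)
    cal = np.mean(np.sum((probs - q) ** 2, axis=1))
    return task + lam * cal, task, cal
```

In this sketch, four examples predicted as `[0.75, 0.25]` with three of them labeled class 0 give a calibration term of zero, since the predicted probability matches the observed frequency in each bin. The sketch only evaluates the loss; actual training would need an autodiff framework, with the empirical estimate treated as a constant target and refreshed as training proceeds.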

Comments:	Accepted at ACL 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2004.14500 [cs.CL]
	(or arXiv:2004.14500v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2004.14500

Submission history

From: Jung Taehee
[v1] Wed, 29 Apr 2020 22:13:15 UTC (1,576 KB)
[v2] Fri, 1 May 2020 16:26:16 UTC (1,576 KB)
