Measuring Model Biases in the Absence of Ground Truth

Aka, Osman; Burke, Ken; Bäuerle, Alex; Greer, Christina; Mitchell, Margaret

doi:10.1145/3461702.3462557

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.03417 (cs)

[Submitted on 5 Mar 2021 (v1), last revised 6 Jun 2021 (this version, v3)]

Title:Measuring Model Biases in the Absence of Ground Truth

Authors:Osman Aka, Ken Burke, Alex Bäuerle, Christina Greer, Margaret Mitchell

View PDF

Abstract:The measurement of bias in machine learning often focuses on model performance across identity subgroups (such as man and woman) with respect to groundtruth labels. However, these methods do not directly measure the associations that a model may have learned, for example between labels and identity subgroups. Further, measuring a model's bias requires a fully annotated evaluation dataset which may not be easily available in practice. We present an elegant mathematical solution that tackles both issues simultaneously, using image classification as a working example. By treating a classification model's predictions for a given image as a set of labels analogous to a bag of words, we rank the biases that a model has learned with respect to different identity labels. We use (man, woman) as a concrete example of an identity label set (although this set need not be binary), and present rankings for the labels that are most biased towards one identity or the other. We demonstrate how the statistical properties of different association metrics can lead to different rankings of the most "gender biased" labels, and conclude that normalized pointwise mutual information (nPMI) is most useful in practice. Finally, we announce an open-sourced nPMI visualization tool using TensorBoard.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2103.03417 [cs.CV]
	(or arXiv:2103.03417v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.03417
Related DOI:	https://doi.org/10.1145/3461702.3462557

Submission history

From: Ken Burke [view email]
[v1] Fri, 5 Mar 2021 01:23:22 UTC (1,462 KB)
[v2] Tue, 11 May 2021 15:55:18 UTC (8,057 KB)
[v3] Sun, 6 Jun 2021 17:09:58 UTC (8,080 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Measuring Model Biases in the Absence of Ground Truth

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Measuring Model Biases in the Absence of Ground Truth

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators