Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases

Moayeri, Mazda; Wang, Wenxiao; Singla, Sahil; Feizi, Soheil

arXiv:2212.02648 (cs)

[Submitted on 5 Dec 2022 (v1), last revised 30 Oct 2023 (this version, v3)]

Abstract: We present a simple but effective method to measure and mitigate model biases caused by reliance on spurious cues. Instead of requiring costly changes to one's data or model training, our method better utilizes the data one already has by sorting them. Specifically, we rank images within their classes based on spuriosity (the degree to which common spurious cues are present), proxied via deep neural features of an interpretable network. With spuriosity rankings, it is easy to identify minority subpopulations (i.e. low spuriosity images) and assess model bias as the gap in accuracy between high and low spuriosity images. One can even efficiently remove a model's bias at little cost to accuracy by finetuning its classification head on low spuriosity images, resulting in fairer treatment of samples regardless of spuriosity. We demonstrate our method on ImageNet, annotating $5000$ class-feature dependencies ($630$ of which we find to be spurious) and generating a dataset of $325k$ soft segmentations for these features along the way. Having computed spuriosity rankings via the identified spurious neural features, we assess biases for $89$ diverse models and find that class-wise biases are highly correlated across models. Our results suggest that model bias due to spurious feature reliance is influenced far more by what the model is trained on than how it is trained.
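
The ranking and bias measurement described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes spuriosity is proxied by the mean activation of a class's spurious features, and the function names, array layout, and toy values below are all hypothetical.

```python
import numpy as np

def spuriosity_ranking(activations, labels, cls):
    """Indices of images in class `cls`, sorted from lowest to highest spuriosity.

    activations: (n_images, n_spurious_features) array of feature activations
                 (hypothetical layout; a real pipeline would use per-class features).
    """
    idx = np.flatnonzero(labels == cls)
    # Assumed proxy: average activation over the spurious features.
    scores = activations[idx].mean(axis=1)
    return idx[np.argsort(scores, kind="stable")]

def spurious_gap(ranked_idx, correct, k):
    """Accuracy on the k most spurious images minus accuracy on the k least spurious."""
    low_acc = correct[ranked_idx[:k]].mean()
    high_acc = correct[ranked_idx[-k:]].mean()
    return high_acc - low_acc

# Toy data (made up for illustration): six images, one spurious feature.
activations = np.array([[0.9], [0.1], [0.5], [0.8], [0.2], [0.7]])
labels = np.array([0, 0, 0, 0, 1, 1])
correct = np.array([1.0, 0.0, 1.0, 1.0, 1.0, 1.0])  # 1.0 = model prediction was right

ranked = spuriosity_ranking(activations, labels, cls=0)
gap = spurious_gap(ranked, correct, k=2)
low_spuriosity_subset = ranked[:2]  # candidate images for finetuning the classification head
```

In this sketch a positive gap means the model is more accurate when the spurious cue is present; the low spuriosity subset is the kind of data the abstract proposes finetuning the classification head on.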

Comments:	Accepted to NeurIPS '23 (Spotlight). Camera ready version
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
Cite as:	arXiv:2212.02648 [cs.CV]
	(or arXiv:2212.02648v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.02648

Submission history

From: Mazda Moayeri
[v1] Mon, 5 Dec 2022 23:15:43 UTC (15,929 KB)
[v2] Thu, 5 Oct 2023 17:59:06 UTC (13,172 KB)
[v3] Mon, 30 Oct 2023 18:22:35 UTC (13,263 KB)
