Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers

Jin, Gaojie; Huang, Tianjin; Mu, Ronghui; Huang, Xiaowei

Computer Science > Machine Learning

arXiv:2503.17172 (cs)

[Submitted on 21 Mar 2025]

Title:Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers

Authors:Gaojie Jin, Tianjin Huang, Ronghui Mu, Xiaowei Huang

View PDF HTML (experimental)

Abstract:Recent studies have identified a critical challenge in deep neural networks (DNNs) known as ``robust fairness", where models exhibit significant disparities in robust accuracy across different classes. While prior work has attempted to address this issue in adversarial robustness, the study of worst-class certified robustness for smoothed classifiers remains unexplored. Our work bridges this gap by developing a PAC-Bayesian bound for the worst-class error of smoothed classifiers. Through theoretical analysis, we demonstrate that the largest eigenvalue of the smoothed confusion matrix fundamentally influences the worst-class error of smoothed classifiers. Based on this insight, we introduce a regularization method that optimizes the largest eigenvalue of smoothed confusion matrix to enhance worst-class accuracy of the smoothed classifier and further improve its worst-class certified robustness. We provide extensive experimental validation across multiple datasets and model architectures to demonstrate the effectiveness of our approach.

Comments:	Under Review
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2503.17172 [cs.LG]
	(or arXiv:2503.17172v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.17172

Submission history

From: Gaojie Jin [view email]
[v1] Fri, 21 Mar 2025 14:18:18 UTC (156 KB)

Computer Science > Machine Learning

Title:Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators