TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective

Dan, Jun; Liu, Yang; Xie, Haoyu; Deng, Jiankang; Xie, Haoran; Xie, Xuansong; Sun, Baigui

arXiv:2308.10133 (cs)

[Submitted on 20 Aug 2023]

Abstract: Vision Transformers (ViTs) have demonstrated powerful representation ability in various visual tasks thanks to their intrinsic data-hungry nature. However, we unexpectedly find that ViTs perform poorly when applied to face recognition (FR) scenarios with extremely large datasets. We investigate the reasons for this phenomenon and discover that existing data augmentation approaches and hard sample mining strategies are incompatible with ViT-based FR backbones, because they are not tailored to preserve facial structural information or to exploit the information in each local token. To remedy these problems, this paper proposes a superior FR model called TransFace, which employs a patch-level data augmentation strategy named DPAP and a hard sample mining strategy named EHSM. Specifically, DPAP randomly perturbs the amplitude information of dominant patches to expand sample diversity, which effectively alleviates the overfitting problem in ViTs. EHSM utilizes the information entropy in the local tokens to dynamically adjust the importance weights of easy and hard samples during training, leading to more stable predictions. Experiments on several benchmarks demonstrate the superiority of TransFace. Code and models are available at this https URL.
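
The entropy-based weighting idea behind EHSM can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the formula, so the function names (`sample_entropy`, `importance_weights`), the use of a softmax over per-token scores, and the choice to scale weights by entropy relative to the batch mean are all assumptions made for illustration. In particular, the direction of the mapping (whether high-entropy or low-entropy samples count as "hard") is assumed here and may differ in the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def shannon_entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def sample_entropy(token_logits):
    """Sum of per-token entropies for one sample.

    token_logits: list of local tokens, each a list of scores.
    Treating token scores as logits is an assumption of this sketch.
    """
    return sum(shannon_entropy(softmax(t)) for t in token_logits)


def importance_weights(batch_token_logits):
    """Hypothetical weighting: entropy relative to the batch mean.

    The weights average to 1 over the batch, so the overall loss
    scale is unchanged while individual samples are re-weighted.
    """
    entropies = [sample_entropy(s) for s in batch_token_logits]
    mean_h = sum(entropies) / len(entropies)
    if mean_h == 0.0:
        # Degenerate batch: fall back to uniform weights.
        return [1.0 for _ in entropies]
    return [h / mean_h for h in entropies]


# Two samples with two local tokens each: one with flat token
# scores (high entropy), one with sharply peaked scores (low entropy).
flat_sample = [[0.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
peaked_sample = [[10.0, 0.0, 0.0, 0.0], [10.0, 0.0, 0.0, 0.0]]
weights = importance_weights([flat_sample, peaked_sample])
```

In a training loop, weights of this kind would multiply each sample's loss term before averaging. Normalizing by the batch mean is one simple design choice that keeps the weights bounded and comparable across batches.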

Comments:	Accepted by ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.10133 [cs.CV]
	(or arXiv:2308.10133v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.10133

Submission history

From: Jun Dan
[v1] Sun, 20 Aug 2023 02:02:16 UTC (38,540 KB)
