Teaching AI to Teach: Leveraging Limited Human Salience Data Into Unlimited Saliency-Based Training

Crum, Colton R.; Boyd, Aidan; Bowyer, Kevin; Czajka, Adam

arXiv:2306.05527 (cs)

[Submitted on 8 Jun 2023 (v1), last revised 9 Nov 2023 (this version, v2)]

Abstract: Machine learning models have shown increased accuracy in classification tasks when the training process incorporates human perceptual information. A challenge in training such human-guided models, however, is the cost of collecting human salience annotations: annotating every image in a large training set can be prohibitively expensive. In this work, we use "teacher" models (trained on a small amount of human-annotated data) to annotate additional data by means of the teacher models' saliency maps. "Student" models are then trained on the resulting larger set of annotated training data. This approach makes it possible to supplement a limited number of human-supplied annotations with an arbitrarily large number of model-generated image annotations. We compare the accuracy achieved by our teacher-student training paradigm with two baselines: (1) training using all available human salience annotations, and (2) training using all available training data without human salience annotations. We use synthetic face detection and fake iris detection as example challenging problems, and report results across four model architectures (DenseNet, ResNet, Xception, and Inception) and two saliency estimation methods (CAM and RISE). Results show that our teacher-student training paradigm yields models that significantly exceed the performance of both baselines, demonstrating that our approach can usefully leverage a small amount of human annotations to generate salience maps for an arbitrary amount of additional training data.
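
The annotation step described in the abstract can be illustrated with a minimal sketch. It assumes the teacher ends in global average pooling followed by a linear classifier, so that a CAM is the classifier-weighted sum of the final feature maps. The function names, the min-max normalisation, and the choice of which class's weights to use are illustrative assumptions, not details taken from the paper.

```python
def class_activation_map(features, class_weights):
    """CAM for one class: weighted sum of K feature maps (each H x W),
    min-max normalised to [0, 1]. Normalisation is an assumption."""
    height, width = len(features[0]), len(features[0][0])
    cam = [[sum(w * fmap[y][x] for w, fmap in zip(class_weights, features))
            for x in range(width)]
           for y in range(height)]
    lo = min(min(row) for row in cam)
    hi = max(max(row) for row in cam)
    span = hi - lo
    if span == 0:
        # A flat map carries no salience information.
        return [[0.0] * width for _ in range(height)]
    return [[(v - lo) / span for v in row] for row in cam]


def assemble_salience(human_maps, teacher_features, fc_weights, labels):
    """Build one salience map per training image for the student:
    the human map where one exists, the teacher's CAM otherwise.
    human_maps is a dict {image_index: H x W map} (hypothetical layout)."""
    maps = []
    for i, (features, label) in enumerate(zip(teacher_features, labels)):
        if i in human_maps:
            maps.append(human_maps[i])
        else:
            maps.append(class_activation_map(features, fc_weights[label]))
    return maps


# Toy data: two images, K=2 feature maps of size 2x2, two classes.
toy_features = [[[1.0, 0.0], [0.0, 0.0]],
                [[0.0, 0.0], [0.0, 2.0]]]
toy_weights = [[1.0, 1.0], [0.5, 0.5]]
human = {0: [[1.0, 1.0], [0.0, 0.0]]}  # only image 0 has a human annotation
salience = assemble_salience(human, [toy_features, toy_features],
                             toy_weights, labels=[0, 0])
```

Here image 0 keeps its human annotation and image 1 receives a teacher-generated map, so the student sees a salience map for every image. How the student's loss then uses these maps is not specified in the abstract and is left out of the sketch.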

Comments:	17 pages, 8 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.05527 [cs.CV]
	(or arXiv:2306.05527v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.05527

Submission history

From: Colton Crum
[v1] Thu, 8 Jun 2023 19:55:44 UTC (11,388 KB)
[v2] Thu, 9 Nov 2023 18:15:05 UTC (11,406 KB)
