DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Kabkab, Maya; Alavi, Azadeh; Chellappa, Rama


arXiv:1606.04232 (cs)

[Submitted on 14 Jun 2016]




Abstract: Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would significantly speed up the training process and possibly improve generalization. Motivated by this objective, we consider the task of adaptively finding concise training subsets which will be iteratively presented to the learner. We use convex optimization methods, based on an objective criterion and feedback from the current performance of the classifier, to efficiently identify informative samples to train on. We propose an algorithm to decompose the optimization problem into smaller per-class problems, which can be solved in parallel. We test our approach on standard classification tasks and demonstrate its effectiveness in decreasing the training set size without compromising performance. We also show that our approach can make the classifier more robust in the presence of label noise and class imbalance.
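
The abstract does not state the objective criterion or the convex program itself, so the following is only a rough sketch of the overall shape it describes: select a small subset per class, using feedback from the current classifier, and solve the per-class problems in parallel. As a stand-in for the paper's convex optimization, the sketch uses a simple greedy rule that adds a coverage gain (how well a sample represents its class) to the sample's current loss. The names `similarity`, `select_for_class`, `select_subset`, `budget_per_class`, and `tradeoff` are all hypothetical and do not come from the paper.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def similarity(a, b):
    """Similarity in (0, 1] derived from squared Euclidean distance (assumed choice)."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return 1.0 / (1.0 + d2)


def select_for_class(features, losses, budget, tradeoff=1.0):
    """Greedily pick up to `budget` local indices from the samples of one class.

    Score = coverage gain + tradeoff * current classifier loss.
    This greedy rule is a stand-in, NOT the convex objective used in the paper.
    """
    n = len(features)
    budget = min(budget, n)
    sim = [[similarity(features[i], features[j]) for j in range(n)] for i in range(n)]
    covered = [0.0] * n  # best similarity of each sample to the chosen subset so far
    chosen = []
    for _ in range(budget):
        best_i, best_gain = None, float("-inf")
        for i in range(n):
            if i in chosen:
                continue
            # How much better the class would be represented if i were added.
            gain = sum(max(sim[i][j] - covered[j], 0.0) for j in range(n))
            # Classifier feedback: prefer samples the model currently gets wrong.
            gain += tradeoff * losses[i]
            if gain > best_gain:
                best_i, best_gain = i, gain
        chosen.append(best_i)
        covered = [max(covered[j], sim[best_i][j]) for j in range(n)]
    return chosen


def select_subset(features, labels, losses, budget_per_class, tradeoff=1.0):
    """Split the selection by class and solve the per-class problems concurrently."""
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)

    def solve(item):
        _, idxs = item
        local = select_for_class(
            [features[i] for i in idxs],
            [losses[i] for i in idxs],
            budget_per_class,
            tradeoff,
        )
        return [idxs[k] for k in local]  # map local indices back to global ones

    with ThreadPoolExecutor() as pool:
        parts = list(pool.map(solve, sorted(by_class.items())))
    return sorted(i for part in parts for i in part)
```

In an iterative setting, one would presumably alternate between training on the selected subset, recomputing `losses` with the updated classifier, and calling `select_subset` again. A thread pool is used here only to show that the per-class problems are independent; for CPU-bound work in CPython, a process pool would be the more likely choice.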

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1606.04232 [cs.CV]
	(or arXiv:1606.04232v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1606.04232

Submission history

From: Maya Kabkab
[v1] Tue, 14 Jun 2016 07:38:13 UTC (922 KB)
