Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

Biffi, Carlo; McDonagh, Steven; Torr, Philip; Leonardis, Ales; Parisot, Sarah

Computer Science > Computer Vision and Pattern Recognition

arXiv:2008.09694 (cs)

[Submitted on 21 Aug 2020 (v1), last revised 26 Aug 2020 (this version, v2)]

Title:Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

Authors:Carlo Biffi, Steven McDonagh, Philip Torr, Ales Leonardis, Sarah Parisot

View PDF

Abstract:Object detection has witnessed significant progress by relying on large, manually annotated datasets. Annotating such datasets is highly time consuming and expensive, which motivates the development of weakly supervised and few-shot object detection methods. However, these methods largely underperform with respect to their strongly supervised counterpart, as weak training signals \emph{often} result in partial or oversized detections. Towards solving this problem we introduce, for the first time, an online annotation module (OAM) that learns to generate a many-shot set of \emph{reliable} annotations from a larger volume of weakly labelled images. Our OAM can be jointly trained with any fully supervised two-stage object detection method, providing additional training annotations on the fly. This results in a fully end-to-end strategy that only requires a low-shot set of fully annotated images. The integration of the OAM with Fast(er) R-CNN improves their performance by $17\%$ mAP, $9\%$ AP50 on PASCAL VOC 2007 and MS-COCO benchmarks, and significantly outperforms competing methods using mixed supervision.

Comments:	Accepted at ECCV 2020. Camera-ready version and Appendices
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2008.09694 [cs.CV]
	(or arXiv:2008.09694v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.09694

Submission history

From: Carlo Biffi [view email]
[v1] Fri, 21 Aug 2020 22:06:43 UTC (68,372 KB)
[v2] Wed, 26 Aug 2020 17:26:13 UTC (68,372 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators