VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

Han, Xintong; Singh, Bharat; Morariu, Vlad I.; Davis, Larry S.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1512.03384 (cs)

[Submitted on 10 Dec 2015 (v1), last revised 10 Apr 2017 (this version, v3)]

Title:VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

Authors:Xintong Han, Bharat Singh, Vlad I. Morariu, Larry S. Davis

View PDF

Abstract:VRFP is a real-time video retrieval framework based on short text input queries, which obtains weakly labeled training images from the web after the query is known. The retrieved web images representing the query and each database video are treated as unordered collections of images, and each collection is represented using a single Fisher Vector built on CNN features. Our experiments show that a Fisher Vector is robust to noise present in web images and compares favorably in terms of accuracy to other standard representations. While a Fisher Vector can be constructed efficiently for a new query, matching against the test set is slow due to its high dimensionality. To perform matching in real-time, we present a lossless algorithm that accelerates the inner product computation between high dimensional Fisher Vectors. We prove that the expected number of multiplications required decreases quadratically with the sparsity of Fisher Vectors. We are not only able to construct and apply query models in real-time, but with the help of a simple re-ranking scheme, we also outperform state-of-the-art automatic retrieval methods by a significant margin on TRECVID MED13 (3.5%), MED14 (1.3%) and CCV datasets (5.2%). We also provide a direct comparison on standard datasets between two different paradigms for automatic video retrieval - zero-shot learning and on-the-fly retrieval.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1512.03384 [cs.CV]
	(or arXiv:1512.03384v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1512.03384

Submission history

From: Xintong Han [view email]
[v1] Thu, 10 Dec 2015 19:50:50 UTC (1,833 KB)
[v2] Thu, 7 Apr 2016 01:25:42 UTC (935 KB)
[v3] Mon, 10 Apr 2017 17:28:16 UTC (3,361 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators