Multiview Contrastive Learning for Completely Blind Video Quality Assessment of User Generated Content

Mitra, Shankhanil; Soundararajan, Rajiv

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2207.06148 (eess)

[Submitted on 13 Jul 2022 (v1), last revised 23 Jun 2024 (this version, v2)]

Title:Multiview Contrastive Learning for Completely Blind Video Quality Assessment of User Generated Content

Authors:Shankhanil Mitra, Rajiv Soundararajan

View PDF HTML (experimental)

Abstract:Completely blind video quality assessment (VQA) refers to a class of quality assessment methods that do not use any reference videos, human opinion scores or training videos from the target database to learn a quality model. The design of this class of methods is particularly important since it can allow for superior generalization in performance across various datasets. We consider the design of completely blind VQA for user generated content. While several deep feature extraction methods have been considered in supervised and weakly supervised settings, such approaches have not been studied in the context of completely blind VQA. We bridge this gap by presenting a self-supervised multiview contrastive learning framework to learn spatio-temporal quality representations. In particular, we capture the common information between frame differences and frames by treating them as a pair of views and similarly obtain the shared representations between frame differences and optical flow. The resulting features are then compared with a corpus of pristine natural video patches to predict the quality of the distorted video. Detailed experiments on multiple camera captured VQA datasets reveal the superior performance of our method over other features when evaluated without training on human scores.

Subjects:	Image and Video Processing (eess.IV)
Cite as:	arXiv:2207.06148 [eess.IV]
	(or arXiv:2207.06148v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2207.06148

Submission history

From: Shankhanil Mitra [view email]
[v1] Wed, 13 Jul 2022 12:16:33 UTC (4,408 KB)
[v2] Sun, 23 Jun 2024 13:24:58 UTC (4,408 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multiview Contrastive Learning for Completely Blind Video Quality Assessment of User Generated Content

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multiview Contrastive Learning for Completely Blind Video Quality Assessment of User Generated Content

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators