StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

Xing, Fengchuang; Wang, Yuan-Gen; Tang, Weixuan; Zhu, Guopu; Kwong, Sam

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.12298 (cs)

[Submitted on 21 Jun 2023]

Title:StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

Authors:Fengchuang Xing, Yuan-Gen Wang, Weixuan Tang, Guopu Zhu, Sam Kwong

View PDF

Abstract:Self-attention based Transformer has achieved great success in many computer vision tasks. However, its application to video quality assessment (VQA) has not been satisfactory so far. Evaluating the quality of in-the-wild videos is challenging due to the unknown of pristine reference and shooting distortion. This paper presents a co-trained Space-Time Attention network for the VQA problem, termed StarVQA+. Specifically, we first build StarVQA+ by alternately concatenating the divided space-time attention. Then, to facilitate the training of StarVQA+, we design a vectorized regression loss by encoding the mean opinion score (MOS) to the probability vector and embedding a special token as the learnable variable of MOS, leading to better fitting of human's rating process. Finally, to solve the data hungry problem with Transformer, we propose to co-train the spatial and temporal attention weights using both images and videos. Various experiments are conducted on the de-facto in-the-wild video datasets, including LIVE-Qualcomm, LIVE-VQC, KoNViD-1k, YouTube-UGC, LSVQ, LSVQ-1080p, and DVL2021. Experimental results demonstrate the superiority of the proposed StarVQA+ over the state-of-the-art.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2306.12298 [cs.CV]
	(or arXiv:2306.12298v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.12298

Submission history

From: Yuan-Gen Wang [view email]
[v1] Wed, 21 Jun 2023 14:27:31 UTC (26,791 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:StarVQA+: Co-training Space-Time Attention for Video Quality Assessment

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators