VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Xu, Yiran; Park, Taesung; Zhang, Richard; Zhou, Yang; Shechtman, Eli; Liu, Feng; Huang, Jia-Bin; Liu, Difan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.12388 (cs)

[Submitted on 18 Apr 2024 (v1), last revised 1 May 2024 (this version, v2)]

Title:VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Authors:Yiran Xu, Taesung Park, Richard Zhang, Yang Zhou, Eli Shechtman, Feng Liu, Jia-Bin Huang, Difan Liu

View PDF HTML (experimental)

Abstract:Video super-resolution (VSR) approaches have shown impressive temporal consistency in upsampled videos. However, these approaches tend to generate blurrier results than their image counterparts as they are limited in their generative capability. This raises a fundamental question: can we extend the success of a generative image upsampler to the VSR task while preserving the temporal consistency? We introduce VideoGigaGAN, a new generative VSR model that can produce videos with high-frequency details and temporal consistency. VideoGigaGAN builds upon a large-scale image upsampler -- GigaGAN. Simply inflating GigaGAN to a video model by adding temporal modules produces severe temporal flickering. We identify several key issues and propose techniques that significantly improve the temporal consistency of upsampled videos. Our experiments show that, unlike previous VSR methods, VideoGigaGAN generates temporally consistent videos with more fine-grained appearance details. We validate the effectiveness of VideoGigaGAN by comparing it with state-of-the-art VSR models on public datasets and showcasing video results with $8\times$ super-resolution.

Comments:	project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.12388 [cs.CV]
	(or arXiv:2404.12388v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.12388

Submission history

From: Yiran Xu [view email]
[v1] Thu, 18 Apr 2024 17:59:53 UTC (21,660 KB)
[v2] Wed, 1 May 2024 21:41:30 UTC (21,661 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators