Multi-rate adaptive transform coding for video compression

Duong, Lyndon R.; Li, Bohan; Chen, Cheng; Han, Jingning

arXiv:2210.14308 (eess)

[Submitted on 25 Oct 2022 (v1), last revised 18 Feb 2023 (this version, v2)]

Abstract: Contemporary lossy image and video coding standards rely on transform coding, the process through which pixels are mapped to an alternative representation to facilitate efficient data compression. Despite the impressive performance of end-to-end optimized compression with deep neural networks, the high computational and space demands of these models have prevented them from superseding the relatively simple transform coding found in conventional video codecs. In this study, we propose learned transforms and entropy coding that may serve either as (non)linear drop-in replacements for, or as enhancements to, the linear transforms in existing codecs. These transforms can be multi-rate, allowing a single model to operate along the entire rate-distortion curve. To demonstrate the utility of our framework, we augmented the DCT with learned quantization matrices and adaptive entropy coding to compress intra-frame AV1 block prediction residuals. We report substantial BD-rate and perceptual quality improvements over more complex nonlinear transforms at a fraction of the computational cost.
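
To make the transform coding pipeline in the abstract concrete, the following is a minimal sketch of block-based DCT coding with a quantization matrix and a single rate-control scalar. It is not the authors' method: the quantization matrix `QMATRIX` is hand-picked rather than learned, the `rate_scale` parameter is a hypothetical stand-in for a multi-rate mechanism, the 8x8 block size and the synthetic `BLOCK` are assumptions, and entropy coding is omitted entirely.

```python
import math

N = 8  # block size; an assumption for illustration


def dct_matrix(n):
    """Return the orthonormal DCT-II basis as n rows of n samples."""
    basis = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        basis.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                      for i in range(n)])
    return basis


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


D = dct_matrix(N)
D_T = transpose(D)

# Hypothetical quantization matrix: coarser steps at higher frequencies.
# In the paper these values are learned; here they are hand-picked.
QMATRIX = [[1.0 + u + v for v in range(N)] for u in range(N)]


def encode(block, rate_scale):
    """Forward 2D DCT, then uniform quantization with per-coefficient steps."""
    coeffs = matmul(matmul(D, block), D_T)
    return [[round(c / (q * rate_scale)) for c, q in zip(c_row, q_row)]
            for c_row, q_row in zip(coeffs, QMATRIX)]


def decode(symbols, rate_scale):
    """Dequantize the integer symbols, then apply the inverse 2D DCT."""
    coeffs = [[s * q * rate_scale for s, q in zip(s_row, q_row)]
              for s_row, q_row in zip(symbols, QMATRIX)]
    return matmul(matmul(D_T, coeffs), D)


def mse(a, b):
    """Mean squared error between two N x N blocks."""
    return sum((x - y) ** 2
               for ra, rb in zip(a, b) for x, y in zip(ra, rb)) / (N * N)


# Synthetic stand-in for a prediction residual block (not real AV1 data).
BLOCK = [[10.0 * math.sin(0.5 * i) + 3.0 * j - 12.0 for j in range(N)]
         for i in range(N)]

if __name__ == "__main__":
    for rate_scale in (0.1, 1.0, 8.0, 64.0):
        symbols = encode(BLOCK, rate_scale)
        nonzero = sum(1 for row in symbols for s in row if s != 0)
        err = mse(BLOCK, decode(symbols, rate_scale))
        print(f"scale={rate_scale:5.1f}  nonzero={nonzero:2d}  mse={err:.3f}")
```

Sweeping `rate_scale` moves the same transform along a rate-distortion trade-off: larger values zero out more coefficients (a crude proxy for lower rate) at the cost of higher reconstruction error. A real codec would follow quantization with entropy coding, which this sketch does not model.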

Comments: 5 pages, 4 figures, IEEE ICASSP 2023
Subjects: Image and Video Processing (eess.IV)
Cite as: arXiv:2210.14308 [eess.IV] (or arXiv:2210.14308v2 [eess.IV] for this version)
DOI: https://doi.org/10.48550/arXiv.2210.14308
Journal reference: 2023 IEEE International Conference on Acoustics, Speech and Signal Processing

Submission history

From: Lyndon Duong
[v1] Tue, 25 Oct 2022 20:11:42 UTC (337 KB)
[v2] Sat, 18 Feb 2023 00:10:04 UTC (337 KB)
