Deep Video Codec Control for Vision Models

Reich, Christoph; Debnath, Biplob; Patel, Deep; Prangemeier, Tim; Cremers, Daniel; Chakradhar, Srimat

doi:10.1109/CVPRW63382.2024.00582

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2308.16215 (eess)

[Submitted on 30 Aug 2023 (v1), last revised 16 Apr 2024 (this version, v6)]

Title:Deep Video Codec Control for Vision Models

Authors:Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar

View PDF

Abstract:Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constraints. However, standard video codecs (e.g., H.264) and their rate control modules aim to minimize video distortion w.r.t. human quality assessment. We demonstrate empirically that standard-coded videos vastly deteriorate the performance of deep vision models. To overcome the deterioration of vision performance, this paper presents the first end-to-end learnable deep video codec control that considers both bandwidth constraints and downstream deep vision performance, while adhering to existing standardization. We demonstrate that our approach better preserves downstream deep vision performance than traditional standard video coding.

Comments:	Accepted at CVPR 2024 Workshop on AI for Streaming (AIS)
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2308.16215 [eess.IV]
	(or arXiv:2308.16215v6 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2308.16215
Related DOI:	https://doi.org/10.1109/CVPRW63382.2024.00582

Submission history

From: Christoph Reich [view email]
[v1] Wed, 30 Aug 2023 16:44:38 UTC (24,041 KB)
[v2] Mon, 4 Sep 2023 09:09:31 UTC (24,041 KB)
[v3] Thu, 7 Sep 2023 14:41:22 UTC (24,041 KB)
[v4] Sat, 16 Sep 2023 12:05:09 UTC (24,041 KB)
[v5] Sat, 17 Feb 2024 12:37:23 UTC (24,050 KB)
[v6] Tue, 16 Apr 2024 13:32:25 UTC (24,024 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Deep Video Codec Control for Vision Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Deep Video Codec Control for Vision Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators