Scalable Image Coding for Humans and Machines

Choi, Hyomin; Bajic, Ivan V.

doi:10.1109/TIP.2022.3160602

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2107.08373 (eess)

[Submitted on 18 Jul 2021 (v1), last revised 13 Jan 2022 (this version, v2)]

Title:Scalable Image Coding for Humans and Machines

Authors:Hyomin Choi, Ivan V. Bajic

View PDF

Abstract:At present, and increasingly so in the future, much of the captured visual content will not be seen by humans. Instead, it will be used for automated machine vision analytics and may require occasional human viewing. Examples of such applications include traffic monitoring, visual surveillance, autonomous navigation, and industrial machine vision. To address such requirements, we develop an end-to-end learned image codec whose latent space is designed to support scalability from simpler to more complicated tasks. The simplest task is assigned to a subset of the latent space (the base layer), while more complicated tasks make use of additional subsets of the latent space, i.e., both the base and enhancement layer(s). For the experiments, we establish a 2-layer and a 3-layer model, each of which offers input reconstruction for human vision, plus machine vision task(s), and compare them with relevant benchmarks. The experiments show that our scalable codecs offer 37%-80% bitrate savings on machine vision tasks compared to best alternatives, while being comparable to state-of-the-art image codecs in terms of input reconstruction.

Comments:	Submitted for peer review to IEEE Transactions
Subjects:	Image and Video Processing (eess.IV)
Cite as:	arXiv:2107.08373 [eess.IV]
	(or arXiv:2107.08373v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2107.08373
Related DOI:	https://doi.org/10.1109/TIP.2022.3160602

Submission history

From: Hyomin Choi [view email]
[v1] Sun, 18 Jul 2021 06:05:56 UTC (8,717 KB)
[v2] Thu, 13 Jan 2022 07:08:18 UTC (10,362 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Scalable Image Coding for Humans and Machines

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Scalable Image Coding for Humans and Machines

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators