Uformer-ICS: A U-Shaped Transformer for Image Compressive Sensing Service

Zhang, Kuiyuan; Hua, Zhongyun; Li, Yuanman; Zhang, Yushu; Zhou, Yicong

doi:10.1109/TSC.2023.3334446

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2209.01763 (eess)

[Submitted on 5 Sep 2022 (v1), last revised 2 Jul 2024 (this version, v2)]

Title:Uformer-ICS: A U-Shaped Transformer for Image Compressive Sensing Service

Authors:Kuiyuan Zhang, Zhongyun Hua, Yuanman Li, Yushu Zhang, Yicong Zhou

View PDF HTML (experimental)

Abstract:Many service computing applications require real-time dataset collection from multiple devices, necessitating efficient sampling techniques to reduce bandwidth and storage pressure. Compressive sensing (CS) has found wide-ranging applications in image acquisition and reconstruction. Recently, numerous deep-learning methods have been introduced for CS tasks. However, the accurate reconstruction of images from measurements remains a significant challenge, especially at low sampling rates. In this paper, we propose Uformer-ICS as a novel U-shaped transformer for image CS tasks by introducing inner characteristics of CS into transformer architecture. To utilize the uneven sparsity distribution of image blocks, we design an adaptive sampling architecture that allocates measurement resources based on the estimated block sparsity, allowing the compressed results to retain maximum information from the original image. Additionally, we introduce a multi-channel projection (MCP) module inspired by traditional CS optimization methods. By integrating the MCP module into the transformer blocks, we construct projection-based transformer blocks, and then form a symmetrical reconstruction model using these blocks and residual convolutional blocks. Therefore, our reconstruction model can simultaneously utilize the local features and long-range dependencies of image, and the prior projection knowledge of CS theory.
Experimental results demonstrate its significantly better reconstruction performance than state-of-the-art deep learning-based CS methods.

Subjects:	Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2209.01763 [eess.IV]
	(or arXiv:2209.01763v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2209.01763
Related DOI:	https://doi.org/10.1109/TSC.2023.3334446

Submission history

From: Zhongyun Hua [view email]
[v1] Mon, 5 Sep 2022 04:52:12 UTC (6,238 KB)
[v2] Tue, 2 Jul 2024 02:26:09 UTC (22,391 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Uformer-ICS: A U-Shaped Transformer for Image Compressive Sensing Service

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Uformer-ICS: A U-Shaped Transformer for Image Compressive Sensing Service

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators