Progressive Scale-aware Network for Remote sensing Image Change Captioning

Liu, Chenyang; Yang, Jiajun; Qi, Zipeng; Zou, Zhengxia; Shi, Zhenwei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.00355 (cs)

[Submitted on 1 Mar 2023 (v1), last revised 18 Dec 2023 (this version, v2)]

Title:Progressive Scale-aware Network for Remote sensing Image Change Captioning

Authors:Chenyang Liu, Jiajun Yang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi

View PDF HTML (experimental)

Abstract:Remote sensing (RS) images contain numerous objects of different scales, which poses significant challenges for the RS image change captioning (RSICC) task to identify visual changes of interest in complex scenes and describe them via language. However, current methods still have some weaknesses in sufficiently extracting and utilizing multi-scale information. In this paper, we propose a progressive scale-aware network (PSNet) to address the problem. PSNet is a pure Transformer-based model. To sufficiently extract multi-scale visual features, multiple progressive difference perception (PDP) layers are stacked to progressively exploit the differencing features of bitemporal features. To sufficiently utilize the extracted multi-scale features for captioning, we propose a scale-aware reinforcement (SR) module and combine it with the Transformer decoding layer to progressively utilize the features from different PDP layers. Experiments show that the PDP layer and SR module are effective and our PSNet outperforms previous methods. Our code is public at this https URL

Comments:	IGARSS 2023 - 2023 IEEE International Geoscience and Remote Sensing Symposium
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.00355 [cs.CV]
	(or arXiv:2303.00355v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.00355

Submission history

From: Liu Chenyang [view email]
[v1] Wed, 1 Mar 2023 09:33:49 UTC (371 KB)
[v2] Mon, 18 Dec 2023 07:17:26 UTC (3,078 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Scale-aware Network for Remote sensing Image Change Captioning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Scale-aware Network for Remote sensing Image Change Captioning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators