ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing

Wang, Yuduo; Yu, Weikang; Kopp, Michael; Ghamisi, Pedram

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.10047 (cs)

[Submitted on 13 Oct 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing

Authors:Yuduo Wang, Weikang Yu, Michael Kopp, Pedram Ghamisi

View PDF HTML (experimental)

Abstract:Recent advancements in Remote Sensing (RS) for Change Detection (CD) and Change Captioning (CC) have seen substantial success by adopting deep learning techniques. Despite these advances, existing methods often handle CD and CC tasks independently, leading to inefficiencies from the absence of synergistic processing. In this paper, we present ChangeMinds, a novel unified multi-task framework that concurrently optimizes CD and CC processes within a single, end-to-end model. We propose the change-aware long short-term memory module (ChangeLSTM) to effectively capture complex spatiotemporal dynamics from extracted bi-temporal deep features, enabling the generation of universal change-aware representations that effectively serve both CC and CD tasks. Furthermore, we introduce a multi-task predictor with a cross-attention mechanism that enhances the interaction between image and text features, promoting efficient simultaneous learning and processing for both tasks. Extensive evaluations on the LEVIR-MCI dataset, alongside other standard benchmarks, show that ChangeMinds surpasses existing methods in multi-task learning settings and markedly improves performance in individual CD and CC tasks. Codes and pre-trained models will be available online.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.10047 [cs.CV]
	(or arXiv:2410.10047v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.10047

Submission history

From: Yuduo Wang [view email]
[v1] Sun, 13 Oct 2024 23:43:10 UTC (2,583 KB)
[v2] Tue, 15 Oct 2024 11:00:13 UTC (2,584 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators