Aspect-based Sentiment Classification with Sequential Cross-modal Semantic Graph

Huang, Yufeng; Chen, Zhuo; Zhang, Wen; Chen, Jiaoyan; Pan, Jeff Z.; Yao, Zhen; Xie, Yujie; Chen, Huajun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2208.09417v1 (cs)

[Submitted on 19 Aug 2022 (this version), latest version 24 Jul 2023 (v2)]

Abstract: Multi-modal aspect-based sentiment classification (MABSC) is an emerging task that aims to classify the sentiment of a given target, such as a mentioned entity, in data with different modalities. In typical multi-modal data with text and image, previous approaches do not make full use of the fine-grained semantics of the image, especially in conjunction with the semantics of the text, and do not fully model the relationship between fine-grained image information and the target. As a result, they use the image insufficiently and identify fine-grained aspects and opinions inadequately. To tackle these limitations, we propose a new framework, SeqCSG, which comprises a method for constructing sequential cross-modal semantic graphs and an encoder-decoder model. Specifically, we extract fine-grained information from the original image, image caption, and scene graph, and regard it, together with tokens from the text, as elements of the cross-modal semantic graph. The cross-modal semantic graph is represented as a sequence with a multi-modal visible matrix indicating relationships between elements. To utilize the cross-modal semantic graph effectively, we propose an encoder-decoder method with a target prompt template. Experimental results show that our approach outperforms existing methods and achieves state-of-the-art results on two standard MABSC datasets. Further analysis demonstrates the effectiveness of each component and shows that our model can implicitly learn the correlation between the target and fine-grained information of the image.
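
The idea of representing a graph as a sequence plus a visible matrix can be illustrated with a minimal sketch. The abstract does not define the visibility rule, so everything here is an assumption made for illustration: the function name `build_visible_matrix`, the three example segments, and the rule that elements see each other when they share a segment or are joined by an explicit cross-segment edge. It is not the authors' implementation.

```python
from typing import List, Tuple


def build_visible_matrix(
    segments: List[List[str]],
    edges: List[Tuple[int, int]],
) -> Tuple[List[str], List[List[int]]]:
    """Flatten segments into one sequence and build a 0/1 visible matrix.

    Assumed rule (hypothetical, not taken from the paper): two elements
    are mutually visible if they belong to the same segment, or if an
    explicit edge links their positions in the flattened sequence.
    """
    tokens: List[str] = []
    segment_of: List[int] = []
    for seg_id, seg in enumerate(segments):
        for tok in seg:
            tokens.append(tok)
            segment_of.append(seg_id)

    n = len(tokens)
    # Same-segment elements are visible to each other (includes the diagonal).
    visible = [
        [1 if segment_of[i] == segment_of[j] else 0 for j in range(n)]
        for i in range(n)
    ]
    # Cross-segment links are added symmetrically.
    for i, j in edges:
        visible[i][j] = 1
        visible[j][i] = 1
    return tokens, visible


# Toy input: sentence tokens, caption tokens, and one scene-graph triple.
segments = [
    ["the", "coach", "smiled"],    # positions 0-2: text
    ["a", "man", "laughing"],      # positions 3-5: image caption
    ["man", "wearing", "jersey"],  # positions 6-8: scene-graph triple
]
# Hypothetical links: "coach" to caption "man", caption "man" to triple "man".
edges = [(1, 4), (4, 6)]

tokens, visible = build_visible_matrix(segments, edges)
for tok, row in zip(tokens, visible):
    print(f"{tok:>8} {row}")
```

In an encoder, a matrix like this would typically be turned into an attention mask so that each element attends only to the positions marked visible; how SeqCSG does this is described in the paper itself, not in this sketch.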

Comments:	Work in progress
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.09417 [cs.CV]
	(or arXiv:2208.09417v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2208.09417

Submission history

From: Zhuo Chen
[v1] Fri, 19 Aug 2022 16:04:29 UTC (6,835 KB)
[v2] Mon, 24 Jul 2023 03:06:15 UTC (8,169 KB)
