Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis

Xinyu Feng; Yuming Lin; Lihua He; You Li; Liang Chang; Ya Zhou

doi:10.18653/v1/2024.findings-emnlp.865

Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis

Xinyu Feng, Yuming Lin, Lihua He, You Li, Liang Chang, Ya Zhou

Abstract

Multimodal Sentiment Analysis (MSA) utilizes multimodal data to infer the users’ sentiment. Previous methods focus on equally treating the contribution of each modality or statically using text as the dominant modality to conduct interaction, which neglects the situation where each modality may become dominant. In this paper, we propose a Knowledge-Guided Dynamic Modality Attention Fusion Framework (KuDA) for multimodal sentiment analysis. KuDA uses sentiment knowledge to guide the model dynamically selecting the dominant modality and adjusting the contributions of each modality. In addition, with the obtained multimodal representation, the model can further highlight the contribution of dominant modality through the correlation evaluation loss. Extensive experiments on four MSA benchmark datasets indicate that KuDA achieves state-of-the-art performance and is able to adapt to different scenarios of dominant modality.

Anthology ID:: 2024.findings-emnlp.865
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2024
Month:: November
Year:: 2024
Address:: Miami, Florida, USA
Editors:: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 14755–14766
Language:
URL:: https://aclanthology.org/2024.findings-emnlp.865
DOI:: 10.18653/v1/2024.findings-emnlp.865
Bibkey:
Cite (ACL):: Xinyu Feng, Yuming Lin, Lihua He, You Li, Liang Chang, and Ya Zhou. 2024. Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 14755–14766, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):: Knowledge-Guided Dynamic Modality Attention Fusion Framework for Multimodal Sentiment Analysis (Feng et al., Findings 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.findings-emnlp.865.pdf

PDF Cite Search