Back to Search Start Over

Hierarchical denoising representation disentanglement and dual-channel cross-modal-context interaction for multimodal sentiment analysis.

Authors :
Li, Zuhe
Huang, Zhenwei
Pan, Yushan
Yu, Jun
Liu, Weihua
Chen, Haoran
Luo, Yiming
Wu, Di
Wang, Hao
Source :
Expert Systems with Applications. Oct2024:Part B, Vol. 252, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Multimodal sentiment analysis aims to extract sentiment cues from various modalities, such as textual, acoustic, and visual data, and manipulate them to determine the inherent sentiment polarity in the data. Despite significant achievements in multimodal sentiment analysis, challenges persist in addressing noise features in modal representations, eliminating substantial gaps in sentiment information among modal representations, and exploring contextual information that expresses different sentiments between modalities. To tackle these challenges, our paper proposes a new Multimodal Sentiment Analysis (MSA) framework. Firstly, we introduce the Hierarchical Denoising Representation Disentanglement module (HDRD), which employs hierarchical disentanglement techniques. This ensures the extraction of both common and private sentiment information while eliminating interference noise from modal representations. Furthermore, to address the uneven distribution of sentiment information among modalities, our Inter-Modal Representation Enhancement module (IMRE) enhances non-textual representations by extracting sentiment information related to non-textual representations from textual representations. Next, we introduce a new interaction mechanism, the Dual-Channel Cross-Modal Context Interaction module (DCCMCI). This module not only mines correlated contextual sentiment information within modalities but also explores positive and negative correlation contextual sentiment information between modalities. We conducted extensive experiments on two benchmark datasets, MOSI and MOSEI, and the results indicate that our proposed method offers state-of-the-art approaches. • Multimodal sentiment analysis framework using hierarchical denoising representation disentanglement. • Hierarchical Denoising Representation Disentanglement (HDRD) Module to generate noise-free modal representations. • Dual-channel Cross-modal-context Interaction (DCCMCI) Module for intra-modal and inter-modal contextual interaction. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
252
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
177753535
Full Text :
https://doi.org/10.1016/j.eswa.2024.124236