Decoupling and Reconstructing: A Multimodal Sentiment Analysis Framework Towards Robustness
Mingzheng Yang, Kai Zhang, Yuyang Ye, Yanghai Zhang, Runlong Yu, Min Hou
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Main Track. Pages 6803-6811.
https://doi.org/10.24963/ijcai.2025/757
Multimodal sentiment analysis (MSA) has shown promising results, but its dependence on complete and aligned multimodal sequences poses significant challenges in real-world applications. While existing approaches attempt to handle missing modalities through feature reconstruction, they often neglect the complex interplay between homogeneous and heterogeneous relationships in multimodal features. To address this problem, we propose Decoupled-Adaptive Reconstruction (DAR), a novel framework built on two key components: (1) a mutual information-based decoupling module that decomposes features into common and independent representations, and (2) a reconstruction module that processes these decoupled features separately before fusing them for downstream tasks. Extensive experiments on two benchmark datasets demonstrate that DAR significantly outperforms existing methods in both modality reconstruction and sentiment analysis, particularly in scenarios with missing or unaligned modalities. On the MOSEI dataset, DAR achieves a 2.21% gain in binary classification accuracy and a 3.9% reduction in regression error over state-of-the-art baselines.
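To make the decouple-then-reconstruct idea in the abstract concrete, the sketch below illustrates one plausible reading of it: each modality is projected into a shared (common) space and a modality-specific (independent) space, each modality is reconstructed from its decoupled parts, and the decoupled features are concatenated for fusion. All module names, feature dimensions, and the cosine-orthogonality penalty (used here as a cheap stand-in for the paper's mutual-information objective) are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of decoupling + reconstruction for robust MSA.
import torch
import torch.nn as nn
import torch.nn.functional as F


class DecoupleReconstruct(nn.Module):
    """Decompose each modality into common + independent representations,
    reconstruct each modality from them, then fuse for downstream tasks."""

    def __init__(self, dims: dict, hidden: int = 128):
        super().__init__()
        self.modalities = list(dims)
        # Common projections map every modality into one shared space.
        self.common = nn.ModuleDict({m: nn.Linear(d, hidden) for m, d in dims.items()})
        # Independent projections keep modality-specific information.
        self.private = nn.ModuleDict({m: nn.Linear(d, hidden) for m, d in dims.items()})
        # Reconstruction heads rebuild each modality from its decoupled parts.
        self.decoder = nn.ModuleDict({m: nn.Linear(2 * hidden, d) for m, d in dims.items()})
        self.fuse = nn.Linear(2 * hidden * len(dims), hidden)

    def forward(self, feats: dict):
        common, private, recon = {}, {}, {}
        for m, x in feats.items():
            common[m] = self.common[m](x)
            private[m] = self.private[m](x)
            recon[m] = self.decoder[m](torch.cat([common[m], private[m]], dim=-1))
        fused = self.fuse(torch.cat(
            [torch.cat([common[m], private[m]], dim=-1) for m in self.modalities], dim=-1))
        return fused, recon, common, private

    @staticmethod
    def decouple_loss(common, private):
        # Surrogate for mutual-information minimisation: penalise cosine
        # similarity between common and independent representations.
        return sum(
            F.cosine_similarity(common[m], private[m], dim=-1).abs().mean()
            for m in common) / len(common)


if __name__ == "__main__":
    # Feature sizes chosen only for illustration.
    dims = {"text": 300, "audio": 74, "vision": 35}
    model = DecoupleReconstruct(dims)
    batch = {m: torch.randn(4, d) for m, d in dims.items()}
    fused, recon, c, p = model(batch)
    loss = sum(F.mse_loss(recon[m], batch[m]) for m in dims) + model.decouple_loss(c, p)
    print(fused.shape, loss.item())
```

In this reading, the reconstruction loss encourages the decoupled features to retain enough information to recover missing or corrupted modalities, while the decoupling penalty keeps the common and independent representations from collapsing into one another; the paper's actual objective and architecture should be consulted for the precise formulation.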
Keywords:
Machine Learning: ML: Multi-modal learning
Machine Learning: ML: Robustness
Natural Language Processing: NLP: Sentiment analysis, stylistic analysis, and argument mining
