SDDA: A Progressive Self-Distillation with Decoupled Alignment for Multimodal Image-Text 2 Classification
Chen, Xiaohao, Shuai, Qianjun, Hu, Feng and Cheng, Yongqiang (2024) SDDA: A Progressive Self-Distillation with Decoupled Alignment for Multimodal Image-Text 2 Classification. Neurocomputing, 614. ISSN 1872-8286 (In Press)
Item Type: | Article |
---|
Abstract
Multimodal image–text classification endeavors to deduce the correct category based on the information encapsulated in image–text pairs. Despite the commendable performance achieved by current image–text methodologies, the intrinsic multimodal heterogeneity persists as a challenge, with the contributions from diverse modalities exhibiting considerable variance. In this study, we address this issue by introducing a novel decoupled multimodal Self-Distillation (SDDA) approach, aimed at facilitating fine-grained alignment of shared and private features of image–text features in a low-dimensional space, thereby reducing information redundancy. Specifically, each modality representation is decoupled in an autoregressive manner into two segments within a modality-irrelevant/exclusive space. SDDA imparts additional knowledge transfer to each decoupled segment via self-distillation, while also offering flexible, richer multimodal knowledge supervision for unimodal features. Multimodal classification experiments conducted on two publicly available benchmark datasets verified the efficacy of the algorithm, demonstrating that SDDA surpasses the state-of-the-art baselines.
PDF
SDDA_Revision2.pdf Restricted to Repository staff only until 25 October 2026. Available under License Creative Commons Attribution Non-commercial No Derivatives. Download (8MB) | Request a copy |
More Information
Related URLs: |
Depositing User: Yongqiang Cheng |
Identifiers
Item ID: 18460 |
Identification Number: https://doi.org/10.1016/j.neucom.2024.128794 |
ISSN: 1872-8286 |
URI: http://sure.sunderland.ac.uk/id/eprint/18460 | Official URL: https://www.sciencedirect.com/science/article/abs/... |
Users with ORCIDS
Catalogue record
Date Deposited: 05 Dec 2024 17:10 |
Last Modified: 05 Dec 2024 17:15 |
Author: | Xiaohao Chen |
Author: | Qianjun Shuai |
Author: | Yongqiang Cheng |
Author: | Feng Hu |
University Divisions
Faculty of TechnologySubjects
Computing > Data ScienceComputing > Artificial Intelligence
Actions (login required)
View Item (Repository Staff Only) |