Close menu

SURE

Sunderland Repository records the research produced by the University of Sunderland including practice-based research and theses.

SDDA: A Progressive Self-Distillation with Decoupled Alignment for Multimodal Image-Text 2 Classification

Chen, Xiaohao, Shuai, Qianjun, Hu, Feng and Cheng, Yongqiang (2024) SDDA: A Progressive Self-Distillation with Decoupled Alignment for Multimodal Image-Text 2 Classification. Neurocomputing, 614. ISSN 1872-8286 (In Press)

Item Type: Article

Abstract

Multimodal image–text classification endeavors to deduce the correct category based on the information encapsulated in image–text pairs. Despite the commendable performance achieved by current image–text methodologies, the intrinsic multimodal heterogeneity persists as a challenge, with the contributions from diverse modalities exhibiting considerable variance. In this study, we address this issue by introducing a novel decoupled multimodal Self-Distillation (SDDA) approach, aimed at facilitating fine-grained alignment of shared and private features of image–text features in a low-dimensional space, thereby reducing information redundancy. Specifically, each modality representation is decoupled in an autoregressive manner into two segments within a modality-irrelevant/exclusive space. SDDA imparts additional knowledge transfer to each decoupled segment via self-distillation, while also offering flexible, richer multimodal knowledge supervision for unimodal features. Multimodal classification experiments conducted on two publicly available benchmark datasets verified the efficacy of the algorithm, demonstrating that SDDA surpasses the state-of-the-art baselines.

[img] PDF
SDDA_Revision2.pdf
Restricted to Repository staff only until 25 October 2026.
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (8MB) | Request a copy

More Information

Related URLs:
Depositing User: Yongqiang Cheng

Identifiers

Item ID: 18460
Identification Number: https://doi.org/10.1016/j.neucom.2024.128794
ISSN: 1872-8286
URI: http://sure.sunderland.ac.uk/id/eprint/18460
Official URL: https://www.sciencedirect.com/science/article/abs/...

Users with ORCIDS

ORCID for Xiaohao Chen: ORCID iD orcid.org/0009-0002-1219-6488
ORCID for Qianjun Shuai: ORCID iD orcid.org/0000-0003-0062-0077
ORCID for Yongqiang Cheng: ORCID iD orcid.org/0000-0001-7282-7638

Catalogue record

Date Deposited: 05 Dec 2024 17:10
Last Modified: 05 Dec 2024 17:15

Contributors

Author: Xiaohao Chen ORCID iD
Author: Qianjun Shuai ORCID iD
Author: Yongqiang Cheng ORCID iD
Author: Feng Hu

University Divisions

Faculty of Technology

Subjects

Computing > Data Science
Computing > Artificial Intelligence

Actions (login required)

View Item (Repository Staff Only) View Item (Repository Staff Only)