A multiscale neural architecture search framework for multimodal fusion.

Authors :
Lv, Jindi
Sun, Yanan
Ye, Qing
Feng, Wentao
Lv, Jiancheng
Source :
Information Sciences. Sep2024, Vol. 679, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Multimodal fusion, a machine learning technique, significantly enhances decision-making by leveraging complementary information extracted from different data modalities. The success of multimodal fusion relies heavily on the design of the fusion scheme. However, this process traditionally depends on manual expertise and exhaustive trials. To tackle this challenge, researchers have undertaken studies on DARTS-based Neural Architecture Search (NAS) variants to automate the search of fusion schemes. In this paper, we present theoretical and empirical evidence that highlights the presence of catastrophic search bias in DARTS-based multimodal fusion methods. This bias traps the search into a deceptive optimal childnet, rendering the entire search process ineffective. To circumvent this phenomenon, we introduce a novel NAS framework for multimodal fusion, featuring a robust search strategy and a meticulously designed multi-scale fusion search space. Significantly, the proposed framework is capable of capturing modality-specific information across multiple scales while achieving an automatic balance between intra-modal and inter-modal information. We conduct extensive experiments on three commonly used multimodal classification tasks from different domains and compare the proposed framework against state-of-the-art approaches. The experimental results demonstrate the superior robustness and high efficiency of the proposed framework.

• This paper presents the first theoretical and empirical evidence demonstrating that DARTS suffers from catastrophic search bias in multimodal fusion, rendering the entire search ineffective. We term this phenomenon the "Matthew Effect" and provide an in-depth analysis.
• A novel NAS framework for multimodal fusion is proposed, which features a robust search strategy and a multi-scale fusion search space. The framework leverages the single-path one-shot NAS algorithm as the search strategy, fully circumventing the occurrence of the "Matthew Effect".
• The extensive experimental results showcase the exceptional robustness and high efficiency of the proposed framework.

[ABSTRACT FROM AUTHOR]
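
Illustrative note (not part of the author abstract): single-path one-shot NAS is generally described as training a supernet by sampling one candidate operation per layer uniformly at random at each step, instead of weighting all candidates with learned architecture parameters as DARTS does. The sketch below shows only that sampling step, under stated assumptions. The operation names and the functions `sample_single_path` and `count_op_usage` are hypothetical and are not taken from the paper.

```python
import random

# Hypothetical candidate fusion operations; the names are illustrative only
# and do not come from the paper's search space.
CANDIDATE_OPS = ["concat", "sum", "attention", "skip"]


def sample_single_path(num_layers, ops, rng):
    """Pick exactly one candidate operation per layer, uniformly at random."""
    return [rng.choice(ops) for _ in range(num_layers)]


def count_op_usage(num_steps, num_layers, ops, seed=0):
    """Count how often each operation is sampled over many training steps.

    Because sampling is uniform and independent of any learned weights,
    no operation can accumulate an early advantage during supernet training.
    """
    rng = random.Random(seed)
    usage = {op: 0 for op in ops}
    for _ in range(num_steps):
        for op in sample_single_path(num_layers, ops, rng):
            usage[op] += 1
    return usage


if __name__ == "__main__":
    # Each operation should be selected roughly equally often.
    print(count_op_usage(num_steps=1000, num_layers=4, ops=CANDIDATE_OPS))
```

In a real supernet, each sampled path would be used for one forward and backward pass; here the counts only indicate that every candidate receives a comparable share of updates.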

Details

Language :
English
ISSN :
00200255
Volume :
679
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
178423618
Full Text :
https://doi.org/10.1016/j.ins.2024.121005