Back to Search Start Over

V2MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition.

Authors :
Zheng, Liang
Bai, Jing
Bai, Shaojin
Li, Wenjing
Peng, Bin
Zhou, Tao
Source :
Visual Computer. Sep2024, Vol. 40 Issue 9, p6655-6670. 16p.
Publication Year :
2024

Abstract

Fine-grained 3D shape recognition (FGSR) is crucial for real-world applications. Existing methods face challenges in achieving high accuracy for FGSR due to high similarity within sub-categories and low dissimilarity between them, especially in the absence of part location or attribute annotations. In this paper, we propose V 2 MLP, a multi-view representation-oriented MLP network dedicated to FGSR, using only class labels as supervision. V 2 MLP comprises two key modules: the cross-view interaction MLP (CVI-MLP) and the cross-view fusion MLP (CVF-MLP). The CVI-MLP module captures contextual information, including local and global contexts through cross-view interactions, to extract discriminative view features that reinforce subtle differences between sub-categories. Meanwhile, the CVF-MLP module performs cross-view aggregation from space and view dimensions to obtain the final 3D shape features, minimizing information loss during the view feature fusion process. Extensive experiments on three categories from the FG3D dataset demonstrate the effectiveness of V 2 MLP in learning discriminative features for 3D shapes, achieving state-of-the-art accuracy for FGSR. Additionally, V 2 MLP performs competitively for meta-category recognition on the ModelNet40 dataset. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*ANNOTATIONS

Details

Language :
English
ISSN :
01782789
Volume :
40
Issue :
9
Database :
Academic Search Index
Journal :
Visual Computer
Publication Type :
Academic Journal
Accession number :
179041406
Full Text :
https://doi.org/10.1007/s00371-023-03191-4