Back to Search
Start Over
V2MLP: an accurate and simple multi-view MLP network for fine-grained 3D shape recognition.
- Source :
-
Visual Computer . Sep2024, Vol. 40 Issue 9, p6655-6670. 16p. - Publication Year :
- 2024
-
Abstract
- Fine-grained 3D shape recognition (FGSR) is crucial for real-world applications. Existing methods face challenges in achieving high accuracy for FGSR due to high similarity within sub-categories and low dissimilarity between them, especially in the absence of part location or attribute annotations. In this paper, we propose V 2 MLP, a multi-view representation-oriented MLP network dedicated to FGSR, using only class labels as supervision. V 2 MLP comprises two key modules: the cross-view interaction MLP (CVI-MLP) and the cross-view fusion MLP (CVF-MLP). The CVI-MLP module captures contextual information, including local and global contexts through cross-view interactions, to extract discriminative view features that reinforce subtle differences between sub-categories. Meanwhile, the CVF-MLP module performs cross-view aggregation from space and view dimensions to obtain the final 3D shape features, minimizing information loss during the view feature fusion process. Extensive experiments on three categories from the FG3D dataset demonstrate the effectiveness of V 2 MLP in learning discriminative features for 3D shapes, achieving state-of-the-art accuracy for FGSR. Additionally, V 2 MLP performs competitively for meta-category recognition on the ModelNet40 dataset. [ABSTRACT FROM AUTHOR]
- Subjects :
- *ANNOTATIONS
Subjects
Details
- Language :
- English
- ISSN :
- 01782789
- Volume :
- 40
- Issue :
- 9
- Database :
- Academic Search Index
- Journal :
- Visual Computer
- Publication Type :
- Academic Journal
- Accession number :
- 179041406
- Full Text :
- https://doi.org/10.1007/s00371-023-03191-4