Multi-View Visual Relationship Detection with Estimated Depth Map

Authors :
Xiaozhou Liu
Ming-Gang Gan
Yuxuan He
Source :
Applied Sciences, Vol 12, Iss 9, p 4674 (2022)
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

The abundant visual information contained in multi-view images is widely used in computer vision tasks. Existing visual relationship detection (VRD) frameworks have extended the feature vector to improve model performance. However, single-view information cannot fully reveal the visual relationships in complex visual scenes. To solve this problem and explore multi-view information in a VRD model, a novel multi-view VRD framework based on a monocular RGB image and an estimated depth map is proposed. The contributions of this paper are threefold. First, we construct a novel multi-view framework that fuses information from different views extracted from estimated RGB-D images. Second, a multi-view image generation method is proposed to transfer the flat visual space to a 3D multi-view space. Third, we redesign the visual relationship balanced classifier so that it can process multi-view feature vectors simultaneously. Detailed experiments were conducted on two datasets to demonstrate the effectiveness of the multi-view VRD framework. The experimental results showed that the framework achieved state-of-the-art zero-shot learning performance under specific depth conditions.

Details

Language :
English
ISSN :
2076-3417
Volume :
12
Issue :
9
Database :
Directory of Open Access Journals
Journal :
Applied Sciences
Publication Type :
Academic Journal
Accession number :
edsdoj.fb5ee27aa42078c4f3a38d436663a
Document Type :
article
Full Text :
https://doi.org/10.3390/app12094674