Back to Search
Start Over
Employing Bilinear Fusion and Saliency Prior Information for RGB-D Salient Object Detection
- Source :
- IEEE Transactions on Multimedia. 24:1651-1664
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
-
Abstract
- Multi-modal feature fusion and saliency reasoning are two core sub-tasks of RGB-D salient object detection. However, most existing models employ linear fusion strategies (e.g., concatenation) for multi-modal feature fusion and use a simple coarse-to-fine structure for saliency reasoning. Despite their simpleness, they can neither fully capture the cross-modal complementary information nor exploit the multi-level complementary information among the cross-modal features at different levels. To address these issues, a novel RGB-D salient object detection model is presented, where we pay special attention to the aforementioned two sub-tasks. Concretely, a multi-modal feature interaction module is first presented to explore more interactions between the unimodal RGB and depth features. It helps to capture their cross-modal complementary information by jointly using some simple linear fusion strategies and bilinear fusion ones. Then, a saliency prior information guided fusion module is presented to exploit the multi-level complementary information among the fused cross-modal features at different levels. Instead of employing a simple convolutional layer for the final saliency prediction, a saliency refinement and prediction module is designed to better exploit those extracted multi-level cross-modal information for RGB-D saliency detection. Experimental results on several benchmark datasets verify the effectiveness and superiority of the proposed framework over some state-of-the-art methods.
- Subjects :
- Structure (mathematical logic)
Exploit
Computer science
business.industry
Concatenation
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Bilinear interpolation
Pattern recognition
Computer Science Applications
Feature (computer vision)
Signal Processing
Media Technology
Benchmark (computing)
RGB color model
Artificial intelligence
Electrical and Electronic Engineering
Layer (object-oriented design)
business
Subjects
Details
- ISSN :
- 19410077 and 15209210
- Volume :
- 24
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Multimedia
- Accession number :
- edsair.doi...........60a935c2899b8b88525f0fa4b5983acd
- Full Text :
- https://doi.org/10.1109/tmm.2021.3069297