Back to Search
Start Over
Consistent Features From Varied Contexts and Ordered Multiple Attention Forms for RGB-D Salient Object Detection
- Source :
- IEEE Sensors Journal; August 2024, Vol. 24 Issue: 16 p25879-25890, 12p
- Publication Year :
- 2024
-
Abstract
- Attention-based fusion and sophisticated multimodal feature extraction are the key components of many state-of-the-art RGB-D salient object detection (SOD) methods. In this regard, different forms of attention provide a variety of emphasis during fusion in an appropriate order, and extraction of consistent features at multiple levels from a variety of contexts can be leveraged to improve SOD performance further. To this end, this article proposes an RGB-D SOD technique that learns multilevel features considering a variety of local and global contexts along with long-range dependencies. The learning also involves consistency evaluation of saliency features from pixel to region level using cascaded capsule networks to facilitate faithful preservation of intricate salient object details. The fusion of multimodal features leading to the SOD is performed using cross-modal interactions through multiple attention forms ordered in a favorable way. Quantitative and qualitative experimental results using multiple standard datasets and measures demonstrate the effectiveness of the proposed SOD framework in comparison to the state-of-the-art. An ablation study establishes the utility of various components of the proposed approach.
Details
- Language :
- English
- ISSN :
- 1530437X and 15581748
- Volume :
- 24
- Issue :
- 16
- Database :
- Supplemental Index
- Journal :
- IEEE Sensors Journal
- Publication Type :
- Periodical
- Accession number :
- ejs67218857
- Full Text :
- https://doi.org/10.1109/JSEN.2024.3405658