1. Cross-Attentional Bracket-shaped Convolutional Network for semantic image segmentation
- Author
-
Sung-Ho Bae, Sungyoung Lee, Cam-Hao Hua, and Thien Huynh-The
- Subjects
Information Systems and Management ,Computer science ,business.industry ,05 social sciences ,050301 education ,Pattern recognition ,02 engineering and technology ,Pascal (programming language) ,Convolutional neural network ,Computer Science Applications ,Theoretical Computer Science ,Artificial Intelligence ,Control and Systems Engineering ,0202 electrical engineering, electronic engineering, information engineering ,Semantic image segmentation ,020201 artificial intelligence & image processing ,Segmentation ,Artificial intelligence ,business ,0503 education ,computer ,Software ,computer.programming_language - Abstract
As perception-related applications are of great importance in industrial production and daily life nowadays, solutions for understanding given images semantically receive numerous attention from the literature. To this end, significant accomplishments have been reached for such pixel-wise segmentation problem thanks to novel manipulations of integrating global context into local details in convolutional neural networks. However, this strategy in the existing work did not exhaustively exploit middle-level features, which carry reasonable balance between fine-grained and semantic information. Therefore, this paper introduces a Cross-Attentional Bracket-shaped Convolutional Network (CAB-Net) to leverage their contribution to the tournament of constructing pixel-wise labeled map. In concrete, fine-to-coarse feature maps of interest from the backbone network are densely combined by an efficient fusion of channel-wisely and spatially attentional schemes in crossing manner, namely Cross-Attentional Fusion, to embed semantically rich features into finer patterns. Continuously, these newly decoded outputs repeat the same procedure round-by-round until shaping a final feature map having finest resolution for complete scene understanding. Consequently, the proposed CAB-Net achieves competitive mean Intersection of Union performance on PASCAL VOC 2012 (83.6% without MS-COCO pretraining), CamVid (76.4%) and Cityscapes (78.3%) datasets.
- Published
- 2020
- Full Text
- View/download PDF