Back to Search Start Over

A top-down manner-based DCNN architecture for semantic image segmentation.

Authors :
Kai Qiao
Jian Chen
Linyuan Wang
Lei Zeng
Bin Yan
Source :
PLoS ONE, Vol 12, Iss 3, p e0174508 (2017)
Publication Year :
2017
Publisher :
Public Library of Science (PLoS), 2017.

Abstract

Given their powerful feature representation for recognition, deep convolutional neural networks (DCNNs) have been driving rapid advances in high-level computer vision tasks. However, their performance in semantic image segmentation is still not satisfactory. Based on the analysis of visual mechanism, we conclude that DCNNs in a bottom-up manner are not enough, because semantic image segmentation task requires not only recognition but also visual attention capability. In the study, superpixels containing visual attention information are introduced in a top-down manner, and an extensible architecture is proposed to improve the segmentation results of current DCNN-based methods. We employ the current state-of-the-art fully convolutional network (FCN) and FCN with conditional random field (DeepLab-CRF) as baselines to validate our architecture. Experimental results of the PASCAL VOC segmentation task qualitatively show that coarse edges and error segmentation results are well improved. We also quantitatively obtain about 2%-3% intersection over union (IOU) accuracy improvement on the PASCAL VOC 2011 and 2012 test sets.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
19326203
Volume :
12
Issue :
3
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.1f74c6077ec34aec8f8c0d2d80625c4c
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0174508