Back to Search Start Over

An efficient semantic segmentation method based on transfer learning from object detection

Authors :
Wei Yang
Jianlin Zhang
Zhongbi Chen
Zhiyong Xu
Source :
IET Image Processing, Vol 15, Iss 1, Pp 57-64 (2021)
Publication Year :
2021
Publisher :
Wiley, 2021.

Abstract

Abstract Nowadays, numerous semantic segmentation techniques were used to complex scenes such as urban streets. However, speed issues are not considered in most of these methods, and real‐time methods do not mainly include enough accuracy. In this paper, an efficient semantic segmentation method is proposed, using the feature extractor of a real‐time object detection model, Darknet53, as the backbone of DeepLabv3+. By the high accuracy of DeepLabv3+ structure and great efficiency of Darknet53, a mean intersection was obtained over union of 76.3% in Cityscapes test set, and fast inference speed simultaneously (0.178 s per frame on one GTX 1080Ti GPU). A huge imbalance of objects was noticed on Cityscapes dataset. To solve this problem, a Focal Loss like loss function was proposed to concentrate more on the hard difficult pixels. Moreover, an atrous convolution block was proposed to extract more high‐level features. Based on the experimental results, it is proved that these changes contribute to a better result on the Cityscapes test set (77.8% mean Intersection over Union) and faster inference speed (0.171 s per frame). Authors' model achieves state‐of‐art results on Cityscapes test set (79.1% mean Intersection over Union) after fine‐tuning on Cityscapes coarsely annotated dataset.

Details

Language :
English
ISSN :
17519667 and 17519659
Volume :
15
Issue :
1
Database :
Directory of Open Access Journals
Journal :
IET Image Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.587b0a60d8649228169ff8ff2f1e1bb
Document Type :
article
Full Text :
https://doi.org/10.1049/ipr2.12005