Back to Search Start Over

Feature pyramid network with multi-scale prediction fusion for real-time semantic segmentation.

Authors :
Quyen, Toan Van
Kim, Min Young
Source :
Neurocomputing. Jan2023, Vol. 519, p104-113. 10p.
Publication Year :
2023

Abstract

Feature pyramid network (FPN) is constructed from a bottom-up pathway and a top-down pathway. The method involves multi-scale features, so it can obtain rich contextual information from lower scales and high resolution from the largest scale. Additionally, different receptive fields are effective to capture both thin and large objects in image scenes. All feature maps concatenate together to predict the targets. However, the average pooling method yields the problem of combining the best predictions with poorer ones. In this paper, we proposed a dual prediction to leverage the useful characteristics of each FPN feature map. A low scale prediction attains good precision for large objects. The other one suitably segments narrow objects. Finally, a multi-scale fusion is deployed with an attention part. The attention module finds pixels of a low scale having high probabilities of wrong labels, and then requires the supplements from a high scale. A multi-scale fusion allows the network to learn across the different scales of predictions. We have achieved good Results 77.9% mIoU at 62 FPS on Cityscapes and 44.1% mIoU on Mapillary Vistas. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
519
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
160539613
Full Text :
https://doi.org/10.1016/j.neucom.2022.11.062