Back to Search Start Over

MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer

Authors :
Wei Yuan
Wenbo Xu
Source :
Remote Sensing, Vol 13, Iss 4743, p 4743 (2021), Remote Sensing, Volume 13, Issue 23, Pages: 4743
Publication Year :
2021
Publisher :
MDPI AG, 2021.

Abstract

The segmentation of remote sensing images by deep learning technology is the main method for remote sensing image interpretation. However, the segmentation model based on a convolutional neural network cannot capture the global features very well. A transformer, whose self-attention mechanism can supply each pixel with a global feature, makes up for the deficiency of the convolutional neural network. Therefore, a multi-scale adaptive segmentation network model (MSST-Net) based on a Swin Transformer is proposed in this paper. Firstly, a Swin Transformer is used as the backbone to encode the input image. Then, the feature maps of different levels are decoded separately. Thirdly, the convolution is used for fusion, so that the network can automatically learn the weight of the decoding results of each level. Finally, we adjust the channels to obtain the final prediction map by using the convolution with a kernel of 1 × 1. By comparing this with other segmentation network models on a WHU building data set, the evaluation metrics, mIoU, F1-score and accuracy are all improved. The network model proposed in this paper is a multi-scale adaptive network model that pays more attention to the global features for remote sensing segmentation.

Details

Language :
English
ISSN :
20724292
Volume :
13
Issue :
4743
Database :
OpenAIRE
Journal :
Remote Sensing
Accession number :
edsair.doi.dedup.....589c6b0edaa6ffe490dffaa4bfddef89