Back to Search
Start Over
RoadTransNet: advancing remote sensing road extraction through multi-scale features and contextual information.
- Source :
- Signal, Image & Video Processing; Apr2024, Vol. 18 Issue 3, p2403-2412, 10p
- Publication Year :
- 2024
-
Abstract
- Road extraction is a crucial task that requires high-resolution remote sensing images with wide-ranging applications in urban planning, navigation, and autonomous vehicles. However, this task is challenged by complex road structures and the need to capture long-range dependencies. RoadTransNet is a new road extraction architecture that aims to solve these problems that making the power of the Swin Transformer and Feature Pyramid Network (FPN) while introducing Transformer-like attention mechanisms. RoadTransNet combines a robust convolutional backbone, inspired by the Swin Transformer, with an FPN to capture multi-scale features effectively. The Transformer-like attention mechanisms, including multi-head self-attention and cross-attention, enable the network to represent context information on a local and global scale, ensuring accurate road extraction. The skip connections facilitate gradient flow, preserving fine details, and decoding layers transform extracted features into precise road predictions. Our experiments are conducted using the RoadTransNet, which is subject to rigorous assessment on the following datasets: the DeepGlobe road extraction challenge Dataset and the CHN6-cUG roads dataset. The outcomes indicate its superior performance in achieving high-level metrics of precision and recall, as well as achieving high F1 scores and IoU. The comparative evaluations performed against traditional methods showcase RoadTransNet's ability to capture complex road structures and long-range dependencies. The RoadTransNet stands as a comprehensive solution for the extraction of roads in high-resolution remote sensing images, offering promising opportunities for improving urban planning, navigation systems, and autonomous vehicle technologies. Its success lies in the synergy of convolutional and transformer-based architectures, paving the way for advanced remote sensing applications in smart cities and others. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 18631703
- Volume :
- 18
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- Signal, Image & Video Processing
- Publication Type :
- Academic Journal
- Accession number :
- 176144146
- Full Text :
- https://doi.org/10.1007/s11760-023-02916-1