Wet-ConViT: A Hybrid Convolutional–Transformer Model for Efficient Wetland Classification Using Satellite Data

Authors :
Ali Radman
Fariba Mohammadimanesh
Masoud Mahdianpari
Source :
Remote Sensing, Vol 16, Iss 14, p 2673 (2024)
Publication Year :
2024
Publisher :
MDPI AG, 2024.

Abstract

Accurate and efficient classification of wetlands, one of the most valuable ecological resources, from satellite remote sensing data is essential for effective environmental monitoring and sustainable land management. Deep learning models have recently shown significant promise for identifying wetland land cover; in practice, however, most struggle to remain computationally efficient while achieving high accuracy from limited ground-truth training samples. To address these limitations, this study designs a novel deep learning model, Wet-ConViT, for the precise mapping of wetlands using multi-source satellite data, combining the strengths of multispectral Sentinel-2 and SAR Sentinel-1 datasets. The proposed architecture draws on both the local information capture of convolutions and the long-range feature extraction of transformers. Specifically, the foundation of Wet-ConViT is the multi-head convolutional attention (MHCA) module, which integrates convolutional operations into a transformer attention mechanism. By leveraging convolutions, MHCA improves the efficiency of the original transformer self-attention mechanism. This resulted in high-precision land cover classification with minimal computational complexity compared with other state-of-the-art models, including two convolutional neural networks (CNNs), two transformers, and two hybrid CNN–transformer models. In particular, Wet-ConViT classified land cover with an overall accuracy of approximately 95%, exceeding the next best model, the hybrid CoAtNet, by about 2%. The results highlighted the proposed architecture’s high precision and its efficiency in terms of parameters, memory usage, and processing time. Wet-ConViT could be useful for practical wetland mapping tasks where precision and computational efficiency are paramount.

Details

Language :
English
ISSN :
2072-4292
Volume :
16
Issue :
14
Database :
Directory of Open Access Journals
Journal :
Remote Sensing
Publication Type :
Academic Journal
Accession number :
edsdoj.6346bdfead4c41619d99fc4ff204a742
Document Type :
article
Full Text :
https://doi.org/10.3390/rs16142673