Start Over

M2Caps: learning multi-modal capsules of optical and SAR images for land cover classification

Authors :: Haodi Zhang
Anzhu Yu
Kuiliang Gao
Xuanbei Lu
Xuefeng Cao
Wenyue Guo
Weiqi Lian
Source :: International Journal of Digital Earth, Vol 18, Iss 1 (2025)
Publication Year :: 2025
Publisher :: Taylor & Francis Group, 2025.
Abstract: Land cover classification (LCC) is essential for monitoring land use and changes. This study examines the integration of optical (OPT) and synthetic aperture radar (SAR) images for precise LCC. The disparity between OPT and SAR images introduces challenges in fusing high-level semantic information and utilizing multi-scale features. To address these challenges, this paper proposes a novel multi-modal capsules model (M²Caps) incorporating multi-modal capsules learning and cascaded features fusion modules. The multi-modal capsules learning module models high-level semantic information and abstract relationships across diverse remote sensing images (RSIs) modalities as vectors, thereby facilitating the induction of joint multi-modal features with high discriminability and robustness. Subsequently, the cascaded features fusion module integrates various feature scales, concurrently processing deep multi modal features, shallow OPT features, and shallow SAR features at each layer. This approach ensures the precise characterization of both local details and global semantics. M²Caps outperformed state-of-the-art models, improving mean intersection over union (mIoU) by 2.86% – 12.9% on the WHU-OPT-SAR dataset and 3.91% – 12.3% on the GF-2 and GF-3 Pohang datasets, demonstrating its effectiveness in high-precision LCC in complex environments.

Subjects :: Land cover classification
multi-modal semantic segmentation
multi-modal capsules learning
cascaded features fusion
optical images
synthetic aperture radar
Mathematical geography. Cartography
GA1-1776

Details

Language :: English
ISSN :: 17538947 and 17538955
Volume :: 18
Issue :: 1
Database :: Directory of Open Access Journals
Journal :: International Journal of Digital Earth
Publication Type :: Academic Journal
Accession number :: edsdoj.bbcb687e0c8140858df90bafd70a81d4
Document Type :: article
Full Text :: https://doi.org/10.1080/17538947.2024.2447347

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

M2Caps: learning multi-modal capsules of optical and SAR images for land cover classification

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

M2Caps: learning multi-modal capsules of optical and SAR images for land cover classification

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources