Back to Search Start Over

Improving global soil moisture prediction through cluster-averaged sampling strategy.

Authors :
Li, Qingliang
Xiao, Qiyun
Zhang, Cheng
Zhu, Jinlong
Chen, Xiao
Yan, Yuguang
Liu, Pingping
Shangguan, Wei
Wei, Zhongwang
Li, Lu
Dong, Wenzong
Dai, Yongjiu
Source :
Geoderma. Sep2024, Vol. 449, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

• Utilizing variables such as Soil Water Content, Slope Terrain, Surface Slope Difference, Soil Carbon, Surface Temperature, and Digital Elevation Model as clustering criteria significantly enhances global soil moisture prediction accuracy, surpassing LSTM-based models. • Our sampling strategy improves various deep learning models, including LSTM, Encoder-Decoder LSTM (EDLSTM), and Attention LSTM (AttLSTM). • Significant performance enhancements are observed in challenging environments, such as arid, cold, and polar regions, with R2 improvements of up to 50 %. Understanding and predicting global soil moisture (SM) is crucial for water resource management and agricultural production. While deep learning methods (DL) have shown strong performance in SM prediction, imbalances in training samples with different characteristics pose a significant challenge. We propose that improving the diversity and balance of batch training samples during gradient descent can help address this issue. To test this hypothesis, we developed a Cluster-Averaged Sampling (CAS) strategy utilizing unsupervised learning techniques. This approach involves training the model with evenly sampled data from different clusters, ensuring both sample diversity and numerical consistency within each cluster. This approach prevents the model from overemphasizing specific sample characteristics, leading to more balanced feature learning. Experiments using the LandBench1.0 dataset with five different seeds for 1-day lead-time global predictions reveal that CAS outperforms several Long Short-Term Memory (LSTM)-based models that do not employ this strategy. The median Coefficient of Determination (R2) improved by 2.36 % to 4.31 %, while Kling-Gupta Efficiency (KGE) improved by 1.95 % to 3.16 %. In high-latitude areas, R2 improvements exceeded 40 % in specific regions. To further validate CAS under realistic conditions, we tested it using the Soil Moisture Active and Passive Level 3 (SMAP-L3) satellite data for 1 to 3-day lead-time global predictions, confirming its efficacy. The study substantiates the CAS strategy and introduces a novel training method for enhancing the generalization of DL models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00167061
Volume :
449
Database :
Academic Search Index
Journal :
Geoderma
Publication Type :
Academic Journal
Accession number :
179691364
Full Text :
https://doi.org/10.1016/j.geoderma.2024.116999