Back to Search Start Over

Transferring Hydrologic Data Across Continents – Leveraging Data‐Rich Regions to Improve Hydrologic Prediction in Data‐Sparse Regions.

Authors :
Ma, Kai
Feng, Dapeng
Lawson, Kathryn
Tsai, Wen‐Ping
Liang, Chuan
Huang, Xiaorong
Sharma, Ashutosh
Shen, Chaopeng
Source :
Water Resources Research; May2021, Vol. 57 Issue 5, p1-11, 11p
Publication Year :
2021

Abstract

There is a drastic geographic imbalance in available global streamflow gauge and catchment property data, with additional large variations in data characteristics. As a result, models calibrated in one region cannot normally be migrated to another without significant modifications. Currently in these regions, non‐transferable machine learning models are habitually trained over small local data sets. Here we show that transfer learning (TL), in the senses of weight initialization and weight freezing, allows long short‐term memory (LSTM) streamflow models that were pretrained over the conterminous United States (CONUS, the source data set) to be transferred to catchments on other continents (the target regions), without the need for extensive catchment attributes available at the target location. We demonstrate this possibility for regions where data are dense (664 basins in Great Britain), moderately dense (49 basins in central Chile), and scarce with only remotely sensed attributes available (5 basins in China). In both China and Chile, the TL models showed significantly elevated performance compared to locally trained models using all basins. The benefits of TL increased with the amount of available data in the source data set, and seemed to be more pronounced with greater physiographic diversity. The benefits from TL were greater than from pretraining LSTM using the outputs from an uncalibrated hydrologic model. These results suggest hydrologic data around the world have commonalities which could be leveraged by deep learning, and synergies can be had with a simple modification of the current workflows, greatly expanding the reach of existing big data. Finally, this work diversified existing global streamflow benchmarks. Plain Language Summary: We introduced a method to utilize available big data to better start and warm up a machine learning streamflow model that is later fine‐tuned for prediction in basins on other continents (Asia, South America and Europe). This procedure noticeably improved streamflow volume prediction for different scenarios with varying amounts of data in the target basins (in terms of time period, length of collected data, and number of basins having data). This allows thousands of basins across the world with only a few years' worth of streamflow observations to benefit from improved modeling and accuracy resulting from the use of deep learning. Key Points: Basins around the world can be better modeled by applying transfer learning (TL) to a deep network trained in the US, and tuning it locallyThe benefits of TL increased with the amount and diversity of the source data, and were larger than from pretraining with a hydrologic modelThis work greatly expands the reach of deep learning, adds to the value of existing big data, and calls for synergy of global data sets [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00431397
Volume :
57
Issue :
5
Database :
Complementary Index
Journal :
Water Resources Research
Publication Type :
Academic Journal
Accession number :
150514809
Full Text :
https://doi.org/10.1029/2020WR028600