1. Transferring Hydrologic Data Across Continents – Leveraging Data‐Rich Regions to Improve Hydrologic Prediction in Data‐Sparse Regions.
- Author
-
Ma, Kai, Feng, Dapeng, Lawson, Kathryn, Tsai, Wen‐Ping, Liang, Chuan, Huang, Xiaorong, Sharma, Ashutosh, and Shen, Chaopeng
- Subjects
DEEP learning ,MACHINE learning ,CONTINENTS ,STREAMFLOW ,HYDROLOGIC models ,BIG data - Abstract
There is a drastic geographic imbalance in available global streamflow gauge and catchment property data, with additional large variations in data characteristics. As a result, models calibrated in one region cannot normally be migrated to another without significant modifications. Currently in these regions, non‐transferable machine learning models are habitually trained over small local data sets. Here we show that transfer learning (TL), in the senses of weight initialization and weight freezing, allows long short‐term memory (LSTM) streamflow models that were pretrained over the conterminous United States (CONUS, the source data set) to be transferred to catchments on other continents (the target regions), without the need for extensive catchment attributes available at the target location. We demonstrate this possibility for regions where data are dense (664 basins in Great Britain), moderately dense (49 basins in central Chile), and scarce with only remotely sensed attributes available (5 basins in China). In both China and Chile, the TL models showed significantly elevated performance compared to locally trained models using all basins. The benefits of TL increased with the amount of available data in the source data set, and seemed to be more pronounced with greater physiographic diversity. The benefits from TL were greater than from pretraining LSTM using the outputs from an uncalibrated hydrologic model. These results suggest hydrologic data around the world have commonalities which could be leveraged by deep learning, and synergies can be had with a simple modification of the current workflows, greatly expanding the reach of existing big data. Finally, this work diversified existing global streamflow benchmarks. Plain Language Summary: We introduced a method to utilize available big data to better start and warm up a machine learning streamflow model that is later fine‐tuned for prediction in basins on other continents (Asia, South America and Europe). This procedure noticeably improved streamflow volume prediction for different scenarios with varying amounts of data in the target basins (in terms of time period, length of collected data, and number of basins having data). This allows thousands of basins across the world with only a few years' worth of streamflow observations to benefit from improved modeling and accuracy resulting from the use of deep learning. Key Points: Basins around the world can be better modeled by applying transfer learning (TL) to a deep network trained in the US, and tuning it locallyThe benefits of TL increased with the amount and diversity of the source data, and were larger than from pretraining with a hydrologic modelThis work greatly expands the reach of deep learning, adds to the value of existing big data, and calls for synergy of global data sets [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF