Back to Search Start Over

Evaluating epistemic uncertainty estimation strategies in vegetation trait retrieval using hybrid models and imaging spectroscopy data.

Authors :
García-Soria, José Luis
Morata, Miguel
Berger, Katja
Pascual-Venteo, Ana Belén
Rivera-Caicedo, Juan Pablo
Verrelst, Jochem
Source :
Remote Sensing of Environment. Aug2024, Vol. 310, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

The new-generation satellite imaging spectrometers provide an unprecedented data stream to be processed into quantifiable vegetation traits. Hybrid models have gained widespread acceptance in recent years due to their versatility in converting spectral data into traits. In hybrid models, the retrieval is obtained through a machine learning regression algorithm (MLRA) trained on a wide range of simulated data. For instance, they are currently under development for trait retrieval in preparation for the upcoming Copernicus Hyperspectral Imaging Mission for the Environment (CHIME), among others targeting routine estimation of canopy nitrogen content (CNC). However, like any retrieval algorithm, the process is not error-free, and most MLRAs inherently lack an uncertainty estimation related to the retrieved traits, which implies a risk of misinterpretation when applying the model to real-world data. Therefore, this study aimed to assess epistemic uncertainty estimation strategies (Bayesian method, drop-out, quantile regression, and bootstrapping) alongside the estimation of CNC using competitive MLRAs. Each of the regression models was evaluated using three data sets: (1) simulated scenes with varying noise using the SCOPE 2.1 radiative transfer model, (2) hyperspectral images from the PRISMA sensor, and (3) field-measured data. Analysis of generated uncertainty intervals led to the following findings: First, Gaussian processes regression (GPR) offers meaningful uncertainties, primarily attributable to spectral data degradation, which provide supplementary insights into the quality of trait mapping. Second, bootstrapping uncertainties can be used as quality indicators of the reliability of the estimates retrieved by hybrid models. Yet, its variability depends on the used MLRA, which impedes trusting its variance as a confidence interval. Third, quantile regression forest (QRF), despite not being top-performing algorithm, exhibit outstanding robustness estimations and uncertainty when the spectral data is degraded, either by Gaussian noise or by striping, often occurring in satellite imagery. Fourth, bootstrapped kernel ridge regression (KRR) demonstrated comparable performance to the benchmark algorithm GPR; the retrievals and uncertainties of these two MLRAs were highly correlated. Fifth, bootstrapped partial least squares regression (PLSR) estimations and uncertainties exhibit poor robustness to noise degradation, with normalized root mean square error (NRMSE) increasing from 19% to 112%. Additionally, a GUI tool was integrated into the ARTMO software package for assessing epistemic uncertainties from the embedded regression algorithms, providing a trait mapping quality indicator for mapping applications, and improving decision-making. [Display omitted] • Quantified uncertainty provides insight in pixel retrieval from imaging spectroscopy. • Great variability in epistemic uncertainty depends on the ML algorithm. • Bootstrapped KRR and Bayesian GPR achieved top estimates and uncertainty. • PLSR is weak to noise degradation and is unreliable for spatiotemporal transference. • Bootstrapping, Bayesian and dropout uncertainties serve as quality mapping indicators. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00344257
Volume :
310
Database :
Academic Search Index
Journal :
Remote Sensing of Environment
Publication Type :
Academic Journal
Accession number :
177754421
Full Text :
https://doi.org/10.1016/j.rse.2024.114228