Back to Search Start Over

Domain invariant covariate selection (Di-CovSel) for selecting generalized features across domains.

Authors :
Diaz, Valeria Fonseca
Mishra, Puneet
Roger, Jean-Michel
Saeys, Wouter
Source :
Chemometrics & Intelligent Laboratory Systems. Mar2022, Vol. 222, pN.PAG-N.PAG. 1p.
Publication Year :
2022

Abstract

Multivariate spectral signals are highly correlated. Often, variable selection techniques are deployed, aiming at model optimization, identification of key variables to explore the underlying physicochemical system or development of a cheap multi-spectral system based on key variables. However, many times the selected variables do not supply a good estimate of properties when tested on a new setting such as new measurements performed on a different spectrometer, different physical or chemical state of the samples and difference in the environmental factors around the experiment. Often the model based on variables selected in the first domain (specific conditions/instrument) does not generalize on the new domain (specific conditions/instrument). To deal with it, in the present work a new method to variable selection called domain invariant covariate selection (di-CovSel) is proposed. The method selects the most informative variables which are invariant to the differences in the instruments, physical or chemical state of the samples and the differences in the environmental factors around the experiment. The method is inspired by domain invariant partial least-square (di-PLS) and the covariate selection (CovSel). The potential of the method is demonstrated on four real cases related to the calibration of near-infrared (NIR) spectroscopy on agri-food materials. The results show that in all the cases, the domain invariant features selected by the di-CovSel have low prediction error compared to the standard variable selection with the CovSel approach when the models are tested on a new data domain. In summary, domain invariant features selected across domains support the development of calibration models with good generalization and supply a better understanding of the system by bypassing the external factors originating from differences in the instruments, physical or chemical states of the samples and the differences in the environmental factors around the experiment. Note that one key feature of the proposed method is that the most important variables which generalize well across domains can be identified without requiring reference measurements in the target domain. • A domain invariant variable selection technique is presented. • Method selects variables to achieve generalized models. • Several application cases for domain invariant variable selection are presented. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01697439
Volume :
222
Database :
Academic Search Index
Journal :
Chemometrics & Intelligent Laboratory Systems
Publication Type :
Academic Journal
Accession number :
155375768
Full Text :
https://doi.org/10.1016/j.chemolab.2022.104499