Back to Search Start Over

Classification of GC-MS measurements of wines by combining data dimension reduction and variable selection techniques

Authors :
Rasmus Bro
Riccardo Leardi
Thomas Skov
Davide Ballabio
Ballabio, D
Skov, T
Leardi, R
Bro, R
Publication Year :
2008
Publisher :
Wiley, 2008.

Abstract

Different classification methods (Partial Least Squares Discriminant Analysis, Extended Canonical Variates Analysis and Linear Discriminant Analysis), in combination with variable selection approaches (Forward Selection and Genetic Algorithms), were compared, evaluating their capabilities in the geographical discrimination of wine samples. Sixty-two samples were analysed by means of dynamic headspace gas chromatography mass spectrometry (HS-GC-MS) and the entire chromatographic profile was considered to build the dataset. Since variable selection techniques pose a risk of overfitting when a large number of variables is used, a method for coupling data dimension reduction and variable selection was proposed. This approach compresses windows of the original data by retaining only significant components of local Principal Component Analysis models. The subsequent variable selection is then performed on these locally derived score variables. The results confirmed that the classification models achieved on the reduced data were better than those obtained on the entire chromatographic profile, with the exception of Extended Canonical Variates Analysis, which gave acceptable models in both cases. Copyright © 2008 John Wiley & Sons, Ltd.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....ad454f6a7fbde6f881fe59dce98cd1da