Classification of GC-MS measurements of wines by combining data dimension reduction and variable selection techniques
- Publication Year :
- 2008
- Publisher :
- Wiley, 2008.
- Abstract :
- Different classification methods (Partial Least Squares Discriminant Analysis, Extended Canonical Variates Analysis and Linear Discriminant Analysis), in combination with variable selection approaches (Forward Selection and Genetic Algorithms), were compared, evaluating their capabilities in the geographical discrimination of wine samples. Sixty-two samples were analysed by means of dynamic headspace gas chromatography mass spectrometry (HS-GC-MS) and the entire chromatographic profile was considered to build the dataset. Since variable selection techniques pose a risk of overfitting when a large number of variables is used, a method for coupling data dimension reduction and variable selection was proposed. This approach compresses windows of the original data by retaining only significant components of local Principal Component Analysis models. The subsequent variable selection is then performed on these locally derived score variables. The results confirmed that the classification models achieved on the reduced data were better than those obtained on the entire chromatographic profile, with the exception of Extended Canonical Variates Analysis, which gave acceptable models in both cases. Copyright © 2008 John Wiley & Sons, Ltd.
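The compression step described in the abstract (local PCA models fitted on windows of the chromatographic profile, keeping only the significant components) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `local_pca_scores`, the fixed `window_size`, and the cumulative explained-variance cutoff `var_threshold` used to decide which components count as "significant" are all assumptions made for illustration, since the abstract does not state the criterion that was used.

```python
import numpy as np

def local_pca_scores(X, window_size, var_threshold=0.95):
    """Compress column windows of X into local PCA score variables.

    X             : (n_samples, n_variables) array, e.g. unfolded chromatograms
    window_size   : number of adjacent variables per window (assumed fixed)
    var_threshold : assumed significance rule; keep the fewest components whose
                    cumulative explained variance reaches this fraction
    """
    X = np.asarray(X, dtype=float)
    blocks = []
    for start in range(0, X.shape[1], window_size):
        window = X[:, start:start + window_size]
        centred = window - window.mean(axis=0)       # mean-centre each variable
        U, s, _ = np.linalg.svd(centred, full_matrices=False)
        total = np.sum(s ** 2)
        if total == 0:
            continue                                  # constant window, nothing to keep
        explained = np.cumsum(s ** 2) / total
        k = int(np.searchsorted(explained, var_threshold)) + 1
        k = min(k, len(s))
        blocks.append(U[:, :k] * s[:k])               # scores of the local PCA model
    if not blocks:
        return np.empty((X.shape[0], 0))
    return np.hstack(blocks)
```

The returned score matrix has one row per sample and far fewer columns than the raw profile whenever the windows are strongly collinear. A variable selection method such as forward selection or a genetic algorithm would then search over these score columns rather than over the original variables.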
- Subjects :
- Applied Mathematics
Feature selection
Overfitting
Linear discriminant analysis
chemometrics
Analytical Chemistry
classification
CHIM/01 - CHIMICA ANALITICA
Genetic algorithm
Statistics
Partial least squares regression
Principal component analysis
data reduction
wine
GC-MS
Selection (genetic algorithm)
Mathematics
Data reduction
variable selection
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....ad454f6a7fbde6f881fe59dce98cd1da